WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) Internal! nal Patent Classification 6 : 

C12N 15/31, C07K 14/315, 16/12, C12Q 
1/68 



A2 



(11) Interna U nal Publlcati n Number: 
(43) International Publics ti n Date: 



WO 98/18931 

7 May 1998 (07.05.98) 



(21) International Application Number: PCT/US97/ 19588 

(22) International Filing Date: 30 October 1997 (30.10.97) 



US 



(tl^Applicant (for all de^gna^d^Stesn^ept US): HUMAN 
mNOWHE--SCrENCES, INC. (US/US]; 9410 Key West 




Avenue, Rockville, MD 20850 (US), 



(72) Inventors; and 

(75) Inventors/Applicants (for US only): KUNSCH, Charles, A. 
[US/US); 2398B Donwoody Crossing, Atlanta, GA 30338 
(US). CHOI, Gil. H. [KR/US]; 1 1429 Potomac Oaks E)rive, 
Rockville, MD 20850 (US). DILLON. Patrick. J. [US/US]; 
1055 Snipe Court, Carlsbad, CA 92009 (US). ROSEN, 
Craig, A. [US/US]; 22400 Rolling Hill Road, Laytonsville, 
MD 20882 (US). BARASH. Steven, C. [US/US]; 582 Col- 
lege Parkway #303, Rockville, MD 20850 (US). FAN- 
NON, Michael [US/US]; 13501 Rippling Brook Drive. Sil- 
ver Spring. MD 20850 (US). DOUGHERTY, Brian. A. 
[US/US]; 708 Meadow Field Court, Mount Airy, MD 21771 
(US). 



(74) Agents: BROOKES, A., Anders et al.; Human Genome 
Sciences. Inc., 9410 Key West Avenue, Rockville, MD 
20850 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU. CZ, DE. DK, EE, ES, FI, GB, GE, 
GH, HU. ID. IL. IS. JP, KE, KG, KP, KR. KZ, LC. UC, 
LR, LS, LT, LU, LV, MD. MG, MK, MN, MW, MX, NO, 
NZ. PL, PT. RO. RU. SD. SE, SG, SI. SK. SL. 77. TM, TR. 
TT. UA, UG, US, UZ, VN, YU, ZW. ARIPO patent (GH, 
KE. LS. MW. SD, SZ, UG, ZW). Eurasian patent (AM. AZ, 
BY, KG. KZ, MD. RU, TJ, TM), European patent (AT, BE, 
CH. DE. DK, ES. FI, FR, GB, GR, IE, IT. LU, MC, NL, 
PT, SE), OAPI patent (BF, BJ; CF, CG, CI, CM, GA, GN, 
ML, MR, NE, SN, TD, TG). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: STREPTOCOCCUS PNEUMONIAE POLYNUCLEOTIDES AND SEQUENCES 



Computer System 102 
\ 



nus 



104 



106 

w 



Main Memory 



108 



Secondary Storage 



yv/ Devices HO 



Hard Drive 



L>4 



Removable Medium 
Storage Device 



116 



Removable Storage 
Medium 



(57) Abstract 

The prcseni invention provides polynucleotide sequences of the genome of Streptococcus pneumoniae, polypeptide sequences encoded 
by the polynucleotide sequences, corresponding polynucleotides and polypeptide, vectors and hosts comprising the polynucleotides, and 
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Streptococcus pneumoniae Polynucleotides and Sequences 

FTFI.D OF THE INVENTION 

5 The present invention relates to the field of molecular biology. In 

particular, it relates to, among other things, nucleotide sequences of Streptococcus 
pneumoniae* contigs, ORFs, fragments, probes, primers and related 
polynucleotides thereof, peptides and polypeptides encoded by the sequences, and 
uses of the polynucleotides and sequences thereof, such as in fermentation, 
10 polypeptide production, assays and pharmaceutical development, among others. 

BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae has been one of the most extensively studied 

15 microorganisms since its first isolation in 1881. It was the object of many 
investigations that led to imponant scientific discoveries. In 1928, Griffith 
observed that when heat-killed encapsulated pneumococci and live strains 
constitutively lacking any capsule were concomitantly injected into mice, the 
noncncapsulated could be converted into encapsulated pneumococci with the same 

20 capsular type as the heat-killed strain. Years later, the nature of this "transforming 
principle," or carrier of genetic information, was shown to be DNA. (Avery, O.T., 
etaLJ. Exp. Med.. 79:137-157 (1944)). 

In spite of the vast number of publications on 5. pneumoniae many 
questions about its virulence are still unanswered, and this pathogen remains a 

25 major causative agent of serious human disease, especially community-acquired 
pneumonia. (Johnston, R.B., et Rev. Infect. Dis. 7J(Suppl. 6):S509-517 
( 1991)). In addition, in developing countries, the pneumococcus is responsible for 
the death of a large number of children under the age of 5 years from pneumococcal 
pneumonia. The incidence of pneumococcal disease is highest in infants under 2 

30 years of age and in people over 60 years of age. Pneumococci are the second most 
frequent cause (after Haemophilus influenzae type b) of bacterial meningitis and 
otitis media in children. With the recent introduction of conjugate vaccines for H. 
influenzae type b, pneumococcal meningitis is likely to become increasingly 
prominent. S. pneumoniae is the most imponant etiologic agent of community- 
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acquired pneumonia in adults and is the second most common cause of bacterial 
meningitis behind Neisseria meningitidis. 

The antibiotic generally prescribed to treat S. pneumoniae is 
benzylpenicillin, although resistance to this and to other antibiotics is found 

5 occasionally. Pneumococcal resistance to penicillin results from mutations in its 
penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused by 
a sensitive strain, treatment with penicillin is usually successful unless started too 
late. Erythromycin or clindamycin can be used to treat pneumonia in patients 
hypersensitive to penicillin, but resistant strains to these drugs exist. Broad 

10 spectrum antibiotics (e.g., the tetracyclines) may also be effective, although 
tetracycline-resistant strains are not rare. In spite of the availability of antibiotics, 
the mortality of pneumococcal bacteremia in the last four decades has remained 
stable between 25 and 29%. (Gillespie, S.H., et a/., /. Med. Microbiol. 25:237- 
248 (1989). 

15 S. pneumoniae is carried in the upper respiratory tract by many healthy 

individuals. It has been suggested thai attachment of pneumococci is mediated by a 
disaccharide receptor on fibronectin. present on human pharyngeal epithelial cells. 
(Anderson, B.J., era/., J. Immunol. / 42:2464-2468 (1989). The mechanisms by 
which pneumococci translocate from the nasopharynx to the lung, thereby causing 

20 pneumonia, or migrate to the blood, giving rise to bacteremia or septicemia, are 
poorly understood. (Johnston, R.B., et a/.. Rev. Infect. Dis. 73(Suppi. 6):S509- 
517(1991). 

Various proteins have been suggested to be involved in the pathogenicity of 
S. pneumoniae, however, only a few of them have actually been confirmed as 

25 virulence factors. Pneumococci produce an IgAl protease that might interfere with 
host defense at mucosal surfaces. (Korntleld, S.J., et a/., Rev. Inf. Dis. J:52l- 
534 (1981). S. pneumoniae also produces neuraminidase, an enzyme that may 
facilitate attachment to epithelial cells by cleaving sialic acid from the host 
glycolipids and gangliosides. Partially purified neuraminidase was observed to 

30 induce meningitis-like symptoms in mice: however, the reliability of this finding 
has been questioned because the neuraminidase preparations used were probably 
contaminated with cell wall products. Other pneumococcal proteins besides 
neuraminidase are involved in the adhesion of pneumococci to epithelial and 
endothelial cells. These pneumococcal proteins have as yet not been identified. 

35 Recently, Cundell et. al. % reported that peptide permeases can modulate 
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pneumococcal adherence to epithelial and endothelial cells. It was, however, 
unclear whether these permeases function directly as adhesions or whether they 
enhance adherence by modulating the expression of pneumococcal adhesions. 
(DeVelasco, E.A., etaL Micro. Rev. 59:591-603 (1995). A better understanding 
5 of the virulence factors determining its pathogenicity will need to be developed to 
cope with the devastating effects of pneumococcal disease in humans. 

Ironically, despite the prominent role of 5. pneumoniae in the discovery of 
DNA, little is known about the molecular genetics of the organism. The S. 
pneumoniae genome consists of one circular, covalently closed, double-stranded 

10 DNA and a collection of so-called variable accessory elements, such as prophages, 
plasinids, transposons and the like. Most physical characteristics and almost all of 
the genes of 5. pneumoniae are unknown. Among the few that have been 
identified, most have not been physically mapped or characterized in detail. Only a 
few genes of this organism have been sequenced. (See, for instance current 

15 versions of GENBANK and other nucleic acid databases, and references that relate 
to the genome of 5. pneumoniae such as those set out elsewhere herein.) 

It is clear that the etiology of diseases mediated or exacerbated by 5. 
pneumoniae, infection involves the programmed expression of 5. pneumoniae 
genes, and that characterizing the genes and their patterns of expression would add 

20 dramatically to our understanding of the organism and its host interactions. 
Knowledge of S. pneumoniae genes and genomic organization would improve our 
understanding of disease etiology and lead to improved and new ways of 
preventing, ameliorating, arresting and reversing diseases. Moreover, 
characterized genes and genomic fragments of S. pneumoniae would provide 

25 reagents for, among other things, detecting, characterizing and controlling 5. 
pneumoniae infections. There is a need to characterize the genome of S. 
pneumoniae and for polynucleotides of this organism. 
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SUMMARY OF T HE INVENTION 

The present invention is based on the sequencing of fragments of the 
5 Streptococcus pneumoniae genome. The primary nucleotide sequences which were 
generated are provided in SEQ ID NOS: 1-391. 

The present invention provides the nucleotide sequence of several hundred 
contigs of the Streptococcus pneumoniae genome, which are listed in tables below 
and set out in the Sequence Listing submitted herewith* and representative 
io fragments thereof, in a form which can be readily used, analyzed, and interpreted 
by a skilled artisan. In one embodiment, the present invention is provided as 
contiguous strings of primary sequence information corresponding to the 
nucleotide sequences depicted in SEQ ID NOS: 1-391. 

The present invention further provides nucleotide sequences which are at 
1 5 least 95% identical to the nucleotide sequences of SEQ ID NOS: I -39 1 . 

The nucleotide sequence of SEQ ID NOS: 1-391, a representative fragment 
thereof, or a nucleotide sequence which is at least 95% identical to the nucleotide 
sequence of SEQ ID NOS: 1-391 may be provided in a variety of mediums to 
facilitate its use. In one application of this embodiment, the sequences of the 
20 present invention are recorded on computer readable media. Such media includes, 
but is not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape: optical storage media such as CD-ROM: 
electrical storage media such as RAM and ROM; and hybrids of these categories 
such as magnetic/optical storage media. 
25 The present invention further provides systems, particularly computer- 

based systems which contain the sequence information herein described stored in a 
data storage means. Such systems are designed to identify commercially important 
fragments of the Streptococcus pneumoniae genome. 

Another embodiment of the present invention is directed to fragments of the 
30 Streptococcus pneumoniae genome having particular structural or functional 
attributes. Such fragments of the Streptococcus pneumoniae genome of the present 
invention include, but are not limited to, fragments which encode peptides, 
hereinafter referred to as open reading frames or ORFs. fragments which modulate 
the expression of an operably linked ORF, hereinafter referred to as expression 
35 modulating fragments or EMFs. and fragments which can be used to diagnose the 
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presence of Streptococcus pneumoniae in a sample, hereinafter referred to as 
diagnostic fragments or DFs. 

Each of the ORFs in fragments of the Streptococcus pneumoniae genome 
disclosed in Tables 1-3, and the EMFs found 5* to the ORFs, can be used in 
5 numerous ways as polynucleotide reagents. For instance, the sequences can be 
used as diagnostic probes or amplification primers for detecting or determining ihe 
presence of a specific microbe in a sample, to selectively control gene expression in 
a host and in the production of polypeptides, such as polypeptides encoded by 
ORFs of the present invention, particular those polypeptides that have a 

to pharmacological activity. 

The present invention further includes recombinant constructs comprising 
one or more fragments of the Streptococcus pneumoniae genome of the present 
invention. The recombinant constructs of the present invention comprise vectors, 
such as a plasmid or viral vector, into which a fragment of the Streptococcus 

15 pneumoniae has been inserted. 

The present invention further provides host cells containing any of the 
isolated fragments of the Streptococcus pneumoniae genome of the present 
invention. The host cells can be a higher eukaryotic host cell, such as a mammalian 
cell, a lower eukaryotic cell, such as a yeast cell, or a procaryotic cell such as a 

20 bacterial cell. 

The present invention is further directed to isolated polypeptides and 
proteins encoded by ORFs of the present invention. A variety of methods, well 
known to those of skill in the art, routinely may be utilized to obtain any of the 
polypeptides and proteins of the present invention. For instance, polypeptides and 

25 proteins of the present invention having relatively short, simple amino acid 
sequences readily can be synthesized using commercially available automated 
peptide synthesizers. Polypeptides and proteins of the present invention also may 
be purified from bacterial cells which naturally produce the protein. Yet another 
alternative is to purify polypeptide and proteins of the present invention from cells 

30 which have been altered to express them. 

The invention further provides methods of obtaining homologs of the 
fragments of the Streptococcus pneumoniae genome of the present invention and 
homologs of the proteins encoded by the ORFs of the present invention. 
Specifically, by using the nucleotide and amino acid sequences disclosed herein as 
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a probe or as primers, and techniques such asJJCR cloning and colony/plaque 
hybridization, one skilled in the art can obtain homoiogs. 

The invention further provides antibodies which selectively bind 
polypeptides and proteins of the present invention. Such antibodies include both 
5 monoclonal and polyclonal antibodies. 

The invention further provides hybridomas which produce the above- 
described antibodies. A hybridoma is an immortalized cell line which is capable of 
secreting a specific monoclonal antibody. 

The present invention further provides methods of identifying test samples 
10 derived from cells which express one of the ORFs of the present invention, or a 
homolog thereof. Such methods comprise incubating a test sample with one or 
more of the antibodies of the present invention, or one or more of the DFs of the 
present invention, under conditions which allow a skilled artisan to determine if the 
sample contains the ORF or product produced therefrom. 
15 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the above -described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
comprising one of the antibodies, or one of the DFs of the present invention; and 
20 (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of bound antibodies or hybridized 
DFs. 

Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents capable of binding to 

25 a polypeptide or protein encoded by one of the ORFs of the present invention. 
Specifically, such agents include, as further described below, antibodies, peptides, 
carbohydrates, pharmaceutical agents and the like. Such methods comprise steps 
of: (a) contacting an agent with an isolated protein encoded by one of the ORFs of 
the present invention; and (b) determining whether the agent binds to said protein. 

30 The present genomic sequences of Streptococcus pneumoniae will be of 

great value to all laboratories working with this organism and for a variety of 
commercial purposes. Many fragments of the Streptococcus pneumoniae genome 
will be immediately identified by similarity searches against GenBank or protein 
databases and will be of immediate value to Streptococcus pneumoniae researchers 
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and for immediate commercial value for the production of proteins or to control 
gene expression. 

The methodology and technology for elucidating extensive genomic 
sequences of bacterial and other genomes has and will greatly enhance the ability to 
5 analyze and understand chromosomal organization. In particular, sequenced 
contigs and genomes will provide the models for developing tools for the analysis 
of chromosome structure and function, including the ability to identify genes within 
large segments of genomic DNA, the structure, position, and spacing of regulatory 
elements, the identification of genes with potential industrial applications, and the 
10 ability to do comparative genomic and molecular phyiogeny. 

DESCRIPTION OF THE FIGURES 

FIGURE 1 is a block diagram of a computer system (102) that can be 
15 used to implement computer-based systems of present invention. 

FIGURE 2 is a schematic diagram depicting the data flow and computer 
programs used to collect, assemble, edit and annotate the contigs of the 
Streptococcus pneumoniae genome of the present invention. Both Macintosh and 

20 Unix platforms are used to handle the AB 373 and 377 sequence data files, largely 
as described in Kerlavage et aL % Proceedings of the Twenty-Sixth Annual Hawaii 
International Conference on System Sciences. 585, IEEE Computer Society Press, 
Washington D.C. (1993). Factura (AB) is a Macintosh program designed for 
automatic vector sequence removal and end-trimming of sequence files. The 

25 program Loadis runs on a Macintosh platform and parses the feature data extracted 
from the sequence files by Factura to the Unix based Streptococcus pneumoniae 
relational database. Assembly of contigs (and whole genome sequences) is 
accomplished by retrieving a specific set of sequence files and their associated 
features using Extrseq, a Unix utility for retrieving sequences from an SQL 

30 database. The resulting sequence file is processed by seq_filter to trim portions of 
the sequences with more than 2% ambiguous nucleotides. The sequence files were 
assembled using TIGR Assembler, an assembly engine designed at The Institute 
for Genomic Research ( TIGR ) for rapid and accurate assembly of thousands of 
sequence fragments. The collection of contigs generated by the assembly step is 

35 loaded inro the database with the lassie program. Identification of open reading 



BNSDOCID: <WO 9818931A2 L> 



WO 98/18931 



8 



PCT/US97/19588 



frames (ORFs) is accomplished by processing contigs with zorf or GenMark. The 
ORFs are searched against S. pneumoniae sequences from GenBank and against all 
protein sequences using the BLASTN and BLASTP programs, described in 
Altschul et aL, J. MoL Biol. 215: 403-410 (1990)). Results of the ORF 
5 determination and similarity searching steps were loaded into the database. As 
described below, some results of the determination and the searches are set out in 
Tables 1-3. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

10 

The present invention is based on the sequencing of fragments of the 
Streptococcus pneumoniae genome and analysis of the sequences. The primary 
nucleotide sequences generated by sequencing the fragments are provided in SEQ 
ID NOS: 1-391. (As used herein, the "primary sequence" refers to the nucleotide 

15 sequence represented by the IUPAC nomenclature system.) 

In addition to the aforementioned Streptococcus pneumoniae polynucleotide 
and polynucleotide sequences, the present invention provides the nucleotide 
sequences of SEQ ID NOS: 1-391, or representative fragments thereof, in a form 
which can be readily used, analyzed, and interpreted by a skilled artisan. 

20 As used herein, a "representative fragment of the nucleotide sequence 

depicted in SEQ ID NOS: i -391" refers to any portion of the SEQ CD NOS: I -39 1 
which is not presently represented within a publicly available database. Preferred 
representative fragments of the present invention are Streptococcus pneumoniae 
open reading frames ( ORFs ), expression modulating fragment ( EMFs ) and 

25 fragments which can be used to diagnose the presence of Streptococcus 
pneumoniae in sample ( DFs ). A non-limiting identification of preferred 
representative fragments is provided in Tables 1-3. As discussed in detail below, 
the information provided in SEQ ED NOS: 1-391 and in Tables 1-3 together with 
routine cloning, synthesis, sequencing and assay methods will enable those skilled 

30 in the art to clone and sequence all "representative fragments" of interest, including 
open reading frames encoding a large variety of Streptococcus pneumoniae 
proteins. 

While the presently disclosed sequences of SEQ ID NOS: 1-39 1 are highly 
accurate, sequencing techniques are not perfect and, in relatively rare instances, 
35 further investigation of a fragment or sequence of the invention may reveal a 
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nucleotide sequence error present in a nucleotide sequence disclosed in SEQ ID 
NOS: 1-391. However, once the present invention is made available {i.e., once the 
information in SEQ ID NOS: 1-391 and Tables 1-3 has been made available), 
resolving a rare sequencing error in SEQ ID NOS: 1-391 will be well within the 
5 skill of the art. The present disclosure makes available sufficient sequence 
information to allow any of the described contigs or portions thereof to be obtained 
readily by straightforward application of routine techniques. Further sequencing of 
such polynucleotide may proceed in like manner using manual and automated 
sequencing methods which are employed ubiquitous in the art. Nucleotide 

10 sequence editing software is publicly available. For example. Applied Biosy stem's 
(AB) AutoAssembler can be used as an aid during visual inspection of nucleotide 
sequences. By employing such routine techniques potential errors readily may be 
identified and the correct sequence then may be ascertained by targeting further 
sequencing effort, also of a routine nature, to the region containing the potential 

15 error. 

Even if all of the very rare sequencing errors in SEQ ED NOS: 1-391 were 
corrected, the resulting nucleotide sequences would still be at least 95% identical, 
nearly all would be at least 99% identical, and the great majority would be at least 
99.9% identical to the nucleotide sequences of SEQ ID NOS: 1-391. 

20 As discussed elsewhere herein, polynucleotides of the present invention 

readily may be obtained by routine application of well known and standard 
procedures for cloning and sequencing DNA. Detailed methods for obtaining 
libraries and for sequencing are provided below, for instance. A wide variety of 
Streptococcus pneumoniae strains that can be used to prepare S. pneumoniae 

25 genomic DNA for cloning and for obtaining polynucleotides of the present 
invention are available to the public from recognized depository institutions, such 
as the American Type Culture Collection ( ATCC ). While the present invention is 
enabled by the sequences and other information herein disclosed, the S, 
pneumoniae strain that provided the DNA of the present Sequence Listing, Strain 

30 7/87 14.8.91, has been deposited in the ATCC, as a convenience to those of skill 
in the art. As a further convenience, a library of S. pneumoniae genomic DNA. 
derived from the same strain, also has been deposited in the ATCC. The S. 
pneumoniae strain was deposited on October 10, 1996, and was given Deposit No. 
55840. and the cDNA library was deposited on October 1 1, 1996 and was given 

35 Deposit No. 97755. The genomic fragments in the library are 15 to 20 kb 
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fragments generated by partial Sau3Al digestion and they are inserted into the 
BamHI site in the well-known lambda-derived vector lambda DASH II (Stratagene, 
La Jolla, CA). The provision of the deposits is not a waiver of any rights of the 
inventors or their assignees in the present subject matter. 
5 The nucleotide sequences of the genomes from different strains of 

Streptococcus pneumoniae differ somewhat. However, the nucleotide sequences 
of the genomes of all Streptococcus pneumoniae strains will be at least 95% 
identical, in corresponding pan, to the nucleotide sequences provided in SEQ ID 
NOS: 1-391. Nearly all will be at least 99% identical and the great majority will be 

10 99.9% identical. 

Thus, the present invention further provides nucleotide sequences which 
are at least 95%, preferably 99% and most preferably 99.9% identical to the 
nucleotide sequences of SEQ ID NOS: 1-391, in a form which can be readily used, 
analyzed and interpreted by the skilled artisan. 

15 Methods for determining whether a nucleotide sequence is at least 95%, at 

least 99% or at least 99.9% identical to the nucleotide sequences of SEQ ID 
NOS: 1-391 are routine and readily available to the skilled artisan. For example, the 
well known fasta algorithm described in Pearson and Lipman, Proc. Natl. Acad. 
ScL USA 85: 2444 (1988) can be used to generate the percent identity of nucleotide 

20 sequences. The BLASTN program also can be used to generate an identity score 
of poiynucieotides compared to one another. 

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS: 1-391, a representative 
25 fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% 
and most preferably at least 99.9% identical to a polynucleotide sequence of SEQ 
ID NOS: 1-391 may be "provided" in a variety of mediums to facilitate use thereof. 
As used herein, provided refers to a manufacture, other than an isolated nucleic 
acid molecule, which contains a nucleotide sequence of the present invention; i.e., 
30 a nucleotide sequence provided in SEQ ID NOS: 1-391, a representative fragment 
thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most 
preferably at least 99.9% identical to a polynucleotide of SEQ ID NOS: 1-391. 
Such a manufacture provides a large portion of the Streptococcus pneumoniae 
genome and pans thereof a Streptococcus pneumoniae open reading frame 

35 (ORF)) in a form which allows a skilled anisan to examine the manufacture using 
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means not directly applicable to examining the Streptococcus pneumoniae genome 
or a subset thereof as it exists in nature or in purified form. 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
5 readable media" refers to any medium which can-be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, 
such as floppy discs, hard disc storage medium, and magnetic tape; optical storage 
media such as CD- ROM; electrical storage media such as RAM and ROM; and 
hybrids of these categories, such as magnetic/optical storage media. A skilled 

10 artisan can readily appreciate how any of the presently known computer readable 
mediums can be used to create a manufacture comprising computer readable 
medium having recorded thereon a nucleotide sequence of the present invention. 
Likewise, it will be clear to those of skill how additional computer readable media 
that may be developed also can be used to create analogous manufactures having 

15 recorded thereon a nucleotide sequence of the present invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently 
know methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present 

20 invention. A variety of data storage structures are available to a skilled artisan 
for creating a computer readable medium having recorded thereon a nucleotide 
sequence of the present invention. The choice of the data storage structure will 
generally be based on the means chosen to access the stored information. In 
addition, a variety of data processor programs and formats can be used to store the 

25 nucleotide sequence information of the present invention on computer readable 
medium. The sequence information can be represented in a word processing text 
file, formatted in commercially- available software such as WordPerfect and 
Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily 

30 adapt any number of data-processor structuring formats (e.g.. text file or database) 
in order to obtain computer readable medium having recorded thereon the 
nucleotide sequence information of the present invention. 

Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Thus, by 

35 providing in computer readable form (he nucleotide sequences of SEQ ID NOS:l- 
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391, a representative fragment thereof, or a nucleotide sequence at least 95%, 
preferably at least 99% and most preferably at least 99.9% identical to a sequence 
of SEQ ID NOS: 1-391 the present invention enables the skilled artisan routinely to 
access the provided sequence information for a wide variety of purposes. 
5 The examples which follow demonstrate how software which implements 

the BLAST (Altschul et at., 7. Mol Bioi 275:403-410 (1990)) and BLAZE 
(Brutlag et aL % Comp. Chem. 77:203-207 (1993)) search algorithms on a Sybase 
system was used to identify open reading frames (ORFs) within the Streptococcus 
pneumoniae genome which contain homology to ORFs or proteins from both 
10 Streptococcus pneumoniae and from other organisms. Among the ORFs discussed 
herein are protein encoding fragments of the Streptococcus pneumoniae genome 
useful in producing commercially important proteins, such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

The present invention further provides systems, particularly computer- 
15 based systems, which contain the sequence information described herein. Such 
systems are designed to identify, among other things, commercially imponant 
fragments of the Streptococcus pneumoniae genome. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 
20 information of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing 
unit (CPU), input means, output means, and data storage means. A skilled artisan 
can readily appreciate that any one of the currently available computer-based 
systems are suitable for use in the present invention. 
25 As stated above, the computer-based systems of the present invention 

comprise a data storage means having stored therein a nucleotide sequence of the 
present invention and the necessary hardware means and software means for 
supporting and implementing a search means. 

As used herein, "data storage means" refers to memory which can store 
30 nucleotide sequence information of the present invention, or a memory access 
means which can access manufactures having recorded thereon the nucleotide 
sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which arc 
implemented on the computer-based system to compare a target sequence or target 
35 structural motif with the sequence information stored within the data storage 
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means. Search means are used to identify fragments or regions of the present 
genomic sequences which match a particular target sequence or target motif. A 
variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the 
5 computer-based systems of the present invention. Examples of such software 
includes, but is not limited to, MacPattem (EMBL), BLASTN and BLASTX 
(NCBIA). A skilled artisan can readily recognize that any one of the available 
algorithms or implementing software packages for conducting homology searches 
can be adapted for use in the present computer-based systems. 

10 As used herein, a "target sequence" can be any DNA or amino acid 

sequence of six or more nucleotides or two or more amino acids. A skilled artisan 
can readily recognize that the longer a target sequence is. the less likely a target 
sequence will be present as a random occurrence in the database. The most 
preferred sequence length of a target sequence is from about 10 to 100 amino acids 

!5 or from about 30 to 300 nucleotide residues. However, it is well recognized that 
searches for commercially important fragments, such as sequence fragments 
involved in gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any 
rationally selected sequence or combination of sequences in which the sequence(s) 

20 are chosen based on a three-dimensional configuration which is formed upon the 
folding of the target motif. There arc a variety of target motifs known in the art. 
Protein target motifs include, but are not limited to, enzymic active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, promoter 
sequences, hairpin structures and inducible expression elements (protein binding 

25 sequences). 

A variety of structural formats for the input and output means can be used 
to input and output the information in the computer-based systems of the present 
invention. A preferred format for an output means ranks fragments of the 
Streptococcus pneumoniae genomic sequences possessing varying degrees of 

30 homology to the target sequence or target motif. Such presentation provides a 
skilled artisan with a ranking of sequences which contain various amounts of the 
target sequence or target motif and identifies the degree of homology contained in 
the identified fragment. 

A variety of comparing means can be used to compare a target sequence or 

35 target motif with the data storage means to identify sequence fragments of the 
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Streptococcus pneumoniae genome. In the present examples, implementing 
software which implement the BLAST and BLAZE algorithms, described in 
Altschul et a/., /. MoL Biol. 215: 403-410 (1990), is used to identify open reading 
frames within the Streptococcus pneumoniae genome. A skilled artisan can readily 
5 recognize that any one of the publicly available homology search programs can be 
used as the search means for the computer-based systems of the present invention. 
Of course, suitable proprietary systems that may be known to those of skill also 
may be employed in this regard. 

Figure I provides a block diagram of a computer system illustrative of 

10 embodiments of this aspect of present invention. The computer system 102 
includes a processor 106 connected to a bus 104. Also connected to the bus 104 
are a main memory 108 (preferably implemented as random access memory, RAM) 
and a variety of secondary storage devices 1 1 0, such as a hard drive 1 1 2 and a 
removable medium storage device 1 14. The removable medium storage device 1 14 

15 may represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape 
drive, etc. A removable storage medium 1 16 (such as a floppy disk, a compact 
disk, a magnetic tape, ere.) containing control logic and/or data recorded therein 
may be inserted into the removable medium storage device 1 14. The computer 
system 102 includes appropriate software for reading the control logic and/or the 

20 data from the removable medium storage device 1 14, once it is inserted into the 
removable medium storage device 1 14. 

A nucleotide sequence of the present invention may be stored in a well 
known manner in the main memory 108, any of the secondary storage devices 1 10, 
and/or a removable storage medium 1 16. During execution, software for accessing 

25 and processing the genomic sequence (such as search tools, comparing tools, etc.) 
reside in main memory 108, in accordance with the requirements and operating 
parameters of the operating system, the hardware system and the software program 
or programs. 
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BIOCHEMICAL EMBODIMENTS 

Other embodiments of the present invention are directed to isolated 
fragments of the Streptococcus pneumoniae genome. The fragments of the 
5 Streptococcus pneumoniae genome of the present invention include, but are not 
limited to fragments which encode peptides and polypeptides, hereinafter open 
reading frames (ORFs), fragments which modulate the expression of an operably 
linked ORF, hereinafter expression modulating fragments (EMFs) and fragments 
which can be used to diagnose the presence of Streptococcus pneumoniae in a 

10 sample, hereinafter diagnostic fragments (DFs). 

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment 
of the Streptococcus pneumoniae genome" refers to a nucleic acid molecule 
possessing a specific nucleotide sequence which has been subjected to purification 
means to reduce, from the composition, the number of compounds which are 

15 normally associated with the composition. Particularly, the term refers to the 
nucleic acid molecules having the sequences set out in SEQ ID NOS: 1-391, to 
representative fragments thereof as described above, to polynucleotides at least 
95%, preferably at least 99% and especially preferably at least 99.9% identical in 
sequence thereto, also as set out above. 

20 A variety of purification means can be used to generate the isolated 

fragments of the present invention. These include, but are not limited to methods 
which separate constituents of a solution based on charge, solubility, or size. 

In one embodiment. Streptococcus pneumoniae DNA can be enzymatically 
sheared to produce fragments of 15-20 kb in length. These fragments can then be 

25 used to generate a Streptococcus pneumoniae library by inserting them into lambda 
clones as described in the Examples below. Primers flanking, for example, an 
ORF, such as those enumerated in Tables 1-3 can then be generated using 
nucleotide sequence information provided in SEQ ID NOS: 1-391. Well known 
and routine techniques of PCR cloning then can be used to isolate the ORF from 

30 the lambda DNA library or Streptococcus pneumoniae genomic DNA. Thus, given 
the availability of SEQ ID NOS: 1-391, the information in Tables 1, 2 and 3, and 
the information that may be obtained readily by analysis of the sequences of SEQ 
lDNOS:l-391 using methods set out above, those of skill will be enabled by the 
present disclosure to isolate any ORF-containing or other nucleic acid fragment of 

35 the present invention. 
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The isolated nucleic acid molecules of the present invention include, but arc 
not limited to single stranded and double stranded DNA, and single stranded RNA. 

As used herein, an "open reading frame," ORF, means a series of triplets 
coding for amino acids without any termination codons and is a sequence 
5 translatable into protein. 

Tables 1, 2, and 3 list ORFs in the Streptococcus pneumoniae genomic 
contigs of the present invention that were identified as putative coding regions by 
the GeneMark software using organism-specific second-order Markov probability 
transition matrices. It will be appreciated that other criteria can be used, in 
10 accordance with well known analytical methods, such as those discussed herein, to 
generate more inclusive, more restrictive, or more selective lists. 

Table 1 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that over a continuous region of at least 50 bases are 95% or 
more identical (by BLAST analysis) to a nucleotide sequence available through 
15 GenBank in October, 1997. 

Table 2 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that are not in Table 1 and match, with a BLASTP probability 
score of 0.01 or less, a polypeptide sequence available through GenBank in 
October, 1997. 

20 Table 3 sets out ORFs in the Streptococcus pneumoniae contigs of the 

present invention that do not match significantly, by BLASTP analysis, a 
polypeptide sequence available through GenBank in October, 1997. 

In each table, the first and second columns identify the ORF by, 
respectively, contig number and ORF number within the contig; the third column 

25 indicates the first nucleotide of the ORF (actually the first nucleotide of the stop 
codon immediately proceeding the ORF), counting from the 5* end of the contig 
strand; and the fourth column, "stop (nl)" indicates the last nucleotide of the stop 
codon defining the 3'end of the ORF. 

In Tables 1 and 2, column five, lists the Reference for the closest 

30 matching sequence available through GenBank. These reference numbers are the 
databases entry numbers commonly used by those of skill in the an, who will be 
familiar with their denominators. Descriptions of the nomenclature are available 
from the National Center for Biotechnology Information. Column six in Tables 1 
and 2 provides the gene name of the matching sequence; column seven provides 

35 the BLAST identity score and column eight the BLAST similarity score from the 
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comparison of the ORF and the homologous gene: and column nine indicates the 
length in nucleotides of the highest scoring segment pair identified by the BLAST 
identity analysis. 

Each ORF described in the tables is defined by "start (nt)" (5') and "stop 
5 (nt)" (3 f ) nucleotide position numbers. These position numbers refer to the 
boundaries of each ORF and provide orientation with respect to whether the 
forward or reverse strand is the coding strand and which reading frame the coding 
sequence is contained. The "start" position is the first nucleotide of the triplet 
encoding a stop codon just 5* to the ORF and the "stop" position is the last 

10 nucleotide of the triplet encoding the next in-frame stop codon (i.e., the stop codon 
at the 3' end of the ORF). Those of ordinary skill in the art appreciate thai 
preferred fragments within each ORF described in the table include fragments of 
each ORF which include the entire sequence from the delineated "start" and "stop" 
positions excepting the first and last three nucleotides since these encode stop 

!5 codons. Thus, polynucleotides set out as ORFs in the tables but lacking the three 
(3) 5' nucleotides and the three (3) 3* nucleotides are encompassed by the present 
invention. Those of skill also appreciate that particularly preferred are fragments 
within each ORF that are polynucleotide fragments comprising polypeptide coding 
sequence. As defined herein, "coding sequence" includes the fragment within an 

20 ORF beginning at the first in-framc ATG (triplet encoding methionine) and ending 
with the last nucieotide prior to the triplet encoding the 3' stop codon. Preferred 
are fragments comprising the entire coding sequence and fragments comprising the 
entire coding sequence, excepting the coding sequence for the N-tenninal 
methionine. Those of skill appreciate that the N-terminal methionine is often 

25 removed during post-translational processing and that polynucleotides lacking the 
ATG can be used to facilitate production of N-termainal fusion proteins which may 
be benefical in the production or use of genetically engineered proteins. Of course, 
due to the degeneracy of the genetic code many polynucleotides can encode a given 
polypeptide. Thus, the invention further includes polynucleotides comprising a 

30 nucleotide sequence encoding a polypeptide sequence itself encoded by the coding 
sequence within an ORF described in Tables 1-3 herein. Further, polynucleotides 
at least 95%, preferably at least 99% and especially preferably at least 99.9% 
identical in sequence to the foregoing polynucleotides, are contemplated by the 
present invention. 
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Polypeptides encoded by polynucleotides described above and elsewhere 
herein are also provided by the present invention as are polypeptide comprising a 
an amino acid sequence at least about 95%, preferably at least 97% and even more 
preferably 99% identical to the amino acid sequence of a polypeptide encoded by an 
5 ORF shown in Tables 1-3. These polypeptides may or may not comprise an N- 
terminal methionine. 

The concepts of percent identity and percent similarity of two polypeptide 
sequences is well understood in the art. For example, two polypeptides 10 amino 
acids in length which differ at three amino acid positions (e.g., at positions 1, 3 

10 and 5) are said to have a percent identity of 70%. However, the same two 
polypeptides would be deemed to have a percent similarity of 80% if t for example 
at position 5, the amino acids moieties, although not identical, were "similar" (i.e., 
possessed similar biochemical characteristics). Many programs for analysis of 
nucleotide or amino acid sequence similarity, such as fasta and BLAST specifically 

15 list percent identity of a matching region as an output parameter. Thus, for 
instance. Tables 1 and 2 herein enumerate the percent identity of the highest 
scoring segment pair in each ORF and Us listed relative. Further details 
concerning the algorithms and criteria used for homology searches are provided 
below and are described in the pertinent literature highlighted by the citations 

20 provided below. 

It will be appreciated that other criteria can be used to generate more 
inclusive and more exclusive listings of the types set out in the tables. As those of 
skill will appreciate, narrow and broad searches both are useful. Thus, a skilled 
artisan can readily identify ORFs in contigs of the Streptococcus pneumoniae 

25 genome other than those listed in Tables 1-3, such as ORFs which are overlapping 
or encoded by the opposite strand of an identified ORF in addition to those 
ascertainable using the computer-based systems of the present invention. 

As used herein, an "expression modulating fragment," EMF, means a 
series of nucleotide molecules which modulates the expression of an operably 

30 linked ORF or EMF. 
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As used herein, a sequence is said to "modulate the expression of an 
operably linked sequence" when the expression of the sequence is altered by the 
presence of the EMF. EMFs include, but are not limited to, promoters, and 
promoter modulating sequences (inducible elements). One class of EMFs arc 
5 fragments which induce the expression or an operably linked ORF in response to a 
specific regulatory factor or physiological event. 

EMF sequences can be identified within the contigs of the Streptococcus 
pneumoniae genome by their proximity to the ORFs provided in Tables 1-3. An 
intergenic segment, or a fragment of the intergenic segment, from about 10 to 200 

10 nucleotides in length, taken from any one of the ORFs of Tables 1-3 will modulate 
the expression of an operably linked ORF in a fashion similar to that found with the 
naturally Knked ORF sequence. As used herein, an "intergenic segment" refers to 
fragments of the Streptococcus pneumoniae genome which are between two 
ORF(s) herein described. EMFs also can be identified using known EMFs as a 

15 target sequence or target motif in the computer-based systems of the present 
invention. Further, the two methods can be combined and used together. 

The presence and activity of an EMF can be confirmed using an EMF trap 
vector. An EMF trap vector contains a cloning site linked to a marker sequence. A 
marker sequence encodes an identifiable phenotype, such as antibiotic resistance or 

20 a complementing nutrition auxotrophic factor, which can be identified or assayed 
when the EMF trap vector is placed within an appropriate host under appropriate 
conditions. As described above, a EMF will modulate the expression of an 
operably linked marker sequence. A more detailed discussion of various marker 
sequences is provided below. A sequence which is suspected as being an EMF is 

25 cloned in all three reading frames in one or more restriction sites upstream from the 
marker sequence in the EMF trap vector. The vector is then transformed into an 
appropriate host using known procedures and the phenotype of the transformed 
host in examined under appropriate conditions. As described above, an EMF will 
modulate the expression of an operably linked marker sequence. 

30 As used herein, a "diagnostic fragment," DF, means a series of nucleotide 

molecules which selectively hybridize to Streptococcus pneumoniae sequences. 
DFs can be readily identified by identifying unique sequences within contigs of the 
Streptococcus pneumoniae genome, such as by using well-known computer 
analysis software, and by generating and testing probes or amplification primers 
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consisting of the DF sequence in an appropriate diagnostic format which 
determines amplification or hybridization selectivity. 

The sequences falling within the scope of the present invention are not 
limited to the specific sequences herein described, but also include allelic and 
5 species variations thereof. Allelic and species variations can be routinely 
determined by comparing the sequences provided in SEQ ID NOS; 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95%, preferrably 
at least 99% and most at least preferably 99.9% identical to SEQ ID NOS: 1-391, 
with a sequence from another isolate of the same species. Furthermore, to 
10 accommodate codon variability, the invention includes nucleic acid molecules 
coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF, substitution of one codon 
for another which encodes the same amino acid is expressly contemplated. Any 
specific sequence disclosed herein can be readily screened for errors by 
15 resequcncing a particular fragment, such as an ORF, in both directions (i.e., 
sequence both strands). Alternatively, error screening can be performed by 
sequencing corresponding polynucleotides of Streptococcus pneumoniae origin 
isolated by using part or all of the fragments in question as a probe or primer. 

Preferred DFs of the present invention comprise at least about 17, 
20 preferrably at least about 20, and more preferrably at least about 50 contiguous 
nucleotides within an ORF set out in Tables 1-3. Most highly preferred DFs 
specifically hybridize to a polynucleotide containing the sequence of the ORF from 
which they are derived. Specific hybridization occurs even under stringent 
conditions defined elsewhere herein. 
25 Each of the ORFs of the Streptococcus pneumoniae genome disclosed in 

Tables 1, 2 and 3, and the EMFs found 5' to the ORFs, can be used as 
polynucleotide reagents in numerous ways. For example, the sequences can be 
used as diagnostic probes or diagnostic amplification primers to detect the presence 
of a specific microbe in a sample, particularly Streptococcus pneumoniae. 
30 Especially preferred in this regard are ORFs such as those of Table 3, which do not 
match previously characterized sequences from other organisms and thus are most 
likely to be highly selective for Streptococcus pneumoniae. Also particularly 
preferred are ORFs that can be used to distinguish between strains of Streptococcus 
pneumoniae, particularly those that distinguish medically important strain, such afc 
35 drug-resistant strains. 
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In addition, the fragments of the present invention, as broadly described, 
can be used to control gene expression through triple helix formation or antisense 
DNA or RNA, both of which methods are based on the binding of a polynucleotide 
sequence to DNA or RNA. Triple helix-formation optimally results in a shut-off of 
5 RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Information from the 
sequences of the present invention can be used to design antisense and triple helix- 
forming oligonucleotides. Polynucleotides suitable for use in these methods are 
usually 20 to 40 bases in length and are designed to be complementary to a region 

10 of the gene involved in transcription, for triple-helix formation, or to the mRNA 
itself, for antisense inhibition. Both techniques have been demonstrated to be 
effective in model systems, and the requisite techniques are well known and 
involve routine procedures. Triple helix techniques arc discussed in, for example, 
Lee et al % NucL Acids Res. 6:3073 (1979); Cooney et al % Science 241:456 

15 (1988); and Dervan et aL. Science 257:1360 (1991). Antisense techniques in 
general are discussed in, for instance, Okano, J. Neurochem. 56:560 (1991) and 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression* CRC Press, 
Boca Raton, FL (1988)). 

The present invention further provides recombinant constructs comprising 

20 one or more fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention. Certain preferred recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a 
fragment of the Streptococcus pneumoniae genome has been inserted, in a forward 
or reverse orientation. In the case of a vector comprising one of the ORFs of the 

25 present invention, the vector may further comprise regulatory sequences, including 
for example, a promoter, operably linked to the ORF. For vectors comprising the 
EMFs of the present invention, the vector may further comprise a marker sequence 
or heterologous ORF operably linked to the EMF. 

Large numbers of suitable vectors and promoters are known to those of 

30 skill in the an and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of 
example. Useful bacterial vectors include phagescript, PsiX174, pBluescript SK, 
pBS KS, pNH8a, pNH16a, pNH18a, pNH46a (available from Stratagene); 
pTrc99A, pKK223-3, pKK233-3 t pDR540, pRIT5 (available from Pharmacia). 

35 Useful eukaryotic vectors include pWLneo. pSV2cat, pOG44, pXTl. pSG 
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(available from Stratagene) pSVK3, pBPV, pMSG, pSVL (available from 
Pharmacia). 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
5 Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein- I. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the art. 

10 The present invention further provides host cells containing any one of the 

isolated fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention, wherein the fragment has been introduced into the 
host cell using known methods. The host cell can be a higher eukaryotic host 
cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or 

!5 a procaryotic cell, such as a bacterial cell. 

A polynucleotide of the present invention, such as a recombinant construct 
comprising an ORF of the present invention, may be introduced into the host by a 
variety of well established techniques that are standard in the an, such as calcium 
phosphate transfection, DEAE, dextran mediated transfection and electroporation, 

20 which are described in, for instance, Davis, L. et a/., BASIC METHODS IN 
MOLECULAR BIOLOGY (1986). 

A host cell containing one of the fragments of the Streptococcus 
pneumoniae genomic fragments and contigs of the present invention, can be used 
in conventional manners to produce the gene product encoded by the isolated 

25 fragment (in the case of an ORF) or can be used to produce a heterologous protein 
under the control of the EMF. The present invention further provides 

isolated polypeptides encoded by the nucleic acid fragments of the present 
invention or by degenerate variants of the nucleic acid fragments of the present 
invention. By "degenerate variant" is intended nucleotide fragments which differ 

30 from a nucleic acid fragment of the present invention (e.g.. an ORF) by nucleotide 
sequence but, due to the degeneracy of the Genetic Code, encode an identical 
polypeptide sequence. 

Preferred nucleic acid fragments of the present invention are the ORFs and 
subfragments thereof depicted in Tables 2 and 3 which encode proteins. 
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A variety of methodologies known in the art can be utilized to. obtain any 
one of the isolated polypeptides or proteins of the present invention. At the 
simplest level, the amino acid sequence can be synthesized using commercially 
available peptide synthesizers. This is particularly useful in producing small 
5 peptides and fragments of larger polypeptides. -5uch short fragments as may be 
obtained most readily by synthesis are useful, for example, in generating antibodies 
against the native polypeptide, as discussed further below. 

In an alternative method, the polypeptide or protein is purified from 
bacterial cells which naturally produce the polypeptide or protein. One skilled in 

10 the art can readily employ well-known methods for isolating polypeptides and 
proteins to isolate and purify polypeptides or proteins of the present invention 
produced naturally by a bacterial strain, or by other methods. Methods for 
isolation and purification that can be employed in this regard include, but are not 
limited to, immunochromatography, HPLC, size-exclusion chromatography, ion- 

15 exchange chromatography, and immuno-affinity chromatography. 

The polypeptides and proteins of the present invention also can be purified 
from cells which have been altered to express the desired polypeptide or protein. 
As used herein, a cell is said to be altered to express a desired polypeptide or 
protein when the cell, through genetic manipulation, is made to produce a 

20 polypeptide or protein which it normally does not produce or which the cell 
normally produces at a lower level. Those skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic 
sequences into eukaryotic or prokaryotic cells in order to generate a cell which 
produces one of the polypeptides or proteins of the present invention. 

25 Any host/vector system can be used to express one or more of the ORFs of 

the present invention. These include, but are not limited to. eukaryotic hosts such 
as HeLa cells, CV-1 cell, COS cells, and Sf9 cells, as well as prokaryotic host 
such as E. coli and B. subtilis. The most preferred cells are those which do not 
normally express the particular polypeptide or protein or which expresses the 

30 polypeptide or protein at low natural level. 
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"Recombinant," as used herein, means that a polypeptide or protein is 
derived from recombinant (e.g., microbial or mammalian) expression systems. 
"Microbial" refers to recombinant polypeptides or proteins made in bacterial or 
fungal (e.g.. yeast) expression systems. As a product, "recombinant 
5 microbiaTdefines a polypeptide or protein essentially free of native endogenous 
substances and unaccompanied by associated native glycosylation. Polypeptides or 
proteins expressed in most bacterial cultures, e.g., E. co/i, will be free of 
glycosylation modifications; polypeptides or proteins expressed in yeast will have a 
glycosylation pattern different from that expressed in mammalian cells. 

10 "Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. 

Generally, DNA segments encoding the polypeptides and proteins provided by this 
invention are assembled from fragments of the Streptococcus pneumoniae genome 
and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a 
synthetic gene which is capable of being expressed in a recombinant transcriptional 

15 unit comprising regulatory elements derived from a microbial or viral operon. 

Recombinant expression vehicle or vector" refers to a plasmid or phage or 
virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. The 
expression vehicle can comprise a transcriptional unit comprising an assembly of 
(1) a genetic regulatory elements necessary for gene expression in the host, 

20 including elements required to initiate and maintain transcription at a level sufficient 
for suitable expression of the desired polypeptide, including, for example, 
promoters and, where necessary, an enhancer and a polyadenylation signal; (2) a 
structural or coding sequence which is transcribed into mRNA and translated into 
protein, and (3) appropriate signals to initiate translation at the beginning of the 

25 desired coding region and terminate translation at its end. Structural units intended 
for use in yeast or eukaryotic expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by a host cell. 
Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an N-terminal methionine residue. This residue may or 

30 may not be subsequently cleaved from the expressed recombinant protein to 
provide a final product. 

"Recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extra chromosomal ly. The cells can be prokaryotic 

35 or eukaryotic. Recombinant expression systems as defined herein will express 
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heterologous polypeptides or proteins upon induction of the regulatory elements 
linked to the DNA segment or synthetic gene to be expressed. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation 
5 systems can also be employed to produce such proteins using RNAs derived from 
the DNA constructs of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described in Sambrook et 
«/., Molecular Cloning: A Laboratory Manual 2 nd Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989), the disclosure of which 

10 is hereby incorporated by reference in its entirety. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, e.g., 
the ampicillin resistance gene of £. coli and 5. cerevisiae TRP1 gene, and a 
promoter derived from a highly expressed gene to direct transcription of a 

15 downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3- phosphoglycerate kinase (PGK), alpha- 
factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and 
termination sequences, and preferably, a leader sequence capable of directing 

20 secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization 
or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 

25 structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable 
markers and an origin of replication to ensure maintenance of the vector and, when 
desirable, provide amplification within the host. 

30 Suitable prokaryotic hosts for transformation include strains of E. coli, B. 

subtilis. Salmonella typhimurium and various species within the genera 
Pseudomonas and Streptomyces. Others may, also be employed as a matter of 
choice. 

As a representative but non-limiting example, useful expression vectors for 
35 bacteriaJ use can comprise a selectable marker and bacteria] origin of replication 
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derived from commercially available plasmids comprising genetic elements of the 
well known cloning vector pBR322 (ATCC 37017). Such commercial vectors 
include, for example, pKK223-3 (available form Pharmacia Fine Chemicals, 
Uppsala, Sweden) and GEM 1 (available from Promega Biotec, Madison, WI, 
5 USA). These pBR322 "backbone" sections -afe combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter, where it is inducible, is 
derepressed or induced by appropriate means (e.g., temperature shift or chemical 

10 induction) and cells are cultured for an additional period to provide for expression 
of the induced gene product. Thereafter cells are typically harvested, generally by 
centrifugation, disrupted to release expressed protein, generally by physical or 
chemical means, and the resulting crude extract is retained for further purification. 

Various mammalian cell culture systems can also be employed to express 

15 recombinant protein. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts, described in Gluzman, Cell 23:115 
( 198 1 X and other cell lines capable of expressing a compatible vector, for example, 
the C127, 3T3, CHO, HeLa and BHK cell lines. 

Mammalian expression vectors will comprise an origin of replication, a 

20 suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, transcriptional termination 
sequences, and 5* flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, 
splice, and polyadenylation sites may be used to provide the required 

25 nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture is 
usually isolated by initial extraction from cell pellets, followed by one or more 
salting-out, aqueous ion exchange or size exclusion chromatography steps. 
Microbial cells employed in expression of proteins can be disrupted by any 

30 convenient method, including freeze-thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents. Protein refolding steps can be used, as 
necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification 
steps. 
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The present invention further includes isolated polypeptides, proteins and 
nucleic acid molecules which are substantially equivalent to those herein described. 
As used herein, substantially equivalent can refer both to nucleic acid and amino 
acid sequences, for example a mutant sequence, that varies from a reference 
5 sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between reference and 
subject sequences. For purposes of the present invention, sequences having 
equivalent biological activity, and equivalent expression characteristics are 
considered substantially equivalent. For purposes of determining equivalence, 

10 truncation of the mature sequence should be disregarded. 

The invention further provides methods of obtaining homologs from other 
strains of Streptococcus pneumoniae, of the fragments of the Streptococcus 
pneumoniae genome of the present invention and homologs of the proteins encoded 
by the ORFs of the present invention. As used herein, a sequence or protein of 

15 Streptococcus pneumoniae is defined as a homolog of a fragment of the 
Streptococcus pneumoniae fragments or contigs or a protein encoded by one of the 
ORFs of the present invention, if it shares significant homology to one of the 
fragments of the Streptococcus pneumoniae genome of the present invention or a 
protein encoded by one of the ORFs of the present invention. Specifically, by 

20 using the sequence disclosed herein as a probe or as primers, and techniques such 
as PCR cloning and colony/plaque hybridization, one skilled in the an can obtain 
homologs. 

As used herein, two nucleic acid molecules or proteins are said to "share 
significant homology" if the two contain regions which possess greater than 85% 

25 sequence (amino acid or nucleic acid) homology. Preferred homologs in this 
regard are those with more than 90% homology. Especially preferred are those 
with 93% or more homology. Among especially preferred homologs those with 
95% or more homology are particularly preferred. Very particularly preferred 
among these are those with 97% and even more particularly preferred among those 

30 are homologs with 99% or more homology. The most preferred homologs among 
these are those with 99.9% homology or more. It will be understood that, among 
measures of homology, identity is particularly preferred in this regard. 

Region specific primers or probes derived from the nucleotide sequence 
provided in SEQ ID NOS: 1-391 or from a nucleotide sequence at least 95%, 

35 particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ 
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ED NOS: 1-391 can be used to prime DNA synthesis and PCR amplification, as 
well as to identify colonies containing cloned DNA encoding a homolog. Methods 
suitable to this aspect of the present invention are well known and have been 
described in great detail in many publications such as, for example, Innis et al., 
5 PCR Protocol Academic Press, San Diego, CA ( 1 990)). 

When using primers derived from SEQ ID NOS: 1-391 or from a nucleotide 
sequence having an aforementioned identity to a sequence of SEQ ED NOS: 1-391, 
one skilled in the art will recognize that by employing high stringency conditions 
(e.g., annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50- 

10 65°C in 0.5X SSPC) only sequences which are greater than 75% homologous to 
the primer will be amplified. By employing lower stringency conditions {e.g., 
hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C 
in 0.5X SSPC), sequences which are greater than 40-50% homologous to the 
primer will also be amplified. 

15 When using DNA probes derived from SEQ ED NOS: 1-39 1, or from a 

nucleotide sequence having an aforementioned identity to a sequence of SEQ ID 
NOS: 1-391, for colony/plaque hybridization, one skilled in the art will recognize 
that by employing high stringency conditions (e.g., hybridizing at 50- 65°C in 5X 
SSPC and 50% formamide, and washing at 50- 65°C in 0.5X SSPC), sequences 
20 having regions which are greater than 90% homologous to the probe can be 
obtained, and that by employing lower stringency conditions (e.g., hybridizing at 
35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in 0.5X 
SSPC), sequences having regions which are greater than 35-45% homologous to 
the probe will be obtained. 
25 Any organism can be used as the source for homologs of the present 

invention so long as the organism naturally expresses such a protein or contains 
genes encoding the same. The most preferred organism for isolating homologs are 
bacteria which are closely related to Streptococcus pneumoniae. 

30 ILLUSTRATIVE USES OF COMPOSITIONS OF THE 

INVENTION 

Each ORF provided in Tables 1 and 2 is identified with a function by 
homology to a known gene or polypeptide. As a result, one skilled in the art can 
use the polypeptides of the present invention for commercial, therapeutic and 
35 industrial purposes consistent with the type of putative identification of the 
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polypeptide. Such identifications permit one skilled in the art to use the 
Streptococcus pneumoniae ORFs in a manner similar to the known type of 
sequences for which the identification is made; for example, to ferment a particular 
sugar source or to produce a particular metabolite. A variety of reviews illustrative 
5 of this aspect of the invention are available, including the following reviews on the 
industrial use of enzymes, for example, BIOCHEMICAL ENGINEERING AND 
BIOTECHNOLOGY HANDBOOK, 2nd Ed., MacMillan Publications, Ltd. NY 
(1991) and B IOC ATAL YSTS IN ORGANIC SYNTHESES, Tramper et aL 9 Eds.. 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). A variety of 
10 exemplary uses that illustrate this and similar aspects of the present invention are 
discussed below. 

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the catalytic 

)5 reactions involved in intermediary and macromolecular metabolism, the 
biosynthesis of small molecules, cellular processes and other functions includes 
enzymes involved in the degradation of the intermediary products of metabolism, 
enzymes involved in central intermediary metabolism, enzymes involved in 
respiration, both aerobic and anaerobic, enzymes involved in fermentation, 

20 enzymes involved in ATP proton motor force conversion, enzymes involved in 
broad regulatory function, enzymes involved in amino acid synthesis, enzymes 
involved in nucleotide synthesis, enzymes involved in cofactor and vitamin 
synthesis, can be used for industrial biosynthesis. 

The various metabolic pathways present in Streptococcus pneumoniae can 

25 be identified based on absolute nutritional requirements as well as by examining the 
various enzymes identified in Table i-3 and SEQ ID NOS: 1-391. 

Of particular interest are polypeptides involved in the degradation of 
intermediary metabolites as well as non-macromolecular metabolism. Such 
enzymes include amylases, glucose oxidases, and catalase. 

30 Proteolytic enzymes are another class of commercially important enzymes. 

Proteolytic enzymes find use in a number of industrial processes including the 
processing of flax and other vegetable fibers, in the extraction, clarification and 
depectinization of fruit juices, in the extraction of vegetables 1 oil and in the 
maceration of fruits and vegetables to give unicellular fruits. A detailed review of 

35 the proteolytic enzymes, used in the food industry is provided in Rombouts et ai. 
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Symbiosis 21:19 (1986) and Voragen et ai in Biocatalysts In Agricultural 
Biotechnology, Whitaker et a/., Eds., American Chemical Society Symposium 
Series 559:93(1989). 

The metabolism of sugars is an important aspect of the primary metabolism 
5 of Streptococcus pneumoniae. Enzymes involved in the degradation of sugars, 
such as, particularly, glucose, galactose, fructose and xylose, can be used in 
industrial fermentation. Some of the important sugar transforming enzymes, from 
a commercial viewpoint, include sugar isomerases such as glucose isomerase. 
Other metabolic enzymes have found commercial use such as glucose oxidases 

10 which produces ketogulonic acid (KG A)* KG A is an intermediate in the 
commercial production of ascorbic acid using the Reichstein's procedure, as 
described in Kruegerer ai. Biotechnology 6(A) . Rhine et aL, Eds., Veriag Press, 
Weinhcim, Germany (1984). 

Glucose oxidase (GOD) is commercially available and has been used in 

15 purified form as well as in an immobilized form for the deoxygenation of beer. 
See, for instance, Hanmeir et a/., Biotechnology Leners 7:21 (1979). The most 
important application of GOD is the industrial scale fermentation of gluconic acid. 
Market for gluconic acids which are used in the detergent, textile, leather, 
photographic, pharmaceutical, food, feed and concrete industry, as described, for 

20 example, in Bigelis et a/., beginning on page 357 in GENE MANIPULATIONS 
AND FUNGI; Benett et al., Eds., Academic Press, New York ( 1985). In addition 
to industrial applications, GOD has found applications in medicine for quantitative 
determination of glucose in body fluids recently in biotechnology for analyzing 
syrups from starch and cellulose hydrosylates. This application is described in 

25 Owusu et aL. Biochem. et Biophysica. Acta. 872:S3 ( 1986), for instance. 

The main sweetener used in the world today is sugar which comes from 
sugar beets and sugar cane. In the Held of industrial enzymes, the glucose 
isomerase process shows the largest expansion in the market today. Initially, 
soluble enzymes were used and later immobilized enzymes were developed 

30 (Krueger et ai. Biotechnology, The Textbook of Industrial Microbiology* Sinauer 
Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of 
glucose- produced high fructose syrups is by far the largest industrial business 
using immobilized enzymes. A review of the industrial use of these enzymes is 
provided by Jorgensen, Starch 40:307 ( 1988). 
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Proteinases, such as alkaline serine proteinases, axe used as detergent 
additives and thus represent one of the largest volumes of microbial enzymes used 
in the industrial sector. Because of their industrial importance, there is a large body 
of published and unpublished information regarding the use of these enzymes in 
5 industrial processes, (See Faultrnan et aL % Acid Proteases Structure Function and 
Biology, Tang, J., ed. f Plenum Press, New York (1977) and Godfrey et aL f 
Industrial Enzymes, MacMillan Publishers, Surrey, UK (1983) and Hepner et ai % 
Report Industrial Enzymes by 1990, Hel Hepner & Associates, London ( 1986)). 

Another class of commercially usable proteins of the present invention are 
10 the microbial lipases, described by, for instance, Macrae et al. % Philosophical 
Transactions of the Chiral Society of London 310:221 (1985) and Poserke, Journal 
of the American Oil Chemist Society 61 ;1758 (1984). A major use of lipases is in 
the fat and oil industry for the production of neutral glycerides using lipase 
catalyzed inter-esterification of readily available triglycerides. Application of 
15 lipases include the use as a detergent additive to facilitate the removal of fats from 
fabrics in the course of the washing procedures. 

The use of enzymes, and in particular microbial enzymes, as catalyst for 
key steps in the synthesis of complex organic molecules is gaining popularity at a 
great rate. One area of great interest is the preparation of chiral intermediates. 
20 Preparation of chiral intermediates is of interest to a wide range of synthetic 
chemists particularly those scientists involved with the preparation of new 
pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et al. % Recent 
Advances in the Generation of Chiral Intermediates Using Enzymes. CRC Press, 
Boca Raton, Florida (1990)). The following reactions catalyzed by enzymes are of 
25 interest to organic chemists: hydrolysis of carboxylic acid esters, phosphate esters, 
amides and nitriles, esterification reactions, trans-esterification reactions, synthesis 
of amides, reduction of alkanones and oxoalkanates, oxidation of alcohols to 
carbonyl compounds, oxidation of sulfides to sulfoxides, and carbon bond forming 
reactions such as the aldol reaction. 
30 When considering the use of an enzyme encoded by one of the ORFs of the 

present invention for biotransformation and organic synthesis it is sometimes 
necessary to consider the respective advantages and disadvantages of using a 
microorganism as opposed to an isolated enzyme. Pros and cons of using a whole 
cell system on the one hand or an isolated partially purified enzyme on the other 
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hand, has been described in detail by Bud et a/.. Chemistry in Britain (1987), p. 

127. - 

Amino transferases, enzymes involved in the biosynthesis and metabolism 

of amino acids, arc useful in the catalytic production of amino acids. The 
5 advantages of using microbial based enzyme systems is that the amino transferase 

enzymes catalyze the stereo- selective synthesis of only L-amino acids and 

generally possess uniformly high catalytic rates. A description of the use of amino 

transferases for amino acid production is provided by Roselle-David, Methods of 

Enzymology 1 36:419 (1987). 
10 Another category of useful proteins encoded by the ORFs of the present 

invention include enzymes involved in nucleic acid synthesis, repair, and 

recombination. 

2. Generation of Antibodies 

15 As described here, the proteins of the present invention, as well as 

homologs thereof, can be used in a variety of procedures and methods known in 
the art which arc currently applied to other proteins. The proteins of the present 
invention can further be used to generate an antibody which selectively binds the 
protein. Such antibodies can be either monoclonal or polyclonal antibodies, as well 

20 fragments of these antibodies, and humanized forms. 

The invention further provides antibodies which selectively bind to one of 
the proteins of the present invention and hybridomas which produce these 
antibodies. A hybridoma is an immortalized cell line which is capable of secreting 
a specific monoclonal antibody. 

25 In general, techniques for preparing polyclonal and monoclonal antibodies 

as well as hybridomas capable of producing the desired antibody are well known in 
the art (Campbell, A. M„ Monoclonal Antibody Technology: Laboratory 
Techniques In Biochemistry And Molecular Biology. Elsevier Science Publishers, 
Amsterdam, The Netherlands (1984); St. Groth et at., J. Immunol. Methods 35: I- 

30 21 (1980), Kohler and Milstein, Nature 256:495-497 (1975)), the trioma 
technique, the human B-cell hybridoma technique (Kozbor et aL, Immunology 
Today 4:12 (1983), pgs. 77-96 of Cole et al. 9 in Monoclonal Antibodies And 
Cancer Therapy, Alan R. Liss, Inc. (1985)). Any animal (mouse, rabbit, 

etc.) which is known to produce antibodies can be immunized with the pseudogene 

35 polypeptide. Methods for immunization are well known in the art. Such methods 
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include subcutaneous or interperitoneal injection of the polypeptide. One skilled in 
the art will recognize that the amount of the protein encoded by the ORF of the 
present invention used for immunization will vary based on the animal which is 
immunized, the antigenicity of the peptide and the site of injection. 
5 The protein which is used as an immunogen may be modified or 

administered in an adjuvant in order to increase the protein's antigenicity. Methods 
of increasing the antigenicity of a protein are well known in the art and include, but 
are not limited to coupling the antigen with a heterologous protein (such as globulin 
or galactosidase) or through the inclusion of an adjuvant during immunization. 

10 For monoclonal antibodies, spleen cells from the immunized animals are 

removed, fused with myeloma cells, such as SP2/0-Agi4 myeloma cells, and 
allowed to become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art can be used to 
identify the hybridoma cell which produces an antibody with the desired 

15 characteristics. These include screening the hybridomas with an ELISA assay, 
western blot analysis, or radioimmunoassay (Lutz et a/., Exp. Cell Res. 775:109- 
124 (1988)). 

Hybridomas secreting the desired antibodies are cloned and the class and 
subclass is determined using procedures known in the art (Campbell, A. M., 
20 Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam. The Netherlands 
(1984)). 

Techniques described for the production of single chain antibodies (U. S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to proteins of 
25 the present invention. 

For polyclonal antibodies, antibody containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. 

The present invention further provides the above- described antibodies in 
30 detectably labelled form. Antibodies can be detectably labelled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels (such 
as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent labels (such as 
FITC or rhodamine, etc.), paramagnetic atoms, etc. Procedures for accomplishing 
such labeling are well-known in the art, for example see Sternberger et a/., J. 
35 Histochem. Cywchem. 18:3 15 ( 1970): Bayer, E. A. et ai< Meth. Enzym. (52:308 
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(1979); Engval, E. et ai, Immunol 109:129 (1972); Goding, J. W., /. Immunol. 
Meth. J 3:21 5 (1976)). 

The labeled antibodies of the present invention can be used for in vitro, in 
vivo, and in situ assays to identify cells or tissues in which a fragment of the 
5 Streptococcus pneumoniae genome is expressed. 

The present invention further provides the above-described antibodies 
immobilized on a solid support. Examples of such solid supports include plastics 
such as polycarbonate, complex carbohydrates such as agarose and sepharose, 
acrylic resins and such as polyacrylamide and latex beads. Techniques for 

10 coupling antibodies to such solid supports are well known in the art (Weir. D. M. 
et ai, "Handbook of Experimental Immunology" 4th Ed., Blackwell Scientific 
Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et aL, Meth. 
Enzym. 34 Academic Press, N. Y. (1974)). The immobilized antibodies of the 
present invention can be used for in vitro, in vivo, and in situ assays as well as for 

15 immunoaffinity purification of the proteins of the present invention. 

3. Diagnostic Assays and Kits 

The present invention further provides methods to identify the expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, 

20 using one of the DFs or antibodies of the present invention. 

In detail, such methods comprise incubating a test sample with one or more 
of the antibodies or one or more of the DFs of the present invention and assaying 
for binding of the DFs or antibodies to components within the test sample. 

Conditions for incubating a DF or antibody with a test sample vary. 

25 Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the DF or antibody used in the 
assay. One skilled in the art will recognize that any one of the commonly available 
hybridization, amplification or immunological assay formats can readily be adapted 
to employ the DFs or antibodies of the present invention. Examples of such assays 

30 can be found in Chard, T., An Introduction to Radioimmunoassay and Related 
Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); 
Bullock, G. R. et aL, Techniques in Immunocytochemistry, Academic Press, 
Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and 
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Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands 
(1985). 

The test samples of the present invention include cells* protein or membrane 
extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
5 urine. The test sample used in the above -described method will vary based on the 
assay format, nature of the detection method and the tissues, cells or extracts used 
as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well known in the an and can be readily be adapted in order to 
obtain a sample which is compatible with the system utilized. 

10 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
comprising one of the DFs or antibodies of the present invention: and (b) one or 

15 more other containers comprising one or more of the following: wash reagents, 
reagents capable of detecting presence of a bound DF or antibody. 

In detail, a compartmentalized kit includes any kit in which reagents are 
contained in separate containers. Such containers include small glass containers, 
plastic containers or strips of plastic or paper. Such containers allows one to 

20 efficiently transfer reagents from one compartment to another compartment such 
that the samples and reagents are not cross-contaminated, and the agents or 
solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container which will accept 
the test sample, a container which contains the antibodies used in the assay, 

25 containers which contain wash reagents (such as phosphate buffered saline, Tris- 
buffers, etc.) 9 and containers which contain the reagents used to detect the bound 
antibody or DF. 

Types of detection reagents include labelled nucleic acid probes, labelled 
secondary antibodies, or in the alternative, if the primary antibody is labelled, the 
30 enzymatic, or antibody binding reagents which are capable of reacting with the 
labelled antibody. One skilled in the art will readily recognize that the disclbsed 
DFs and antibodies of the present invention can be readily incorporated into one of 
the established kit formats which are well known in the art. 

35 4. Screening Assay for Binding Agents 
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Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a 
protein encoded by one of the ORFs of the present invention or to one of the 
fragments and the Streptococcus pneumoniae fragment and contigs herein 
5 described. 

In general, such methods comprise steps of: 

(a) contacting an agent with an isolated protein encoded by one of the 
ORFs of the present invention, or an isolated fragment of the Streptococcus 
pneumoniae genome; and 
10 (b) determining whether the agent binds to said protein or said fragment. 

The agents screened in the above assay can be, but are not limited to, 
peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. The 
agents can be selected and screened at random or rationally selected or designed 
using protein modeling techniques. 
15 For random screening, agents such as peptides, carbohydrates, 

pharmaceutical agents and the like are selected at random and are assayed for their 
ability to bind to the protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used 
herein, an agent is said to be "rationally selected or designed" when the agent is 
20 chosen based on the configuration of the particular protein. For example, one 
skilled in the art can readily adapt currently available procedures to generate 
peptides, pharmaceutical agents and the like capable of binding to a specific peptide 
sequence in order to generate rationally designed antipeptide peptides, for example 
see Hurby et a/., "Application of Synthetic Peptides: Antisense Peptides," in 
25 Synthetic Peptides. A User's Guide, W. H. Freeman, NY (1992), pp. 289-307, 
and Kaspczak et aL, Biochemistry 25:9230-8 (1989), or pharmaceutical agents, or 
the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used 1 to control gene expression through binding to one 
30 of the ORFs or EMFs of the present invention. As described above, such agents 
can be randomly screened or rationally designed/selected. Targeting the ORF or 
EMF allows a skilled artisan to design sequence specific or element specific agents, 
modulating the expression of either a single ORF or multiple ORFs which rely on 
the same EMF for expression control. 
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One class of DNA binding agents are agents which contain base residues 
which hybridize or form a triple helix by binding to DNA or RNA. Such agents 
can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a 
variety of sulfhydryl or polymeric derivatives which have base attachment capacity. 
5 Agents suitable for use in these methods usually contain 20 to 40 bases and 

are designed to be complementary to a region of the gene involved in transcription 
(triple helix - see Lee et ai, Nucl Acids Res. 6:3073 (1979); Cooney et aL % 
Science 241:456 (1988); and Dervan et al. % Science 257:1360 (1991)) or to the 
mRNA itself (antisense - Okano, 7. Neurochem. 56:560 (1991); 

10 Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression. CRC Press, 
Boca Raton, FL (1988)). Triple helix- formation optimally results in a shut-off of 
RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the 

15 sequences of the present invention can be used to design antisense and triple helix- 
forming oligonucleotides, and other DNA binding agents. 

5. Pharmaceutical Compositions and Vaccines 

The present invention further provides pharmaceutical agents which can be 

20 used to modulate the growth or pathogenicity of Streptococcus pneumoniae, or 
another related organism, in vivo or in vitro. As used herein, a "pharmaceutical 
agent" is defined as a composition of matter which can be formulated using known 
techniques to provide a pharmaceutical compositions. As used herein, the 
"pharmaceutical agents of the present invention" refers the pharmaceutical agents 

25 which are derived from the proteins encoded by the ORFs of the present invention 
or are agents which are identified using the herein described assays. 

As used herein, a pharmaceutical agent is said to "modulate the growth 
pathogenicity of Streptococcus pneumoniae or a related organism, in vivo or in 
vitro" when the agent reduces the rate of growth, rate of division, or viability of 

30 the organism in question. The pharmaceutical agents of the present invention can 
modulate the growth or pathogenicity of an organism in many fashions, although 
an understanding of the underlying mechanism of action is not needed to practice 
the use of the pharmaceutical agents of the present invention. Some agents will 
modulate the growth by binding to an important protein thus blocking the biological 

35 activity of the protein, while other agents may bind to a component of the outer 
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surface of the organism blocking attachment or rendering the organism more prone 
to act the bodies nature immune system. Alternatively, the agent may comprise a 
protein encoded by one of the ORFs of the present invention and serve as a 
vaccine. The development and use of a vaccine based on outer membrane 
5 components are well known in the art. — 

As used herein, a "related organism" is a broad term which refers to any 
organism whose growth can be modulated by one of the pharmaceutical agents of 
the present invention. In general, such an organism will contain a homolog of the 
protein which is the target of the pharmaceutical agent or the protein used as a 
10 vaccine. As such, related organisms do not need to be bacterial but may be fungal 
or viral pathogens. 

The pharmaceutical agents and compositions of the present invention may 
be administered in a convenient manner, such as by the oral, topical, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. The 
15 pharmaceutical compositions are administered in an amount which is effective for 
treating and/or prophylaxis of the specific indication. In general, they are 
administered in an amount of at least about 1 mg/kg body weight and in most cases 
they will be administered in an amount not in excess of about 1 g/kg body weight 
per day. In most cases, the dosage is from about 0.1 mg/kg to about 10 g/kg body 

20 weight daily, taking into account the routes of administration, symptoms, etc. 

The agents of the present invention can be used in native form -or can be 
modified to form a chemical derivative. As used herein, a molecule is said to be a 
"chemical derivative" of another molecule when it contains additional chemical 
moieties not normally a part of the molecule. Such moieties may improve the 

25 molecule's solubility, absorption, biological half life, etc. The moieties may 
alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, etc. Moieties capable of mediating such 
effects are disclosed in, among other sources, REMINGTON'S 
PHARMACEUTICAL SCIENCES (1980) cited elsewhere herein. 

30 For example, such moieties may change an immunological character of the 

functional derivative, such as affinity for a given antibody. Such changes in 
immunomodulation activity are measured by the appropriate assay, such as a 
competitive type immunoassay. Modifications of such protein properties as redox 
or thermal stability, biological half-life, hydrophobicity, susceptibility to proteolytic 

35 degradation or the tendency to aggregate with carriers or into multimers also may 
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be effected in this way and can be assayed by methods well known to the skilled 
artisan. 

The therapeutic effects of the agents of the present invention may be 
obtained by providing the agent to a patient by any suitable means (e.g.. inhalation. 
5 intravenously, intramuscularly, subcutaneously. enterally, or parentcrally). It is 
preferred to administer the agent of the present invention so as to achieve an 
effective concentration within the blood or tissue in which the growth of the 
organism is to be controlled. To achieve an effective blood concentration, the 
preferred method is to administer the agent by injection. The administration may be 
10 by continuous infusion, or by single or multiple injections. 

In providing a patient with one of the agents of the present invention, the 
dosage of the administered agent will vary depending upon such factors as the 
patient's age, weight, height, sex, general medical condition, previous medical 
history, etc. In general, it is desirable to provide the recipient with a dosage of 
15 agent which is in the range of from about 1 pg/kg to 10 mg/kg (body weight of 
patient), although a lower or higher dosage may be administered. The 
therapeutically effective dose can be lowered by using combinations of the agents 
of the present invention or another agent. 

As used herein, two or more compounds or agents are said to be 
20 administered M in combination" with each other when either (1) the physiological 
effects of each compound, or (2) the serum concentrations of each compound can 
be measured at the same time. The composition of the present invention can be 
administered concurrently with, prior to, or following the administration of the 
other agent. 

25 The agents of the present invention are intended to be provided to recipient 

subjects in an amount sufficient to decrease the rate of growth (as defined above) of 
the target organism. 

The administration of the agent(s) of the invention may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the 

30 agent(s) are provided in advance of any symptoms indicative of the organisms 
growth. The prophylactic administration of the agent(s) serves to prevent, 
attenuate, or decrease the rate of onset of any subsequent infection. When 
provided therapeutically, the agent(s) are provided at (or shortly after) the onset of 
an indication of infection. The therapeutic administration of the compound(s) 
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serves to attenuate the pathological symptoms of the infection and to increase the 
rate of recovery. 

The agents of the present invention are administered to a subject, such as a 
mammal, or a patient, in a pharmaceutically acceptable form and in a therapeutically 
5 effective concentration. A composition is said to be "pharmacologically acceptable" 
if its administration can be tolerated by a recipient patient. Such an agent is said to 
be administered in a "therapeutically effective amount" if the amount administered 
is physiologically significant. An agent is physiologically significant if its presence 
results in a detectable change in the physiology of a recipient patient. 
10 The agents of the present invention can be formulated according to known 

methods to prepare pharmaceutically useful compositions, whereby these materials, 
or their functional derivatives, are combined in a mixture with a pharmaceutically 
acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of 
other human proteins* e.g., human serum albumin, are described, for example, in 
15 REMINGTON'S PHARMACEUTICAL SCIENCES, 16 th Ed., Osol, A. t Ed., 
Mack Publishing, Easton PA (1980). In order to form a pharmaceutically 
acceptable composition suitable for effective administration, such compositions will 
contain an effective amount of one or more of the agents of the present invention, 
together with a suitable amount of carrier vehicle. 

20 Additional pharmaceutical methods may be employed to control the duration 

of action. Control release preparations may be achieved through the use of 
polymers to complex or absorb one or more of the agents of the present invention. 
The controlled delivery may be effectuated by a variety of well known techniques, 
including formulation with macromolecules such as, for example, polyesters, 

25 polyamino acids, polyvinyl, pyrrolidone, ethylenevinylacetate, methylcellulose, 
carboxymethylcellulose, or protamine, sulfate, adjusting the concentration of the 
macromolecules and the agent in the formulation, and by appropriate use of 
methods of incorporation, which can be manipulated to effectuate a desired time 
course of release. Another possible method to control the duration of action by 

30 controlled release preparations is to incorporate agents of the present invention into 
panicles of a polymeric material such as polyesters, polyamino acids, hydrogels, 
poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of 
incorporating these agents into polymeric particles, it is possible to entrap these 
materials in microcapsules prepared, for example, by coacervation techniques or by 

35 interfacial polymerization with, for example, hydroxymethylcellulose or gelatine- 
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microcapsules and poly(methylmethacylate) microcapsules, respectively, or in 
colloidal drug delivery systems, for example, liposomes, albumin microspheres, 
microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such 
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES 
5 (1980). 

The invention further provides a pharmaceutical pack or kit comprising one 
or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice in 
the form prescribed by a governmental agency regulating the manufacture, use or 
10 sale of pharmaceuticals or biological products, which notice reflects approval by 
the agency of manufacture, use or sale for human administration. 

In addition, the agents of the present invention may be employed in 
conjunction with other therapeutic compounds. 

15 6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further demonstrates that a large sequence can be 
sequenced using a random shotgun approach. This procedure, described in detail 
in the examples that follow, has eliminated the up front cost of isolating and 
ordering overlapping or contiguous subclones prior to the start of the sequencing 

20 protocols. 

Certain aspects of the present invention are described in greater detail in the 
examples that follow. The examples are provided by way of illustration. Other 
aspects and embodiments of the present invention are contemplated by the 
inventors, as will be clear to those of skill in the art from reading the present 
25 disclosure. 

ILLUSTRATI VE EXAMPLES 

LIBRARIES AND SEQUENCING 
30 1. Shotgun Sequencing Probability Analysis 

The overall strategy for a shotgun approach to whole genome sequencing 

follows from the Lander and Waterman (Landerman and Waterman, Genomics 

2:231 (1988)) application of the equation for the Poisson distribution. According 

to this treatment, the probability, P . that any given base in a sequence of size L, in 

^5 nucleotides, is not sequenced after a certain amount, n, in nucleotides, of random 

0 
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sequence has been determined can be calculated by the equation P = e" m , where m 
is L/n, the fold coverage. For instance, for a genome of 2.8 Mb f m=l when 2.8 
Mb of sequence has been randomly generated (IX coverage). A?that point, P = 
e"l = 0.37. The probability that any given base has not been sequenced is the same 
5 as the probability that any region of the whole sequence L has not been determined 
and, therefore, is equivalent to the fraction of the whole sequence that has yet to be 
determined. Thus, at one-fold coverage, approximately 37% of a polynucleotide of 
size L, in nucleotides has not been sequenced. When 14 Mb of sequence has been 
generated, coverage is 5X for a 2.8 Mb and the unsequenced fraction drops to 

10 .0067 or 0.67%. 5X coverage of a 2.8 Mb sequence can be attained by sequencing 
approximately 17,000 random clones from both insert ends with an average 
sequence read length of 4 10 bp. 

Similarly, the total gap length, G, is determined by the equation G = Le~ m . 
and the average gap size, g, follows the equation, g = L/n. Thus, 5X coverage 

15 leaves about 240 gaps averaging about 82 bp in size in a sequence of a 
polynucleotide 2.8 Mb long. 

The treatment above is essentially that of Lander and Waterman, Genomics 
2:231 (1988). 

20 2. Random Library Construction 

In order to approximate the random model described above during actual 
sequencing, a nearly ideal library of cloned genomic fragments is required. The 
following library construction procedure was developed to achieve this end. 

Streptococcus pneumoniae DNA is prepared by phenol extraction. A 

25 mixture containing 200 jig DNA in 1 .0 ml of 300 mM sodium acetate, 10 mM Tris- 
HCI, 1 mM Na-EDTA, 50% glycerol is processed through a nebulizer (IPI Medical 
Products) with a stream of nitrogen adjusted to 35 Kpa for 2 minutes. The 
sonicated DNA is ethanol precipitated and redissolved in 500 |il TE buffer. 

To create blunt-ends, a 100 jil aliquot of the resuspended DNA is digested 

30 with 5 units of BAL31 nuclease (New England BioLabs) for 10 min at 30°C in 200 
Hi BAL31 buffer. The digested DNA is phenol-extracted, ethanol -precipitated, 
redissolved in 100 \il TE buffer, and then size-fractionated by electrophoresis 
through a 1.0% low melting temperature agarose gel. The section containing DNA 
fragments 1.6-2.0 kb in size is excised from the gel, and the LGT agarose is melted 

35 and the resulting solution is extracted with phenol to separate the agarose from the 
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DNA. DNA is ethanol precipitated and redissolved in 20 |il of TE> buffer for 
ligation to vector. 

A two-step ligation procedure is used to produce a plasmid library with 
97% inserts, of which >99% were single inserts. The first ligation mixture (50 u!) 
5 contains 2 \xg of DNA fragments, 2 jig pUC18 DNA (Pharmacia) cut with Smal 
and dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase 
(GIBCO/BRL) and is incubated at 14°C for 4 hr. The ligation mixture then is 
phenol extracted and ethanol precipitated, and the precipitated DNA is dissolved in 
20 til TE buffer and electrophoresed on a 1 .0% low melting agarose gel. Discrete 

10 bands in a ladder are visualized by ethidium bromide-staining and UV illumination 
and identified by size as insert (I), vector (v), v+I, v+2i, v+3i, etc. The portion of 
the gel containing v+I DNA is excised and the v+I DNA is recovered and 
resuspended into 20 jxl TE. The v+I DNA then is blunt-ended by T4 polymerase 
treatment for 5 min. at 37°C in a reaction mixture (50 ul) containing the v+I linears, 

15 500 jiM each of the 4 dNTPs, and 9 units of T4 polymerase (New England 
BioLabs), under recommended buffer conditions. After phenol extraction and 
ethanol precipitation the repaired v+I linears are dissolved in 20 \xl TE. The final 
ligation to produce circles is carried out in a 50 p.1 reaction containing 5 ill of v+I 
linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the 

20 following day, the reaction mixture is stored at -20°C. 

This two-stage procedure results in a molecularly random collection of 
single-insert plasmid recombinants with minimal contamination from double-insert 
chimeras (< I %) or free vector (<3%). 

Since deviation from randomness can arise from propagation the DNA in 

25 the host. £. coli host cells deficient in all recombination and restriction functions 
(A. Greener, Strategies 3 (1):5 (1990)) are used to prevent rearrangements, 
deletions, and loss of clones by restriction. Furthermore, transformed cells are 
plated directly on antibiotic diffusion plates to avoid the usual broth recovery phase 
which allows multiplication and selection of the most rapidly growing cells. 

30 Plating is carried out as follows. A 100 }il aliquot of Epicurian Coli SURE 

II Supercompetent Cells (Stratagene 200152) is thawed on ice and transferred to a 
chilled Falcon 2059 tube on ice. A 1.7 jxl aliquot of 1.42 M beta-mercaptoethanol 
is added to the aliquot of cells to a final concentration of 25 mM. Cells are 
incubated on ice for 10 min. A I jil aliquot of the final ligation is added to the cells 

35 and incubated on ice for 30 min. The cells are heat pulsed for 30 sec. at 42°C and 
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placed back on ice for 2 min. The outgrowth period, in liquid culture is eliminated 

from this protocol in order to minimize the preferential growth of any given 

transformed cell. Instead the transformation mixture is plated directly on a nutrient 

rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar: 20 g 

5 tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 

ml bottom layer is supplemented with 0.4 ml of 50 mg/ml ampicillin per 100 ml 

SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 ml X-Gal 

(2%), 1 ml MgCl (I M). and 1 ml MgSO /100 ml SOB agar. The 15 ml top layer 

is poured just prior to plating. Our titer is approximately 100 colonies/10 |il aliquot 
o 4 
10 of transformation? 

All colonies are picked for template preparation regardless of size. Thus, 

only clones lost due to "poison" DNA or deleterious gene products are deleted from 

the library, resulting in a slight increase in gap number over that expected. 

15 3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates are prepared using a 
"boiling bead" method developed in collaboration with Advanced Genetic 
Technology Corp. (Gaithersburg, MD) (Adams et al % Science 252:1651 (1991); 
Adams et aL* Nature 555. 632 (1992)). Plasmid preparation is performed in a 96- 

20 well format for all stages of DNA preparation from bacterial growth through final 
DNA purification. Template concentration is determined using Hoechst Dye and a 
Millipore Cytofluor. DNA concentrations are not adjusted, but low-yielding 
templates are identified where possible and not sequenced. 

Templates are also prepared from two Streptococcus pneumoniae lambda 

25 genomic libraries. An amplified library is constructed in the vector Lambda GEM- 
12 (Promega) and an unamplified library is constructed in Lambda DASH II 
(Stratagene). In particular, for the unamplified lambda library, Streptococcus 
pneumoniae DNA (> 100 kb) is partially digested in a reaction mixture (200 ul) 
containing 50 \i% DNA, IX Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C 

30 The digested DNA was phenol-extracted and electrophoresed on a 0.5% low 
melting agarose gel at 2V/cm for 7 hours. Fragments from 15 to 25 kb are excised 
and recovered in a final volume of 6 ul. One (il of fragments is used with 1 |il of 
DASHII vector (Stratagene) in the recommended ligation reaction. One |il of the 
ligation mixture is used per packaging reaction following the recommended 

35 protocol with the Gigapack II XL Packaging Extract (Stratagene, #2277] I). Phage 
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are plated directly without amplification from the packaging mixture (after dilution 
with 500 pJ of recommended SM buffer and chloroform treatment). Yield is about 
2.5x1 O^ pfu/ul. The amplified library is prepared essentially as above except the 
lambda GEM- 12 vector is used. After packaging, about 3.5x10^ pfu are plated on 
5 the restrictive NM539 host. The lysate is harvested in 2 ml of SM buffer and 
stored frozen in 7% dimethylsulfoxide. The phage titer is approximately txlO^ 
pfu/ml. 

Liquid lysates (100 are prepared from randomly selected plaques (from 
the unamplified library) and template is prepared by long-range PCR using T7 and 
10 T3 vector-specific primers. 

Sequencing reactions are carried out on plasmid and/or PCR templates 
using the AB Catalyst LabStation with Applied Biosystems PRISM Ready 
Reaction Dye Primer Cycle Sequencing Kits for the M 13 forward (Ml 3-21) and 
the M13 reverse (M13RP1) primers (Adams et aL. Nature 368:474 (1994)). Dye 
15 terminator sequencing reactions are carried out on the lambda templates on a 
Perkin-Elmer 9600 Thermocycler using the Applied Biosystems Ready Reaction 
Dye Terminator Cycle Sequencing kits. T7 and SP6 primers are used to sequence 
the ends of the inserts from the Lambda GEM- 12 library and T7 and T3 primers are 
used to sequence the ends of the inserts from the Lambda DASH II library. 

20 Sequencing reactions are performed by eight individuals using an average of 
fourteen AB 373 DNA Sequencers per day. All sequencing reactions are analyzed 
using the Stretch modification of the AB 373, primarily using a 34 cm well-to-read 
distance. The overall sequencing success rate very approximately is about 85% for 
Ml 3-21 and M13RPI sequences and 65% for dye-terminator reactions. The 

25 average usable read length is 485 bp for Ml 3-21 sequences, 445bp for MI3RP1 
sequences, and 375 bp for dye-terminator reactions. 

Richards et aL Chapter 28 in AUTOMATED DNA SEQUENCING AND 
ANALYSIS, M. D. Adams, C. Fields, J. C. Venter, Eds., Academic Press. 
London, (1994) described the value of using sequence from both ends of 

30 sequencing templates to facilitate ordering of contigs in shotgun assembly projects 
of lambda and cosmid clones. We balance the desirability of both-end sequencing 
(including the reduced cost of lower total number of templates) against shorter 
read-lengths for sequencing reactions performed with the M13RP1 (reverse) primer 
compared to the Ml 3-21 (forward) primer. Approximately one-half of the 

35 templates are sequenced from both ends. Random reverse sequencing reactions are 
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done based on successful forward sequencing reactions. Some M13RP1 
sequences are obtained in a semi-directed fashion: Ml 3-21: sequences pointing 
outward at the ends of contigs are chosen for M13RP1 sequencing in an effort to 
specifically order contigs. 

5 

4. Protocol for Automated Cycle Sequencing 

The sequencing is carried out using ABI Catalyst robots and AB 373 
Automated DNA Sequencers, The Catalyst robot is a publicly available 
sophisticated pipetting and temperature control robot which has been developed 

10 specifically for DNA sequencing reactions. The Catalyst combines pre-aliquoted 
templates and reaction mixes consisting of deoxy- and dideoxynucleotides, the 
thermostable Taq DNA polymerase, fluorescently-labelled sequencing primers, and 
reaction buffer. Reaction mixes and templates are combined in the wells of an 
aluminum 96-weIl thermocycling plate. Thirty consecutive cycles of linear 

15 amplification (i.e.., one primer synthesis) steps are performed including 
denaturation, annealing of primer and template, and extension; i.*., DNA 
synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents 
evaporation without the need for an oil overlay. 

Two sequencing protocols are used: one for dye-labelled primers and a 
20 second for dye-labelled dideoxy chain terminators. The shotgun sequencing 
involves use of four dye-labelled sequencing primers, one for each of the four 
terminator nucleotide. Each dye-primer is labelled with a different fluorescent dye, 
permitting the four individual reactions to be combined into one lane of the 373 
DNA Sequencer for electrophoresis, detection, and base-calling. ABI currently 
25 supplies pre-mixed reaction mixes in bulk packages containing all the necessary 
non-template reagents for sequencing. Sequencing can be done with both plasmid 
and PCR- generated templates with both dye-primers and dye- terminators with 
approximately equal fidelity, although plasmid templates generally give longer 
usable sequences. 

30 Thirty-two reactions are loaded per AB373 Sequencer each day. for a total 

of 960 samples. Electrophoresis is run overnight following the manufacturer's 
protocols, and the data is collected for twelve hours. Following electrophoresis 
and fluorescence detection, the ABI 373 performs automatic lane tracking and base- 
calling. The lane-tracking is confirmed visually. Each sequence electropherogram 

35 (or fluorescence lane trace) is inspected visually and assessed for quality. Trailing 
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sequences of low quality are removed and the sequence itself is loaded via software 
to a Sybase database (archived daily to 8mm tape). Leading vector polylinker 
sequence is removed automatically by a software program. Average edited lengths 
of sequences from the standard ABI 373 are around 400 bp and depend mostly on 
5 the quality of the template used for the sequencing reaction. ABI 373 Sequencers 
converted to Stretch Liners provide a longer electrophoresis path prior to 
fluorescence detection and increase the average number of usable bases to 500-600 
bp. 

10 INFORMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing 
lab have been developed. (For review see, for instance, Kerlavage et al.< 
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on 

15 System Sciences, IEEE Computer Society Press, Washington D. C, 585 (1993)) 
The system used to collect and assemble the sequence data was developed using the 
Sybase relational database management system and was designed to automate data 
flow wherever possible and to reduce user error. The database stores and 
correlates all information collected during the entire operation from template 

20 preparation to final analysis of the genome. Because the raw output of the ABI 373 
Sequencers was based on a Macintosh platform and the data management system 
chosen was based on a Unix platform, it was necessary to design and implement a 
variety of multi- user, client-server applications which allow the raw data as well as 
analysis results to flow seamlessly into the database with a minimum of user effort. 

25 

2„ Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and 
accurate assembly of thousands of sequence fragments was employed to generate 
contigs. The TIGR assembler simultaneously clusters and assembles fragments of 

30 the genome. In order to obtain the speed necessary to assemble more than I0 4 
fragments, the algorithm builds a hash table of 12 bp oligonucleotide subsequences 
to generate a list of potential sequence fragment overlaps. The number of potential 
overlaps for each fragment determines which fragments are likely to fall into 
repetitive elements. Beginning with a single seed sequence fragment. TIGR 

35 Assembler extends the. current comig by attempting to add the best matching 
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fragment based on oligonucleotide content. The contig and candidate fragment are 
aligned using a modified version of the Smith- Waterman algorithm which provides 
for optimal gapped alignments (Waterman, M. S., Methods in Enzymology 
164:165 (1988)). The contig is extended by the fragment only if strict criteria for 
5 the quality of the match are met. The match criteria-include the minimum length of 
overlap, the maximum length of an unmatched end, and the minimum percentage 
match. These criteria are automatically lowered by the algorithm in regions of 
minimal coverage' and raised in regions with a possible repetitive element. The 
number of potential overlaps for each fragment determines which fragments are 

10 likely to fall into repetitive elements. Fragments representing the boundaries of 
repetitive elements and potentially chimeric fragments are often rejected based on 
partial mismatches at the ends of alignments and excluded from the current contig. 
TIGR Assembler is designed to take advantage of clone size information coupled 
with sequencing from both ends of each template. It enforces the constraint that 

15 sequence fragments from two ends of the same template point toward one another 
in the contig and are located within a certain range of base pairs (definable for each 
clone based on the known clone size range for a given library). 

The process resulted in 391 contigs as represented by SEQ ID NOs: 1-391. 

20 3. Identifying Genes 

The predicted coding regions of the Streptococcus pneumoniae genome 
were initially defined with the program GeneMark. which finds ORFs using a 
probabilistic classification technique. The predicted coding region .sequences were 
used in searches against a database of all nucleotide sequences from GenBank 

25 (October, 1997), using the BLASTN search method to identify overlaps of 50 or 
more nucleotides with at least a 95% identity. Those ORFs with nucleotide 
sequence matches are shown in Table 1. The ORFs without such matches were 
translated to protein sequences and compared to a non-redundant database of 
known proteins generated by combining the Swiss-prot, PIR and GenPept 

30 databases. ORFs that matched a database protein with BLASTP probability less 
than or equal to 0.01 are shown in Table 2. The table also lists assigned functions 
based on the closest match in the databases. ORFs that did not match protein or 
nucleotide sequences in the databases at these levels are shown in Table 3. 
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ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to a Streptococcus pneumoniae 
Protein 

Substantially pure protein or polypeptide is isolated from the trans fee ted or 
5 transformed cells using any one of the methods known in the art. The protein can 
also be produced in a recombinant prokaryotic expression system, such as E. coli % 
or can be chemically synthesized. Concentration of protein in the final preparation 
is adjusted, for example, by concentration on an Amicon filter device, to the level 
of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can 
10 then be prepared as follows. 

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and 
isolated as described can be prepared from murine hybridomas according to the 

15 classical method of Kohler, C, and Milstein, C, Nature 256:495 (1975) or 
modifications of the methods thereof. Briefly, a mouse is repetitively inoculated 
with a few micrograms of the selected protein over a period of a few weeks. The 
mouse is then sacrificed, and the antibody producing cells of the spleen isolated. 
The spleen cells are fused by means of polyethylene glycol with mouse myeloma 

20 cells, and the excess unfused cells destroyed by growth of the system on selective 
media comprising aminopterin (HAT media). The successfully fused cells are 
diluted and aliquots of the dilution placed in wells of a microliter plate where 
growth of the culture is continued. Antibody-producing clones are identified by 
detection of antibody in the supernatant fluid of the wells by immunoassay 

25 procedures, such as ELISA, as originally described by Engvall, E., Meth. 
EnzymoL 70:419 (1980), and modified methods thereof. Selected positive clones 
can be expanded and their monoclonal antibody product harvested for use. Detailed 
procedures for monoclonal antibody production are described in Davis, L. et aL, 
Basic Methods in Molecular Biology, Elsevier, New York. Section 21-2 (1989). 

30 
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3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a 
single protein can be prepared by immunizing suitable animals with the expressed 
protein described above, which can be unmodified or modified to enhance 
5 irnmunogenicity. Effective polyclonal antibody production is affected by many 
factors related both to the antigen and the host species. For example, small 
molecules tend to be less immunogenic than others and may require the use of 
carriers and adjuvant. Also, host animals vary in response to site of inoculations 
and dose, with both inadequate or excessive doses of antigen resulting in low titer 
10 antisera. Small doses (ng level) of antigen administered at multiple intradermal 
sites appears to be most reliable. An effective immunization protocol for rabbits 
can be found in Vaitukaitis, J. et «/., J. Clin. Endocrinol. Metab. 33:988-991 
(1971). 

Booster injections can be given at regular intervals, and antiserum harvested 
15 when antibody titer thereof, as determined semi-quantitatively, for example, by 
double immunodiffusion in agar against known concentrations of the antigen, 
begins to fall. See. for example, Ouchterlony, O. et a/.. Chap. 19 in: Handbook of 
Experimental Immunology, Wier, D.. ed. Blackwell (1973). Plateau concentration 
of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12M). 

20 Affinity of the antisera for the antigen is determined by preparing competitive 
binding curves, as described, for example, by Fisher, D.. Chap. 42 in: Manual of 
Clinical Immunology, second edition. Rose and Friedman, eds., Amcr. Soc. For 
Microbiology, Washington, D. C. (1980) 

Antibody preparations prepared according to either protocol are useful in 

25 quantitative immunoassays which determine concentrations of antigen-bearing 
substances in biological samples: they are also used semi- quantitatively or 
qualitatively to identify the presence of antigen in a biological sample. In addition, 
antibodies are useful in various animal models of pneumococcal disease as a means 
of evaluating the protein used to make the antibody as a potential vaccine target or 

30 as a means of evaluating the antibody as a potential immunotherapeutic or 
immunoprophylactic reagent. 
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4. Preparation of PCR Primers and Amplification f DNA 

Various fragments of the Streptococcus pneumoniae genome, such as those 
of Tables 1-3 and SEQ ID NOS: 1-391 can be used, in accordance with the present 
invention, to prepare PCR primers for a variety of uses. The PCR primers are 
5 preferably at least 15 bases, and more preferably at least 18 bases in length. When 
selecting a primer sequence, it is preferred that the primer pairs have approximately 
the same G/C ratio, so that melting temperatures are approximately the same. The 
PCR primers and amplified DNA of this Example find use in the Examples that 
follow. 

10 

5. Gene expression from DNA Sequences Corresponding to 

ORFs 

A fragment of the Streptococcus pneumoniae genome provided in Tables I- 
3 is introduced into an expression vector using conventional technology. 

15 Techniques to transfer cloned sequences into expression vectors that direct protein 
translation in mammalian, yeast, insect or bacterial expression systems are well 
known in the art. Commercially available vectors and expression systems are 
available from a variety of suppliers including Stratagene (La Jolla, California), 
Promega (Madison, Wisconsin), and Invitrogen (San Diego. California). If 

20 desired, to enhance expression and facilitate proper protein folding, the codon 
context and codon pairing of the sequence may be optimized for the particular 
expression organism, as explained by Hatfield et «/., U. S. Patent No. 5,082,767, 
incorporated herein by this reference. 
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The following is provided as one exemplary method to generate 
polypeptide(s) from cloned ORFs of the Streptococcus pneumoniae genome 
fragment. Bacterial ORFs generally lack a poly A addition signal. The addition 
signal sequence can be added to the construct by, for example, splicing out the poly 
5 A addition sequence from pSG5 (Stratagene)-using Bgll and Sail restriction 
endonuclease enzymes and incorporating it into the mammalian expression vector 
pXTl (Stratagene) for use in eukaryotic expression systems. pXTl contains the 
LTRs and a portion of the gag gene of Moloney Murine Leukemia Vims. The 
positions of the LTRs in the construct allow efficient stable transfection. The 
10 vector includes the Herpes Simplex thymidine kinase promoter and the selectable 
neomycin gene. The Streptococcus pneumoniae DNA is obtained by PCR from the 
bacterial vector using oligonucleotide primers complementary to the Streptococcus 
pneumoniae DNA and containing restriction endonuclease sequences for PstI 
incorporated into the 5 f primer and BglH at the 5 % end of the corresponding 
15 Streptococcus pneumoniae DNA 3* primer, taking care to ensure that the 
Streptococcus pneumoniae DNA is positioned such that its followed with the poly 
A addition sequence. The purified fragment obtained from the resulting PCR 
reaction is digested with PstL blunt ended with an exonuclcasc, digested with 
BgUI, purified and Iigated to pXTl, now containing a poly A addition sequence 
20 and digested BgHI. 

The Iigated product is transfected into mouse NIH 3T3 cells using 
Lipofectin (Life Technologies, Inc., Grand Island, New York) under conditions 
outlined in the product specification. Positive transfectants are selected after 
growing the transfected cells in 600 ug/ml G4I8 (Sigma, St. Louis, Missouri). 
25 The protein is preferably released into the supernatant. However if the protein has 
membrane binding domains, the protein may additionally be retained within the cell 
or expression may be restricted to the cell surface. Since it may be necessary to 
purify and locate the transfected product, synthetic I5-mer peptides synthesized 
from the predicted Streptococcus pneumoniae DNA sequence are injected into mice 
30 to generate antibody to the polypeptide encoded by the Streptococcus pneumoniae 
DNA. 



QMcrwv^in. -win r»o« ooo« »o i - 



r 



WO 98/18931 PCT/US97/19588 

53 



Alternatively and if antibody production 4s-BOt possible, the Streptococcus 
pneumoniae DNA sequence is additionally incorporated into eukaryotic expression 
vectors and expressed as, for example, a globin fusion. Antibody to the giobin 
moiety then is used to purify the chimeric protein. Corresponding protease 
5 cleavage sites are engineered between the globin moiety and the polypeptide 
encoded by the Streptococcus pneumoniae DNA so that the latter may be freed 
from the formed by simple protease digestion. One useful expression vector for 
generating globin chimerics is pSG5 (Stratagene). This vector encodes a rabbit 
globin. Intron II of the rabbit globin gene facilitates splicing of the expressed 
10 transcript, and the polyadenylation signal incorporated into the construct increases 
the level of expression. These techniques are well known to those skilled in the art 
of molecular biology. Standard methods are published in methods texts such as 
Davis et a/., cited elsewhere herein, and many of the methods are available from the 
technical assistance representatives from Stratagene, Life Technologies, Inc., or 
15 Promega. Polypeptides of the invention also may be produced using in vitro 
translation systems such as in vitro ExpressTM Translation Kit (Stratagene). 

While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various 
changes in form and detail can be made without departing from the true scope of 
20 the invention. 

All patents, patent applications and publications referred to above are 
hereby incorporated by reference. 
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(1) GENERAL INFORMATION : 

<i> APPLICANT: Charles Kunsch 

Gil H. Choi 
Patrick S. Dillon 
Craig A. Rosen 
Steven C. Barash 
Michael R. Fannon 
Brian A. Dougherty 

(iij TITLE OF INVENTION: Streptococcus pneumoniae Polynucleotides and Sequences 

(iii) NUMBER OF SEQUENCES: 391 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: RocJcville 
<D> STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Oiskeete, 3.50 inch. 1.4Mb storage 

<B> COMPUTER: HP Veccra 486/33 

(C) OPERATING SYSTEM: MS DOS version 6.2 

(D) SOFTWARE: ASCII Text 



(vi) CURRENT APPLICATION DATA: 
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(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C> CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION: 

<A) NAME: Brookes, A- Ander* 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE /DOCKET NUMBER: PB340P1 

(vi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: <301> 309-8S12 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CCAAGCAAAA 


CCACCTACAG 


CTAAAGGAAC 


TTACGTAACA 


AACTTGACTA 


TCACAACTAC 


60 


TCAAGGTGTT 


GGTATCAAAG 


TTGACGTAAA 


CTCACTTTAA 


TCAGTACTTA 


AAGTAATGTA 


120 


AAAAAGTTGA 


AGACGCTATG 


TCTCAACTTT 


TTTTGATGTA 


CGACGGGCAT 


CTTGTATAGT 


180 


AGATGTGTAC 


TATTCTAGTT 


TCAATCTACT 


ATAGTAGCTC 


AGAAGTCGGT 


ACTTAAACGT 


240 


GCTATATCAA 


AACCAGTCCT 


TGAAAAACGT 


GGACTGGTTT 


CGTGTTTGGA 


TTATTACCTT 


300 


GAACGACATG 


CGTTAAAAGT 


TAGTTGAACC 


GCCCTATGCC 


CAACGCACGT 


ACGGTGGTGT 


360 


GAGAGGGGCT 


AGAGATTATC 


CCCTACTCGA 


TTTCGAAATC 


TAGTGGAATG 


AATCTGGAAT 


420 


AGTCCATCGA 


GCTTTCTAAT 


ACTCTTCCAA 


AATCTCTTCA 


AACCACGTCA 


ACGTCGCCTT 


480 


GCCGTGCGTA 


TGGTTACTGA 


CTTCGTCAGT 


TCTATCCACA 


ACCTCAAAAC 


AGTGTTTTGA 


540 


GCTGACTACG 


TCACTTCCAT 


CTACAACCTC 


AAAACAGTCT 


TTTGAGCAAC 


CTGCGGCTAG 


600 


TTTCCTAGTT 


TGCTCTTTGG 


TTTTCATTGA 


GTATAACACA 


TTGTTAGAAG 


TTGGTTTAAA 


660 


TTTCCTAATC 


AGTTTGTTCA 


CATTTACCTT 


CGATATATTA 


TATCCCATAG 


TTAAGGTTGG 


720 


TCATACAGAT 


GATTATAGTC 


ATGGAGCCCT 


AAAACTTACT 


GTTTCTTTAG 


TTGACAAAGA 


780 


TGCCATGAAA 


AAAATATTTG 


TAACTGTAAT 


AGGATATTTT 


GAAATAAATA 


TAGATGAAAA 


840 


TATCACCGAT 


ATTCTATACG 


TAAATGCTAC 


TGCTATTCTT 


TATCTTTATT 


TACGTTCAAT 


900 


TGTTTCAATA 


GTTTCGGCAA 


TTGATAGCAG 


TGAAGCAATG 


TTGCTACCTA 


TCATTAATGT 


960 


TTTAGAGTTA 


CTAGATAAAT 


CTCAACCTTT 


TGAAGAAGAA 


TAATTTATTA 


GCTCACTAAA 


1020 


TTGAGGGTAA 


GCAAAAGTAA 


AAGCAGTAAG 


AAAAATGTCT 


TGCATTATAC 


AGCAACCTTT 


1080 


TGGGAATGAG 


TGGATGGATT 


GAATAAAATT 


TGATTAAGAG 


TGGATGATTT 


ATCTGTAGAT 


1140 


TATTATTGGA 


CAGTTAGTCT 


TGAAGTAGTC 


TAAGAATTAG 


GTTATAATCA 


GTAGAAGCCT 


1200 


TGCTAATAAT 


GAGGAGGTTA 


GTTTATGTAT 


AGTAGACTGA 


ATCTAAAATA 


GTACGAAACA 


1260 


ATTGCTAAAA 


CATTTATAGA 


AATTAATTTT 


ACTTTCCCAA 


TCGATTTGTT 


CTCATCTTAT 


1320 


TTCAATCCGC 


TATATATTAT 


GGTATCGAAT 


CTTCATCAGA 


ATGATAAAAT 


TAATCAATTG 


1380 


ATATCTGATT 


ACAAACAGAA 


TATGAAAGCT 


TTTTATATCA 


CTATTGAAAA 


ATTTATACGA 


1440 
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GATGATGAAA 


GCCTTAAGTG 


TTATTTTATA 


AAGGTTATTT 


CAAGTCGTTC 


CAAGGTAACA 


1SOO 


AGTCTAGATC 


AGATTGAAGC 


TGATAAAACG 


ATACAAAGAA 


AATATTCAAG 


TGAGCTAAAA 


1560 


AAATTTATTG 


GATTTTATAA 


TGAGATTATT 


TGTGAGGAAA 


ATACTTTCCT 


ACATGTACGA 


1620 


AAGACGTGGT 


CGAGTTCGTT 


TACGTAGTCC 


ATGCGTGAGT 


TCATAATTCT 


CAGGGTATGC 


1680 


ACTTCTTTTT 


CATGAATGAG 


GTAAAAGAGC 


AGGTATTGTT 


TAGAGACAAT 


CATTCTGAGC 


1740 


ATATTTTCTC 


GATAGAGGGA 


GTATCCGATT 


TTATGATCAA 


AGTTAATACC 


GCCCTCTGGT 


1800 


GAGAAGATGA 


GTAGGTTGGT 


AATTTAAACT 


ATTAAACAGA 


ATTTTTGATT 


AAAACTATTA 


1860 


TTTCATCAGA 


GAAATCCTAA 


TTTCACAATC 


CATAGGCAAA 


CCCTTGCATT 


TCGTTTTTTA 


1920 


TTGGACTATA 


ATACCTTGGT 


ATAAAGCCTT 


CTGTAGTAAT 


AAAATGTAGA 


AGGTGTAGAA 


1980 


AGTAAGGATT 


TAGAATATTT 


GTAGTTAAAA 


ACACAATGTT 


GCTATTCCTT 


ACGATAGGGA 


2040 


CATAGATATG 


GCAATGATAG 


AAGTCGAACA 


TCTTCAGAAA 


AATTTTGTGA 


AGACTGTTAA 


2100 


GGAACCGGGC 


TTGAAGGGGG 


CTTTCCGCTC 


CTTTATTCAT 


CCTGAAAAGC 


AGACCTTTGA 


2160 


AGCGGTCAAG 


GATTTGACCT 


TTGAGGTTCC 


AAAAGGGCAG 


ATTTTAGGAT 


TTATCGGGGC 


2220 


AAATGGTGCT 


GGGAAGTCGA 


CAACCATTAA 


AATGCTGACA 


GGAATTTTGA 


AACCAACATC 


2280 


TGGTTTTTGT 


CCGATTAACG 


CCAACATTCC 


CCAGGACAAT 


CGGCAAGATT 


ATGTCAAAGA 


2340 


TATTGGCGTA 


GTCTTTGGAC 


AACGCACCCA 


GCTATGGTGG 


GATTTGGCTC 


TGCAAGAGAC 


2400 


CTACACTGTC 


TTAAAAGAGA 


TTTATGATGT 


GCCAGACTCG 


CTCTTTCATA 


AGCCTATGGA 


2460 


CTTTTTCAAT 


GAAGTCTTGG 


ATTTGAAGGA 


CTTTATCAAG 


GATCCCGTGC 


GGACTCTTTC 


2520 


ACTGGGACAA 


CCGATGCGGG 


CGGATATTGC 


GGCCTCCTTC 


CTCCACAATC 


CCAACCTTCT 


2580 


TTTTTTAGAT 


GAGCCGACCA 


TTGGTTTGGA 


CGTT T CGGTT 


AAGGATAATA 


TTCGTCGGGC 


2640 


AATTACTCAG 


ATCAATCAAG 


AGGAAGAAAC 


TACCATTCTT 


TTGACCACTC 


ACGATTTGAG 


2700 


TCATATTGAG 


CAACTTTGTG 


ATCGGATTTT 


CATGATTCAC 


AAGGGGCAAG 


AGATTTTTGA 


2760 


TGGAACCGTG 


AGCCAACTCA 


AGGACACCTT 


TGCTAAGATG 


AAGACTCTCT 


CTTTTGAACT 


2820 


GCTACCACGT 


CAAAGTCATC 


TCGTCTCTCA 


CTATCACGGT 


CTCTCTGATA 


TGACCATTGA 


2880 


TAGACAAGGA 


AACAGCCTCA 


ACATTGAATT 


TGATAGTTCT 


CGCTACCAGT 


CAGCTGACAT 


2940 


TATCAACCAA 




ATTTTGAAAT 


CCGCGATTTG 


AAGATGGTGG 


ATACCGATAT 


3000 


TGAGGATATT 


ATCCGTCGCT 


TCTACCGAAA 


GGAGCTCTAG 


GATGATCAAA 


TTGTGGAGAC 


3060 


GTTATAAACC 


CTTTATCAAT 


GCAGGCGTTC 


AGCAGTTGAT 


TACTTACCGA 


GTCAACTTTA 


3120 


TTCTCTATCG 


GATTGGCGAT 


GTCATGGGGG 


CTTTTGTGGC 


CTTTTATCTC 


TCGAAGGCTG 


3180 
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mMtwrHff* A *l m Vf* 


i. a UuL/uvjnu 


4 W A A X VJn 4 Iv 


152 
AGGGCTTCAG 


TATGGCGGAT 


ATCACCCTCT 


3240 


ft ft m*r* k & f 

ACATCATCAT 


GAGTTTTGTG 


WAA i\> 1 1L 


TGACTACATC 


CGATTCGTCC 


TTTATGATTG 


3300 




LAAbOA 1 uuv. 


1 UUA 1 1A1 WA 


TGCCFTTGTT 

A W W 4 A A ft* * « 


GCGACCAGTG 


CATTTTGCGG 


3360 


CCTCCTATCT 


TfTf"" a f*rri Aft 
1 1 rwAwwwAw 


V. 4 l\fVl A A WWA 


AGTGGTTGAT 


TTTTATCAGC 


GTTGGCCTTC 


3420 


CATTTTTAAG 


TGTCATTvTU 




TCATATCGCG 


TCAAGGTATT 


GTAGAGGTGC 


3480 


TAGGATTAAC 


TGTCATTTAT 


Li 1 1 1 1 Aw W A 


TAACGCTCGC 


CTATCTGATT 


AACTTTTTCT 


3S40 


TTAATATTTG 


WI 1 l\»oATTT 


TCAGCCTTTG 


AVw AAA /www* 


TCTTTGGGGT 


TCCAACCTAC 


3600 


TTAAGACTTC 


CATACTTwwwT 


TTTATGTCGG 


WWAw 1 A A Wrt A 


TCCCTTGGCA 

A WWW ft * 


TTTTTTOCAA 


3660 




AGATATTCTw 


TCCTTTTTGC 




CTTG ATT TAT 


ACTCCAGTTA 


3720 


AGGTTG ilTO 


Ullll A WA A w 


TGATCATTGT 


ft ft ft ftfV*ft^* 

TGGAAAATAC 


GATGCCAGTC 


AGATTCTXCA 


UUWAw a WW A * 


TTGCAGTTCT 


3780 


TCTGGCTCTT 


AGTGATGGTG 


GGATTGTCTC 


AGTTAATTTG 


flAAAfWVTTC 


PAGTCCTTTA 

W^^ft* A WW AAA 


3840 


TCACCATTCA 


AGGAGGTTAG 


TATGAAAAAA 


TATCAACGAA 


TCCATCTGAT 


TTTTATCAGA 


3900 


CAATACATCA 


AACAAATCAT 


GGAATATAAG 


GTAGATTTTG 


1 uu I 1 ww lu A 


W A A WWW^W A W 


3960 


TTTCTGACTC 


AAGGCTTGAA 


TCTCTTGTTT 


CTCAATGTV.A 


I w X 1 A wAAw A 


^* ATTi*PATTP 

iAl A>WWr%A AW 


4020 


CTAGAAGGCT 


GGACCTTTCA 


AGAGATAGCT 


TTCATTTATG 


WA 4 A 1 A WW 4 A 


GATTCCCAAG 


4080 


GGAATGGACC 


ATCTCTTTTT 


TGACAATCTC 


TCGGCACTAO 




ACTCCCAAAA 


4140 


GGGGAGTTTG 


ACAAGTATCT 


GACTCGTCCC 


*IH^» ftffu »#*^W* 

ATCAATCCTU 


TCTTTCACAT 


WW A MW A A w * 


4200 


ACCTTTCAGA 


arum*+ ft *&*0+ 0+0+P¥+Wt 

TTGATGCCTT 


^ ^ J^»f^^ ft ft 0+0^^+ 

GGGTGAACTC 


TTAuTUUU ly 


win* A 4 A f\ A A 

4 WW » Aft* * A IVJ 


GGGAACAACA 


4260 
4320 


GTGACCAGCA 


TTGTTTGGAC 


TCTTCCAAAA 


TTCCTGCTTT 




GCGACCTTGA 


TTTATACTTC 


n/MMn i (k ft m •ft/'* 

TCTTAAAATw 


<v*» A/"fVTA 
uUMLAut 


W 4 A 1 WWW W A A 


TTGGACTAAG 


4380 


CAGTCAGGCG 


0+0+ ft ft fwwn ft 

CCATGATTTA 


CATCTTCTAT 


ATGTTGAATG 


AW AAA WW 4 rmr% 


GTATCCGATT 


4440 


TCTATTTACA 


ATTCTCTTCT 


TCGTTGGTTG 


ATTAGCTTTA 


A WW A WWW A a ft 


CGCCTTTACA 


4500 


GCCTACTATC 


LnuV. i Aww 1 A 


^p^ph*^tp^p a aa^ 

1 4 iV>i A ALAU 


CAAAAGfSATG 


TGTTCTTTAA 


CGTAGGAGGT 


4560 


TTGATGTTGA 


TTTCTCTGGT 




ATTTCCCTTA 


AACTTTGGGA 


M%ft 0+0+0+)0+*B+miL 

TAAGGGCTTA 


a. eon 


GATTCCTACG 


AAAGTGCGGG 


TTCGTAAAAG 


CTAAAGTAAG 


ACTAAAATCA 


AGAAAGAAAC 


4680 


TTATGATGTT 


TCTAATTGAA 


GAAGTCAAGG 


ATCAAAATCA 


AAAAAAGGCA 


GTTGTCGCTG 


4740 


AGGTTTTGAA 


GGATTTGCCA 


GAATGGTTTG 


GAATCCCAGA 


AAGCACACAA 


GCCTATATAG 


4800 


AAGGAACCAC 


GACACTGCAA 


GTTTGGACCG 


CCTATCAGGA 


GAGTGATTTG 


ACTAGATTTG 


4860 


TAAGCTTATC 


CTATTCCAGT 


GAAGATTGTG 


CAGAGATTGA 


TTGTCTCGGC 


GTAAAAAAGC 


4920 


TTATCAAGGT 


AGAAAAA7TG 


GGAGCCAATT 


GCTTGCTACT 


TTAGAGAGTG 


AAGCTCGTAA 


4980 
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AAAAGTTCCT TATCTGCAGC TCAAAACAGT GGCAGAAGGT TCTAATAAAG ATTATGATCG 5040 

AACAAATGAC TTTTATCGAG GTCTTGGCTT TAAAAAGTTA GAGATTTTTC CTCAACTATG 5100 

GAATCCGCAA AATCCTTGTC AGATTTTCAT TAAAAAGCTT GAATAATATT ACTTGACATC 5160 

TATTCTCAGA CTGCTATACT CTAAGTGTAA TCGCCGATTT AGCTTAGTTG GTAGAGCAAG 5220 

GCACTCGTAA AGCCTAGCTT AT AGGT AG AT AAACGACTCA GCATTTGAAA AAATAGATAG 5280 

G TAG AAGAT A ACCCTTAAGC CTTACTCTTA GCGGTTATTT ATATTGTTTA ATACCGCTAA 5340 

TATTTTATCA ATTATGCCTG TTTTCGT G TT TCTGGTACTT CTTCAAGTTT ATTGCTACTA 5400 

TTTTT G ATGG TATGAATGTG CTTATAATGT ATCCCGGTTA ACGAAAGTTT TGGACTTATA 5460 

CTCTTCGAAA ATCTCTTCAA ACCACGTCAA CGTCCCCTTG CCCTGCCTAT GGTTATGACT 5520 

TCGTCACTTC TATCCACAAC CTCAAAACAC T C TTTT G AGT GACTACGTCA GTTCCATCTA 5580 

CAACCTCAAA ACACTGTTTT GCCCAATCTG CGGCTAGTTT CCTAG 5625 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



CTCTCCAGCT 


TTCCTTGCGA 


GTTGGCCATG 




AAGAACTCTA 


AAAATATCTC 


60 


CAATAAAACG 


CATCGCTCTC 


TCCTATCTCG 


TTTCTCTGTG 


TGTAGTGTAC 


TTGCCACAAT 


120 


GCTTACAAAA 


TTTATTTACT 


TCTAGTCGTG 


TAGCCTTGAC 


GTTTCCGCTG 


ATCTTGATTG 


180 


AATAGTTTCT 


CGAACCACAA 


ACCGCACAAG 


CTAGGCTTGC 


TTTTTTTAGT 


GCCATAACGC 


240 


CTCCATCTTA 


TCCATTATAA 


CAAGAAAGCT 


AGGCTTTGAC 


AAGCATCTTA 


GCGAAATAGA 


300 


TTGACTATCC 


AATCCCATAT 


TGTTTGAGCC 


TTTTCCTTAA 


TCTTCGCATC 


TGAGATAGCC 


360 


CGGCTAGCCT 


CATCTACTAG 


ACTTTGCGCA 


CGCCCTCGAA 


TATCAGACAA 


ATTATCATCT 


420 


CTCTGGCTAT 


TATCATTGGT 


TTGTACTTGT 


CTTTTTGTAT 


TGGCTGGTGC 


AATTCCATTT 


480 


TGCTTATAAG 


CATTTTCAAC 


CGTAAAGGTA 


CTTCCTGGCG 


TATAAGGTAA 


AATGCTATTG 


540 


GCAATGTTTC 


TAAAGACATG 


AGCTGCACCG 


TTTGAAGTAG 


AGCCAGCTAG 


ATAGTGGTTT 


600 


TCATCAGTGG 


TCGCAAAGCC 


AAGCCAGTGG 


CTAATCACTA 


CATCCGGAGT 


ATAACCAATT 


660 


ACCCACTGGT 


CACTTGTGTA 


CTCCGGATTG 


AAAACTGCTT 


CACTTGTTCC 


AGTTTTCCCT 


720 
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GCCATGACAT AGTCTGCAGG CGATGAACTA ATACCGGTAC CGTTGGTGAA ACTCCCCAAC 780 

ATCATACTGG TCATCTTGTC AGCTACAGAC TTATCAATCA CCCGTTTTTG TGAATTTTTA 840 

TCACTCCCAA TAACTTGTCC ACTAGCATTT TCAATTCTAC TAATAAAATG AGCTTCAGGC 900 

ATTAAACCTT CATTTGCAAA GGCGGCGTAT GCTTGAGCCA TTTGAACAGG CTTGCTTTCA 960 

ACACCGCTTC CCAACGCGAC ACCAAGAACA CGGTCGACCT TTTCCATGTT GAGTCCGAAT 1020 

TTTTCGCCTG CCTCAAAAGC CTTGTCGACA CCCAAATCAT TAACAGTCCC AACAGCAGGT 1080 

ACATTAACCG ATTCTGCCAA GGCTTGATAC ATAGGAACTT CTCGACTCCT TTTGATCCCT 1140 

CCATAGTTAT CAACCTTATA CCTGTCATAC TGCATGCTAT GGTTATCCAA CTGCTTATTC 1200 

AAAGCCCAGC TTGCTTCAAC TGCTGGCGTA TAAACAACTA AAGCCTTAAT TGTAGAACCA 1260 

GGACTACGCT TTGATTGGGT TGCATAGTTG AAATTCCGCA ATCCACTTTT ATCATTGTCA 1320 

GCAACTTGAC CGACAACTCC ACGAACTCCC CCTGTTTTCG GTTCCAGGGC TACACTTCCT 1380 

GATTGAGCAA ACCTTCCATC CTCTGCCCTC CGAAATACCG ATCTCTTTTC ATAAACAATC 1440 

TGC AT ATTTG CTTGGTAGTT TTGCTCCAGC TCT G TGTAAA TGCGGTAGCC ATTATTGACA 1500 

ATCTCTTCCT CTGTTAGATT ATACTTGGAA ACAGCTTCAT TAACCACCGC ATCAAAATAA 1560 

GAGGGGTAAC GGTAATCTGA CATTTTTCCT TCATACTTAT CGTGCAATTC CGAAGTCATA 162 0 

TCAACTTCAG CAGCTTTGGT TTCTTGGTTT TTATCAATAT ATCCTGCTCC AACCATATTC 1680 

TGCAAGACAG TATCGCGCCG ATTAGTAGAA TCTTCTACGC AATTCAAGGG ATTATACAGT 1740 

TCCCGCCCCT TGAGCATCCC TGCCAGAGTC GCAGCTTGAT CCACACTCAC TTCTGATGCA 1800 

GAAACTCCAA AGTATTTCTT ACTCGCATCT TCTACACCCC ACACACCATT TCCAAAATAA 1860 

GC C TTGTTAA GGTACATGGT TAGAATTTGC TCCTTACTAT ATTTTTTGCT TAATTCTAAG 1920 

CCAAGGAAAA ATTCTTTCCC TTTTCTCTCA ACAGTTTGAT CCTGCGATAA ATAGGC G TTT 1980 

TTAGCCAGCT GTTGGGTAAT GGTAGAGCCA CCACCTGAAC GTCCAGCAGT GACAATAGCC 2040 

AAGAAAAAAC GGCCATAGTT AATCCCGTCA TTTTTATAGA AAGAACGGTC TTCTGTCCCA 2100 

ATAACAGCAT TCTGCAAGTT TTTACTGATG TCAGTCAGCT CAACATAGCT TCCC T TTTGA. 2160 

CCAGACAAGG CACCAGCCTC TTTTTCTTCA CGGTCAAAAA TAAGAGTCCG AGTTTTCAAG 2220 

GCATTTTGCA AATCATTGAC ATTGGTCGAC TTGCCTACAG CAAACAAATA GATTCCAACT 2280 

AGCAAGCCTG CACTCAAACC TAGTATAAGG ATAATCTTTG TTAGATGATA ACCACGCCAG 2340 

AATTTTCGAA TCGGACCTAC TTGGGCTAAT TTTTTTCGAT CACTACCAGA GCGACGTAAG 2400 

ATAGTAGAAT CAGAGTCCTC TAGTTCACTT GTTTCTTTTT TAAAAAGAGA AACAAATTTC 24 60 

TCAAATAATT TATCTAATTT CATGCGTTTA TTTTATCATC TTCATCATAG GAAGACAAGA 2520 
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ATTTAGCTAT 


TTCCTATCCA 


AATAGGGCTT 


TTTTTGTTAC 


An I r\ i V. lu I A 


1UV.AAI t^_Al> 


2580 


ATTTACATTA 


CCCGCCTCTC 


TACCTCAAAT 


GACAGTAAAG 






2640 


CCTCATCCCT 


AGAAAAATCC 


CTCATTiTlT 


GAGAATCAAG 


AnALA 1AH 1 


1 uft 1 wA 


2700 


AGAAGAAGTC 


CACTGGAAGG 


AAATCGTAAA 


TCCTGGAGAT 


GTTTGCCAGT 


TGACTTTTGA 




CGAGGAAGAT 


TATTCCCAAA 


AGACGATCCC 


TTGGGGCAAC 


CCAGACTTAG 


TyCAGCAACT 




TTATCAACAT 


CAACACTTGA 


TTATTGTAAA 


CAAACCAGAG 


GGGATGAAAA 


CCCATGGTAA 




TCAACCAAAC 


GAAATTGCCC 


TTCTTAACCA 


TGTCAGTACC 


TATGTTGGCC 


AAACCTGCTA 


2940 


TGTCGTTCAT 


CGTCTGGACA 


TGGAAACCAG 


TGGCTTAGTT 


CTCTTTGCCA 


AAAATCCTTT 


^ AAA 

3000 


TATCCTGCCC 


ATTCTCAATC 


GCTTATTGGA 


GAAAAAAGAG 


ATTTCTAGAG 


AATATTGGGC 


3060 


TCTAGTTGAT 


GGAAATATCA 


ACAGAAAAGA 


ACTTGTTTTC 


AGAGACAAAA 


TTGGACCTGA 


3120 


TCGCCATGAT 


CGTAGAAAAA 


GAATAGTTGA 


TGCAAAAAAT 


GGGCAATATG 


CTGAAACGCA 


3180 


TGTAAGCAGA 


TTAAAGCAAT 


TCTCAAACAA 


GACTTCCTTG 


GCTCATTGCA 


AGCTAAAGAC 


3240 


AGGGCGAACC 


CATCAGATTC 


GTGTGCACCT 


TTCGCATCAT 


AATCTTCCTA 


TCCTGGGAGA 


3300 


CCCTCTCTAT 


AATAGTAAAT 


CAAAGACAAG 


CCGGCTTATG 


CTTCATGCCT 


TCCGACTTTC 


3360 


CTTTACCCAC 


CCACTTACTT 


TAGAGAAGCT 


AACTTTCACT 


ACCCTTTCAA 


ATACATTTGA 


3420 


AAAAGAATTA 


AAAAAGAATG 


GATGATCGTG 


TCATCCATTT 


TTCCATATAA 


AAAAGCAAGA 


3480 


CCACAAACCC 


TTGCTTTCTA 


TCAACTCAAG 


AATTATTTAG 


CAATTTTTGC 


GAAGTATTCA 


3S40 


AGAGTACGAA 


CAAGTTCTGC 


AGTGTATGAC 


ATTTCUTTGT 


CGTACCATGA 


TACAACTTTA 


3600 


ACCAATTGTT 


TACCGTCAAC 


GTCAAGAACT 


TTAGTTTGAG 


TTGCGTCAAA 


CAATGAACCG 




TAAGACATAC 


CTACGATATC 


TGAAGATACG 


ATTGGATCTT 


CTGTGTAACC 


GTATGATTCG 


3 /2U 


TTTCAAGCTG 


CTTTCATAGC 


TGCGTTCACT 


TCATCAACAG 


TAACGTTCTT 


i TwAAuAAtT 


J ' OU 


GCTACCAATT 


CAGTAACTGA 


TCCAGTTGGA 


CTTGGAACCC 


GTTGTGCAGA 


TCCGTCAACT 


3840 


TTACCATTCA 


ATTCTGGCAT 


TACAAGACCG 


ATAGCTTTTG 


CAGCACCAGT 


TGAGTTAGGA 


3900 


ACGATGTTTG 


CAGCACCAGC 


GCGAGCACGG 


CGAAGGTCAC 


CACCACGGTG 


TGGTCCGTCA 


3960 


AGGATCATTT 


GGTCACCAGT 


GTAAGCCTGG 


ATAGTAGTCA 


TCAATCCTTC 


AACAACACCA 


4020 


AAGTTGTCTT 


GAAGAGCTTT 


AGCCATTGGA 


GCCAAGCAGT 


TTGTAGTACA 


TGAAGCACCT 


4080 


GAGATAACTG 


TTTCAGTACC 


GTCAAGAACG 


TCCTGGTTAG 


TGTTGAATAC 


AACTGTTTTA 


4140 


ACGTCCTTTC 


CACCAGGAGC 


AGTGATAACA 


ACTTTTTTAG 


CTCCACCTTT 


AAGGTGTTTT 


4200 


TCAGCTGCTT 


CTTTCTTAGC 


AAAGAAACCA 


GTAGCTTCAA 


GAACGATTTC 


TACACCGTCA 


4260 
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GTAGCCCACT CGATTTGTTC TGGATCACCT TCACCAGAAA CWGATGAA TTTACCGTTA 
ACTTCAAATC CACCTTCTTT AACTTCAACA GTACCGTCGA AACGACCTTG AGTTGTCTCG 
TATTTCAACA AGTGTGCAAG CATAACTGGA TCTGTAAGGT CGTTCATGCC TGTAACTTCA 

ACACcrrcTA ccrrrrGGAT acgacggaaa gcaagacgac cgatacctcc gaaaccgtta 

ATACCAACTT TAACTACCAT TAGTCATTTC CTCCTTATGA AAATCATGAA ATTTTTATTG 
TGAAAAGAGT AACTTGAATC ACTACAAATC ACCrrTCAAC AAACCTATTA tacaactatt 
TGACTTCAAT TGCAAGTATG GCCATTGTTT TTCTATGTTA GTTTCTTTTT AAGACTGTAA 
ACCAAGGAAT CCCTTACTAT TCATAGCATA ACGATTCTAT AGGATCCATT TTACTAATCT 
TACGCGCCGG GAAGTAGGCT GAGACATAAC CAAGTAATAC AGCGAAAACT ACAGTTCCTA 
AAACAGATAA AAGATTTAAT TTAAAAACCT TAGTGATGGA TGGGTAAAAG TGACTTACAA 
TCGCATTCCC CAAACTTCCC ACCCCTTGTG CAACCAAAAA TGCCAGCAGC AAGGCCATCC 
CTACAATCCA GATAGCCTCG TAAATAAAAA TTCCTTTGAC ATCACGATTC TGATAACCAA 
CTGCTTTCAT GACACCTATT TCCTTCGAAC CTTGCATGAT ATTGATGTAA ATAATGATAC 
CAATCATAAC CGCTGCTACC ACAATAGCTT GTGATGAAAG CACAATCAAT AATCCCTGAA 
TAACACGAAT AAAGGTAATC ACAATATCAA GAACTCTCTG TTGAGAAAGC ACAGTATACT 

TCTTArrrrr ctgtaattct t ctgtt act a cttttctctg tgatggatct ttgagttcca 

AGATAAAATA AGATACAGCT TTCGTAAATC CAGCCTCTTT CAAAATCGTT TCCATTTGAT 

GAGACACCAT caaactgttg ctctcctcca tgtcatcttc atcattgatt ACACGTACAA 

TCTTCGTTTG AAATTGAGCA ATCTTACTAG TTTCGGCAGC ACTTTCTACA ATCCTGGCTG 
AGACTGATTT GCCAATAAGA TCATTACCTG TCAAATTTTT TCCTGTCTGT TCATTCCAAT 
TTTTTAGTAA ACTGCTTGGA ATCGTTAATC CCTGTTCATT TGTATCAGTA TAGAGCGATC 
CAGCCAACAC TTTGTCCGTC T CATT ATT AC TAACAGAGAT ACTTGTATCA TCATAAAGAC 
TCACTACTTG ACCATAAGAA GGCATCCTTT CACTCAGATC CATTTCTTGC CCATCTATAG 
TAATATTTGA CATGTTCATC CCAAAAGGAC TCTCCAAATA TTTAATAGCT TCTTTCCCAA 
CTGTATCCGT GATATATAGT CAATTGAAAC AAGAGCAGGA TAAAAAAGCC TCGTAAAAGC 
TATTGCAACT TCGTAATACC TTTTTGAGGT GCTTTTTGAT ATGAGCCCAT GTTTTCTCAA 
TAGGATTGTA CTCAGGCGAG TAGGGAGGAA GAGGTAAAAG TTTATGCCCA AACTCTTCGC 
ATAAAAGTTC TAGCTTCCCC ATTCTATGGA ATCTTACATT ATCCATAATA ATAACCGATG 
GTGTGTTTAA TGTTGGTAAG AGAAAATTCT GAAACCAAGC TTCAAAAAAG TCGCTCCTCA 
TCGTCTCTTC GTAAGTCATT GGAGCGATTA ATT C AC CATT TGTTAGACCT GCAACCAAAG 



4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5620 

58B0 

5940 

6000 

6060 
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ATATCTTCTT 


CCAGATACTT 


TGCCTCTTAT 


TAATTGACCT 


TTTAATGAGC 


6120 




TCGATAAAAA 


TAAGTATCGA 


ATCCTGTTTC 


GTCAATCTAA 


ACAGGTCCTA 


61B0 


GGTGCTTTAA 


ACTATTAAAA 


TTCTTAAGAA 


ATAAGGCT AC 


TTTTTCTGGG 


TCTTGTTCAT 


6240 


AO 1 nuu lu iAJ 


GTTCTTTTTT 


CGAGTGTAGC 


CCATACCTTT 


GAGCGTATAG 


TGGATGGTAG 


6300 


TTGGATviAl-A 


GCCAAATTCA 


GAAGCTATTT 


CAGTCAAATA 


AGCGTCTGGA ttctcagtaa 


6360 


CAT AO lllli 


AAGTCTATCT 


CTATCAACCT 


TTCTTGGTTT 


TATTCCTTTT 


ACTTGGTGGT 


6420 


TTAGCTCTCu 


TGTTTTCTCT 


TTTAGCTTTA 


ACCAGCCATA 


AATGGTATTA CGTGAGATTT 


6480 


GGAAAACGTG 


TGATGCTTCT GTTATACTAC 


CTGTTCCCTC 


ACAATAAGAG 


AGAACTTTTT 


6S40 


TACGAAAATC 


TATTGAATAT GCCATAAAAA 


GATTATACCA 


CATTGTGTAC 


TATTTTTGGT 


6600 


TCATTTTACT 


ATATTTGAAG 


AGGCCTTTAA 


ACTATCTGAC 


ATAAAACTCG 


TTCTAGAGGA 


6660 


AAGACATCCT 


TTAAAAAGTT 


AGTTTATTTT 


ACAACTTAGA 


CATCAAGGTA 


GGTTAACCCC 


6720 


TTCATGGAAA 


AATCAAGACT 


CTTAGCACTA 


TCGGTTAAAC 


1 #\w>V-/\V_ 1 \J\**\ 


GACGTAATCA 


6780 


ATCGCTAAAC 


CACGAAAACG 


GCTAATAGTG 


GTCATATCAA 


TATTTCCAGA 


ACATTCAATC 


6840 


CGAGAACGTC 


CTGCAATTAG 


GGTAATGGCC 


TGTTCAATCT 


GTTCCAATGA 


CATATTATCC 


6900 


AACATGATAA 


TATCAGCACC 


CGCCGCCCCA 


GCTTCTTCGG 


CAGCAGCAAG 


GCTTTCCACT 


6960 


ft /MWI^^ ft 

TCCACCTCGA 


CCATTTTCAC 


AAAAGGGGCA 


TAGGCACCCG 


CTTGAGCAAT 


TGCCTTTTGA 


7020 


ACACTACCTA 


CTGCCGCAAT 


GTGATTGTCT 


TTTAGCAGGA 


TAGCATCTGA 


TAAATTaAAC 


7080 


CGATGATTAT 


AGCCACCGCC 


AACTCTCACG 


CCATATTTCT 


CAAAAAGACG 


TAAATTAGGA 


7140 


GTACTTTTTC 


GAGTATCAAA 


TACCTTAATG 


UiA 1 La • www 


CTAAGGCTTC 


TACATAAGCA 


7200 


GCTGTCATCG 


AAGCAATCCC 


TGATaaATGT 


TCTAAAAAAT 


TCAAGGCAAC 


GCGTTCACAT 


7260 


GTTAAGAGAC 


TTCTCACCGA 


GCCTATGATT 


TCTAAAACCA 


AATCGCCACT 


AGTCAAACGA 


7320 


TCCCCATCCT 


TAAATTCATC 


AGGATTCTGG 


AACGTCACCT 


CGCCATCAAA 


TAGGGTAAAA 


7380 


ACCCTTTGAA 


AAACGGTTAG 


CCCCGCTAAA 


ACACCAGCTT 


CCTTGGCAAA 


AAGCGACACC 


7440 


TTGCCTTGGC 


CATCATGATC 


AAAAATGGCA 


TTGGTACTGT 


AATCTTCGGA 


ATGaACATCT 


7500 


TCTCGCAAGG 


CTGCTTTCAA 


TGTATCATCT 


ATTTGAAAAG 


CGGTTAAATC 


AGTTGAAATG 


7560 


ATTGACATCA 


C 










7571 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 26385 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : double 
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(Dt TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


3: 






111 X /Vv# 1 w 


GCTTAAATTC 


TTCAGGAAAA 


TCAGGCGTAT 


CTAAAAGTCG 




60 


GTTTCATCTA 


TATAAAGACT 


TCCTGCTCCC 


CCTACAACTA CAAAACGTGT 


CTGTGTTCCA 


120 


CCAAGAAGCT 


GATTAAATAG 


TTCGATTGAT 


TTGCTGTGGA 


GCGGTAGCGT 


ATCTGGTGTA 


180 


TAAGCACCAA 


ACGCTGAAAT 


AACACCATCA 


AATCCAGTAA 


CATCATCTTT 


TGTCAACTCA 


240 


AATAAATCTT 


TTTTAATAAT 


ACACTCAGCT 


TGACTTTTGT 


TTTCAGAACG 


AACAATAGCC 


300 


CTTACTTCAT 


CTCCTCGTTT 


GACTGCTTCT 


TCAACAATTG 


CTTTCCCCGC 


TTGTCCATTT 


360 


GCTGCAATAA 


CTGCTAGTTT 


CATTTTTTAT 


ACCTCTCTTG 


TTGTAATTAT 


TTTAGTTACA 


420 


GAAATTGTGA 


CACTCTTAAT 


AATCAATGTC 


AATAGTCTTC 


CTTAATTATT 


ATCAAAATAT 


480 


TTCTACCAAG 


AAAACTAACC 


ATGATTCTAG 


TGAAAAAAAA 


TCTTCTTTGT 


CAACAAATTT 


540 


ACTTTCTTGT 


TTTAAACATG 


CTATAATAAT 


CATAGCAAGA 


GATCTAAGTT 


CTCTGTTTTT 


600 


TTAAAACGAG 


GTGATTATCA 


TGCGTAGATT 


CTATTCCCAT 


CTCCCCTACT 


ATCTGGTCAT 


660 


ATTATTCTTT 


TATTGGCCAC 


TTTATGAGTT 


GTTCTTACTA 


GTTGTTTCTG 


ACCCCCTTAC 


720 


ACTCAACGGA 


CTCTATATAA 


ACAATCTTCT 


CTTCTTTACA 


CCTCTGCTAA 


TCTTGATTGT 


780 


ATCGTTACTC 


TATAGCTACC 


GTTTCCCTTT 


CTCACTTTCA 


TGGTTACTTG 


GTAACGGACT 


840 


GCTCTTTTAC 


TTTACTATCA 


TAACCTTTGG 


TGACTTTATA 


CTAATTTACT 


TGCTAATCTA 


900 


TGAAACAGTT 


GCTCTGGTCG 


GCATGGATTC 


TGGTATTAGC 


ATCAAGCATA 


TTCTACAAAA 


960 


AATGAAAAAC 


AAAAAACTTT 


CACAAAATCC 


TTGAAAAATC 


TCACAATCAT 


GCTATAATAA 


1020 


TCCATAGAGA 


CAAGTCACTT 




TACTAGACAG 


TCCGTGGTTG 


CTGGAAACGC 


1080 


ATAGGAACTC 


TAAACTGATA 


CTACTCTTGA 


GTTTTTTATG 


AAAACATAAA 


ACGGTGGCCA 


1140 


CGTTAGAGCC 


CATCAGAGGT 


GTCCCTCTCT 


TTTGAGGTAC 


ATAAATGAAG 


CTCCAACCrvC 


1200 


GTTGCGACGT 


CCTTTCGAGG 


ATCTCGCATT 


TTTTTATTAG 


GATACTAATT 


ATGGAGTTGC 


1260 


AACAATTAGT 


GGAGCGCAGT 


TGGGCAATCC 


GACAAGCTTA 


TCACGAACTG 


GAACTTAAGC 


1320 


ATCATGATTC 


CAAGTGGACG 


GTAGAAGAAG 




TTTATCTAAT 


GATATTCGAA 


1380 


ATTTCCAACG 


ACTGGTGATG 


ACAAAGCAAG 


GACGCTACTA 


TGATGAAACA 


CCCTACACAC 


1440 


TGGAACAAAA 


ACTTTCAGAA 


AATATCTGGT 


GGCTATTAGA 


ACTTTCTCAA 


CGTTTGGATA 


1500 


TAGACATTCT 


GACGGAAATG 


GAAAACTTCC 


TCTCTGATAA 


AGAAAAGCAA 


TTGAACGTTA 


1560 


GGACTTGGAA 


CTAGTCTGCT 


GATAAAAAAT 


CAATGCTTAG 


AAACTATGAA 


ATAATAAAAA 


1620 
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AGGAGAAt-AT 


I.A1AjA4 


A A 4 /Vv. * A A UV< 


CAGATGGCGC 


TGTTCGTGAA 


TTCGAATCTC 


1680 


GCGTAACAAC 


TTTTGAAATT 


GCCCAATCTA 


TCAGCAATTC 


CCTAGCTAAA 


AAAGCCTTGG 


1740 


CTGGTAAATT 


CAACCGCAAA 


CTCATCGACA 




TATCACTCAA 


GATGGAAGCA 


1800 


TCGAAATTCT 


GACACCTGAT 


CACCAAGATG 




^44 UVW 4 WiW 


TCAGCAGCTC 


1860 


ACTTGTTCGC 


CCAACCAGCT 


CGTCGTCTTT 




TCACTTGGGA 


GTTGGTCCAG 


1920 


CCATCGAAGA 


TGGTTTCTAC 


TACGATACTG 




A VJ»-» 4 WWW 4 v> 


TCTAACCAAC 


1980 


ACCTTCCTCG 


TATCCAAGAA 


GAAATGCAAA 


AAA i v»Ij I V.#v\ 


nunAnnU 1 1 V* 




2040 


GTGAAGAAGT 


GACTAAAGAC 


GAGGCACGTG 






TACAAGTTGC 


2100 


aattgattga 


AGAACACTCA GAAGACGAAG 




TATTTATTfiT 
1 n 11- lAt V>u 1 


U\(J\A) 1 0#w% 4 


2160 


ATGTAGACCT 


rrwrflTCCt 








ATCTTCCACC 


2 250 

444V 


TTCTCCATGT 




TACTGGCGTG 




tnnUAi 1 n lu 




2280 


TCTACGGTAC 


A 4 ^ W * ■ • 


GACAAGAAAG 


ACTTGAAAAA 


trr alu i4 laa 




2140 


AAGCTAAGGA 


ACGTGACCAC 


CGTAAACTTG 


GTAAAGAGCT 


TGACCTCTTT 


ATGATTTCAC 


"> 4O0 


AAGAAGTGGG 


ACAAGGTTTG 


CCATTCTGGT 




1 UwunV. 1 A 1\. 


rrrrrTfiAAT 


2460 


TGGAACGCTA 


CATCGTAAAC 


AAAGAGTTGG 


TTTCTGGCTA 


LLnALALw 1U 


1 ntttL 4 LWAV. 


2520 


CACTTGCTTC 


TGTTGAGCTT 


TACAAGACTT 


CTGGTCACTG 


GGATCAl i AC 


t_AAOAAA*A\»A 


253Q 


TGTTCCCAAC 


CATGGACATC 


GGTGACGCGC 


AAUAAi I lul 




ATCAACTCTC 

A 4 O/WW » V* I \» 


2640 


CGCACCACAT 


CCAAGTTTTC 


AAACACCATG 


TTCACTCTTA 




CCAATCCGTA 


2700 


TCGCTGAAAT 


CCGTATGATG 


CACCGTTACG 






r^rrTTCAAC 

VAJw 4 4 ^nn^ 


2760 


GTGTACGTGA 


AATGTCACTC 


AACGACCGTC 




TACTCCAGAA 


CAAATCCAAG 


2820 


AAGAATTCCA 


ACGTGCCCTT 


CAGTTGATTA 


TCGATGTTTA 

4 WWl 4 w 4 4 4 n 


TGAAGACTTC 


AACTTGACTG 


2880 


ACTACCGCTT 


CCGCCTCTCT 


CTTCGTGACC 


CTCAAGATAC 


TCATAAGTAC 


TTTGATAACG 


2940 


ATGAGATGTG 


GGAAAATGCC 


CAAACCATGC 


TTCGTGCAGC 


TCTTGATGAA 


ATGGGCGTGG 


3000 


ACTACTTTGA 


AGCCGAAGGT 


GAAGCAGCCT 


TCTACGGACC 


AAAATTGGAT 


ATCCAGATTA 


3060 


AAACTGCCCT 


TGGAAAAGAA 


GAAACCCTTT 


CTACTATCCA 


ACTTGATTTC 


TTGTTGCCAG 


3120 


AACGCTTCGA 


CCTCAAATAC 


ATCGGACCTG 


ATGCCGAAGA 


TCACCCTCCA 


GTCATGATCC 


3180 


ACCGTGGGGT 


TATCTCAACT 


ATGGAACGCT 


TCACAGCTAT 


CTTGATTGAG 


AACTACAAGG 


3240 


GGGCCTTCCC 


AACATGGCTG 


CCACCACACC 


AAGTAACCCT 


CATCCCACTA 


TCTAACGAAA 


3300 


AACACGTGGA 


CTACGCTTGG 


GAAGTGGCCA 


AGAAACTCCG 


TGACCCCGGT 


GTCCGTGCAG 


3360 




RNSnOClO: <WO RR1B931A2 I > 
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ACGTAGATGA 


GCGCAATGAA 


AAAATGCAGT 


TCAAGATCCG 


TGCTTCACAA 


ACCAGCAAGA 




TTCCTTACCA 


ATTAATTGTT 


GGAGACAAAC 


AAATGGAAGA 


CGAAACAGTC 


AACGTTCGTC 


J4BU 


CCTACGGCCA 


AAAAGAAACA 


CAAACTGTCT 


CAGTTGATAA 


TTTTGTTCAA 


GCTATCCTAG 


J540 


CTGATATCCC 


CAACAAATCA 


CGCGTTGAGA 


AATAAGAGTC 


TAGCATAAAA 


CCCTCCAATC 


3600 


TGGAGGCTTT 


TTCTCATCTA 


TTTTTACTCA 


AGGACTAAGT 


TCACTTGAGC 


AAACTGAATC 


3660 


CGCACTGTCG 


TTCCTTTTCC 


GACCTCAGAC 


TCGATACGAA 


TCTGGTGCCC 


CAGTTCTTCA 


3720 


GAAATTTTCT 


TACATAGATA 


AAGGCCAAGT 


CCAGAGGACT 


GCTGGGTCAA 


ACGGCCATTG 


3780 


TATCCTCAAA 


AGCCACGTTC 


AAATACTCGG 


AGGACATCAC 


TGTTTTTTAT 


CCCCATTCCC 


3840 


CTATCTTTGA 


TACAAAGCTC 


TTGGTCATCC 


ATATAAATCT 


CCAGACCACC 


TTCCTTGGTG 


3900 


TACTTCAGAC 


TGTTTGAGAT 


GATTTGCTCA 


ATAACCACTA 


GCAGCCACTT 


TTTATCCGTC 


3960 


ACGATTTCTT 


TATCAACGTC 


ATGTAGATTG 


ACATTTAAGC 


CTTTTTGAAT 


AAAGAAAAGA 


4020 


GCATATTTAC 


GAATTATTTC 


CTTGACCAAG 


TCCTCAATTT 


GAACCTGCTT 


TAAGACCAAA 


4080 


TCATCATGGA 


AACTTTCTAA 


ACGCAGGTAC 


TGTAAAACTA 


GGTTGGTATA 


CCACTCGATT 


4140 


TTGAAAATTT 


CCTGTTCTAG 


CTGCTGCTTC 


AGTTGGCGGT 


CGACCACTTC 


TGCAACTAAG 


4200 


AGTTGACTGG 


CTGCAATGGG 


GGTCTTTATC 


TGATGGACCC 


ACAAGCTATA 


GTAATCCAGC 


4260 


AAATCCGTCA 


GTTTTCTTTC 


TGCTTTTGAC 


CTCTGCTGAT 


ACACTTCCAT 


CTCACGCGCT 


4320 


TCTAATTTTT 


CTGCTAAAGC 


TATTTCCAAA 


GGAGACTTGG 


CTTCCCTCTC 


TCCATACAGA 


4380 


AGTTCCTGGC 


GATAGACCTG 


CCTTTCCACC 


AATATGTCCC 


AAGTGAAAAA 


TAATATGGTT 


4440 


ACAAAGCAAC 


ACAAGAAGAA 


AAAGTACAGG 


AAGTAAATTC 


CTAGACTGGC 


AAATAAAAAC 


4500 


TGAAAGAGTA 


AGACAAGAAA 


TGCCAAAGAA 


AGCAGATAGA 


TAAAAAGACG 


ACTACGGGAG 


4S60 


CGCAGATAGG 


CTAGAAAAAA 


TTGTTTCCAA 


TCAAGCATGC 


TTCAATCCGT 


ACCCTATTCC 


4620 


TTTCTTGGTC 


TCGATAAATC 


CTACCAATCC 


CTGCTCCTCC 


AACTTTTTAC 


GCAAACGAGC 


4680 


CACATTGACA 


CAGAGGGTAT 


TATCATCAAT 


GAAAAAGTCA 


CTGTTCCAAA 


GTTCCCGCAT 


4740 


CAGGTCGTCA 


CCTGCTACGA 


TGTTGCCTGC 


ATGCTCAAAT 


AACACGCGTA 


AAATCTGGAA 


4800 


TTCATTCTTG 


GTCAAATTCA 


AGACTTGCCC 


TTGATAATGT 


AAATCCATGG 


ATTTGGTATT 


4860 


GAGGATAACA 


CCAGCATATT 


CCAGCAAACT 


CTCATCACGC 


CCAAACTCAT 


AGGAACGACG 


4920 


CAACAAGCCC 


TGAACCTTAG 


CTAAAAGAAC 


CTGCTGGTCA 


AAAGGCTTGG 


TCACAAAGTC 


4980 


ATCCGCCCCC 


ATATTGATTG 


CCATGACAAT 


ATCCATAGCC 


TGGTCTCTCG 


AAGAAAGAAA 


5040 


CATGATAGGT 


ACCTTGGAAA 


TCTTGCGGAT 


TTCCTGACAC 


CAGTGATAAC 


CATTAAACAA 


5100 


GGGCAAACCA 


ATATCCATGA 


GGACCAGATG 


AGGTTCCGAC 


TGAACAAATA 


GACTCAAAAC 


5160 



0 



WO 98/18931 



PCT/US97/19588 



161 



litCATAAnu 




WWnWW^%W * » ^» 


AAATCCCCAT 


TCAGAGAGCA 


TTTTCCCAAT 


5220 


CTGTTGACGA 


A IuALIk 1 wA 4 


CATCTTCTAT 


TAATAAAATC 


TTGTGCATwC 


GCTTCTCCTT 


5280 


TTCCATTATT 


ATAACAGATT 


A * A W W** A W^ " 


AGATGGTCTG 


AAACTGAATT 


TGAAATAGCC 


5340 


TGTTTTTAGC 


CAGTACAAAC 


auww A •» * ww * 


ACTAGCTAAT 


TTGAGGGAAA 


TTTGCTAAGA 


5400 


TAAATAAAAA 


GAAAGGAGCT 


W & 4 A A WWW W** 


ATATTTTTGA 


CTATCTGAAA 


GATGTCGCAT 


S460 


ATGATTCTTA 


TTACGACCTT 


WWW 1 1 VaAA A V* 


ACTTAGACAT 


TCTAACCTTA 


ATAGAAATCA 


SS20 


CCTACCTCTC 


CTTTGATAAT 


W 1 ww 1 W A W WA 


WVW * * WW * 




GATCTAGCAC 


5580 


CTCAGGTTCC 


AAGAGATCCC 


Aww A lul> 1 in 


W A AwW#W*"f* 


TCGCCTTCAA 


TTATTAGATG 


5640 


AATTCGCTCA 


ACACAAGCGC 


i 1 tnnnnn * * 


wW^W\W A W * W 


CCATTTTATC 


AACGACATCG 


5700 


ACCCTGAACT CCAAAAGCAA 


IT! wwwwW 1 A 


A 1 A A ww 


TCTCAGCCTC 

A W A W*^wW w * w 


GATACCTATC 


5760 


^ T*T*^I^^ ^T**^ 
1 OA 4 iuiLl 4 


TfGTGGGACA 


GAT L» A w Ali I A 


1 WA 1 i WwW A w 


HAACGAAGAT 


TTCCACCTGA 


5820 


CCTATATGAA 


GGAAATTCCT 




nLuLUL i A wv» 


CTAfTTAAAG 




S880 


CCCATCATCC 


TAAGCAAAAG 


w A* Al 1W1 AW 


w 4 wwwWA 1 1 v 


CAAGGCAGGA 


AATCTCGCTA 


S940 


TCTATGCTGC 


TAGCCAAATT 


GAGCAAAwTT 


«T»f-/~ A A A A TP" A. 


CL ATC AC AGC A 


CTTTATACAT 


6000 


TTGATGCACC 


TGGTCTCCAT 


LAAuAAl 1V*A 


WAw Aw AW A wW 


CCG TTATCAA 


AGG AT AATGG 


6060 


ATAGAAGCAA 


GATATTCATT 




WWA 1 i A i Www 


TATGATCCTG 


GAAATTCCTG 


6120 


CTCACCAAAT 


CATCGTTCAG 




A www A UVfw> A 


CCCCCAGCAC 


GATACCTTTA 


6180 


GTTGGCACAT 


TGAGCACAAG 


^.nL 1 A Ww I ww 


AACTGCATAA 

rww A ww*> * 


GACCAACAGT 


GATAGCCAGC 


6240 


AAGTAGACAC 


AACCTTTAAA 




WWnWrtV) • WWW 


TGACGAAGAA 


CTTCAGCTCT 


6300 


ACTTCCACCT 


CTTCTTTGGC 


Aw 1A1 IWi Aw 


ATGCTGGtAT 


TAGCTCTATC 


AATGACTTGG 


6360 


CTTCCTTAAA 


GGCGCTTGAA 


TACATTCATC 


AT CTCTT TGT 


CCAAGCTCAA 


TCCCTCACTC 


6420 


CAGAAGAAAG 


AGAAACCTTG 


GGTCGCCTTA 


CCCAGTTATT 


GATTGATACT 


CGTTACCAGG 


£ 4 an 

040U 


CATGGAAAAA 


TAGATAATAC 


TCTTGAAAAT 


TAAATGTATA 


CAAAACAAAA 


GACCTAGAAT 


6540 


ACATACTTTC 


ATGTGCATTC 


TAAGTCTTTT 


TAAATAGAAT 


CTAATAGTCA 


ATAAAAATCA 


6600 


AAGAGCATTG 


AGAGATAATG 


GGGCTTGGAA 


CGTCCCTCTC 


GCTTCAACAA 


AATGACCCCA 


6660 


TTATAGATTA 


AAAAGATGCC 


ACTTAGAAAA 


AGCAAAAAAG 


GAAGTAACAC 


AAAGGCAAAT 


6720 


ATATAAAAAG 


CTAACTGAAC 


ATTCTCGTAT 


CCATTTTTAT 


AAAAAAGGTA 


CCATAGATAA 


6780 


AAATAACTTG 


AAATGAGGGA 


TAATAAAAAT 


AATACTGGAT 


TC C ACAAACT 


TCTATTATCC 


6840 


TTCCAAAATG 


ACACTATAAA 


GGCTAATACA 


ATTCCTATAA 


CGAGATACAT 


TTCTTACTCC 


6900 
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TTTAATACCT AC ATTTT AT C ATAATTATCC AAAGAAAAAA GAGCCCATTT ATCCCTCTTA 6960 

ATCCTTCATC TGACTCTCTG CATCGGCCAC GACTTTTTCT AGACTGGTTT GACCAAGTTC 7020 

TGCCTCCATA GTCAACTGAA TTCTCTCCAA TTTTTGATCC AAAACATCAT GAATATGAGC 7080 

TCCTACAGGG CAATTTGGAT TCGGATTGTC ATGGAAACTG AAGAGTTGAC CTGTCTTACC 7140 

AAGACATTCG ACCGCCTGAT AAACATCTAA AAGACTAATA TCCTTAAGGT CCTTGACAAT 7200 

CTCTGTTCCG CCCGTTCCAC GCGCTACTGA AATCACCTCT GCCTTCTTCA ACTGGGACAA 7260 

GATCTTTCTG ATAATCACAG GATTGACCCC GACACTAGCA GCCAGAAAAT CACTGGTCAC 7320 

CT T G CTTTCC TTCCCCTCGA GGGCAATGAT TATCAGCATA TGAGTCGCAA TGGTAAATCT 7380 

ACTTGGAATT TGCATCCTCT TCTCCTTTTT ACGAGGCTAC CCTCCCTCTA CTCTTCTTTT 7440 

TCTATTATTA TACCCTTTTT AGTTGTAATG TCAATCGTTA CCACTTTTCA ACCAGTCCTC 7500 

TAACTCCCGA TCGCAGCCCT CTTTCTGAGC CAATTCTCTC AAAAATTCCT GATGATGAGT 7560 

ATGGTGGATC CCATTGAC CA CACTTTCATA GTAAACCTCA AAATAGGGAA GTCTCACCTC 7620 

TTTAGCCAGC TGCAATTCAG CTCCTACATC CTACTCTACC CCTCGGAAGT CCATATCTAC 7680 

CAGGCCTTTG TCATCAAACT CCAAAATCAT ATACTCGGCC CGCAAGTCCT TCCGTAGCTC 7740 

AGCGTCCAAA AAGAAAGGTT GGCCAATCGA ACCCCCATTG ACAATCAATT GCCCACCACT 7800 

CCCGTAACGA AGCAACTGCT GGTCAATATG TCCATAAACA CCAATATCAC AGGGAGGATG 78 60 

AGTCACCAAG CGGTCAAACT CCTCTTGTTT GCCAGTATGA ATC AACTCTC GCCCCCAGTT 7920 

CTTATCAGGC AGATGATCGC TAATTCCCAC CCTCAAATCC CCAAACTGAC GATCAATTTG 7980 

AAGAGGTTGA TTGTGCAGCA CTTCAATTTC TTCTAGGCAA ATTTCCTCTA AAACATACTG 8040 

GCACTGCCGC AAGAGATAGC GTTGACTGGG GCGAGTACTG TCCAATTCCT TACGGACACC 8100 

ATGCCAAAGA CTGTCTTCCC ACTTTCCCAA AACTCTAGCC CTAATCGGTA GTTGATCCAA 8160 

CAAGTCCAAA ATCCTTCTAC GCCCTGTCCC TGGCATGAGA ATATCTCCCA AAAGCCAGTA 6220 

TTCATCCACT CCTATCTGCC GAGCATCTGC CAAAACAGCC TCCAAGGCGC TOGTATTTCC 8280 

ATGAATATCT CAAAGAAGAG CTATTTTCGT CATATCCATC TCCTCCTTTT TTCTCTTGCA 3340 

ATAAGTATAA CATAAAAAGT CACAGCTAGA GAAATCTAGC TTTTTT TG AT ATACTAGATA 8400 

AAGATATTAG ACAAGAGGAA ACGAATGACC CCAAACAAAG AAGACTATCT AAAATGTATT 8460 

TATGAAATTG GCATAGACCT GCATAAGATT ACCAACAAGG AAATTGCGGC TCGCATGCAA 8520 

GTCTCTCCCC CTGCCGTAAC TGAAATGATC AAACGAATGA AAAGTGAAAA TCTCATCCTA 8580 

AAGGACAAGG AATGTGGCTA TCTACTGACT GACCTCCGTC TCAAACTGGT CTCTGAGCTC 8640 

TATCGTAAGC ACCGCTTCAT TGAAGTTTTT CTAGTTCATC ATTTAGACTA TACAAGTGAC 8700 
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4 



CACATTCACG 


AGGAAGCTGA 




PAPACTGTCT 


CTGACCTGTT 


CGTGGAAAGA 


8760 


CTAGATAAAC 


TGCTAGGTTT 






GGGGAACTAT 


TCCTGCCAAG 


8820 


GGAGAACTAC 


TCGTTGAAAT 


CAATAACCTv 




ATATCAAGGA 


AGCTGGCGCC 


6880 


TACCGCCTCA 


CTCGGGTGCA 


CGATAGTTTT 




ATTATCTCGA 


CAAGCACTCA 


8940 


CTTCACATCG 


GTGACCAGCT 


CCAAGTCAAG 




GCTTCACCAA 


TACCTTCACT 


9000 


ATCCTCACTA 


ACGACGAGGA 


TTTACAAGTG 


AATAlwAU* 


TTfiP AAAAf* A 


ACTCTATGTC 


9060 


GAGAAAATCA 


ACTAATTTCP 


CAAGTCCCCT 


Al.^_4w%t.^*» * v* 


AfiTTTT AT 


TTTGGCTCTT* 


9120 


TGTCAACTGT 


AGTGGGTTGA 


AGTCACCTAA 








9180 


TTTTTGATAT 


TCAGAGCGAT 


AAAAATCCGT 


TTTTTGAAGT 


TTTCAAAGTT 




9240 


AAGGCATTGC 


CCTTGATAAG 


TTTGATGAGA 


TTATTGGTvA* 


CTTCCAGTTT 




9300 


TAGTCTACTT 


GAAGGGCGTT 


GACAATCTTT 


TCTTTATCTT 




TTTAAAGACA 


9360 


GTCTGAAAAA 


TAGGATGAAC 


CTGCTTTAGA 


TTGTCCTCAA 






9420 


CGTTTCTTAT 


TCTCAAACTG 


AAACAGCAAG 


AGTTGATAGA 






9480 


TCTTGTGAAT 


AGCTCAAAAG 


CTTGTCTAAA 


ATCTCTTTAT 






9S40 


GTAGGACGAT 


AAAATCGCTT 


ATCACTCACT 


TTACGG CT AT 


CCTGTTGTAT 




9600 


TAGCGCTTGA 


TAGCCTTGTA 


TTCATGGGAT 


TTTCGATCCA 


ATTCGTTUA I 


AATTTGAAPA 


9660 


CGCACACGAC 


TCATAGCACG 


GCTAAGATGT 


TCTACAATGT 


GAAAGCGATC 


CAACACGATT 


9720 


TTAGCATTCG 


CCACTGAAAC 


AGTCTGGGAG 


ACTGTTTCAG 


CCTGAGCCTA 


CAAATTTGAA 


9780 


AGCGAACCTG 


TTTAGCCAAG 


TCATAGTAAG 


GACTAAACAT 


ATCCATCGTA 


ATGATTTTCA 


9840 


CTTGACAACG 


AACGGCTCTA 


TCGTAGCGAA 


GAAAGTGATT 


TCGGATGACA 


GCTTGTGTTC 


9900 


TGCCTTCAAG 


AACAGTGATA 


ATATTAAGAT 


TATCAAAATC 


TTGCGCAATG 


AAACTCATCT 


9960 


TTCCCTTAGT 


GAAGGCATAC 


TCATCCCAAG 


ACATAATCTT 


TGGAAGCCCA 


GAAAAATCAT 


10020 


GCTCAAAGTG 


AAAGTCATTG 


AGCTTGCGAA 


TGACAGTTGA 


AGTTGAAATG 


GCCAGCTGAT 


10080 


GGGCAATATC 


ACTCATACAA 


ATTTTTTCAA 


TTAACTTTTG AGCAATyTTT TGGTTGATGA 


10140 


TACGAGGGAT 


TTGGTGATTT 


TTCTTTACCA 


GGGGAGTCTC 


AGCAACCATC 


ATTTTTGAAC 


10200 


ACTGATAGCA 


CTTGAAACGA 


CGCTTTCTAA 


GGAGAATTCT 


AGAAGGCATA 


CCAGTCGTTT 


10260 


CAAGATAAGG 


AATTTTAGAA 


GGTTTTTGAA 


ACTCATATTT 


CTTCAATTGC 


TTTCCGCACT 


10320 


CAGGGCAAGA 


TGGCGCCTCG 


TAGTCCAGTT 


TGGCGATGAT 


TTCCTTGTGT 


CTATCCTTAT 


10380 


TGATGATGTC 


TAAAATCTGG 


ATATTAGGGT 


CTTTAATGTC 


TAGTAATTTT 


GTGATAAAAT 


10440 
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GTAATTGTTC 


CATATGATTC 


TTTCTAATGA 


GTTGTTTTGT 


CGCTTTTCAT 


TATAGGTCAT 


10500 


ATGGGACTTT 


TTTTCTACAA 


TAAAATAGGC 


TCCATAATAT 


CTATAGTGGA 


TTTACCCACT 


10560 


ACAAATATTA 


TAGAACCGTA 


AAAATAGAAG 


GAGATAGCAG 


GTTTTCAAGC 


CTGCTATCTT 


10620 


TTTTTGATGA 


CATTCACCCT 


GATACGAAAT 


CATAAGAGGT 


CTGAAACTAC 


TTTCAGAGTA 


10680 


GTCTGTTCTA 


TAAAATATAG 


TAGATTGAAA 


TAAGATGTGA 


ACAACTCTAT 


CAGGAAAGTC 


10740 


AAATTAATTT 


ATAGAATTAT 


TTTACCAGTC 


AAGGTGTACT 


CTTATAGATT 


CAATATATTA 


10800 


TATGACTATT 


AACCTTCTCT 


TCTCCTAAAA TTGACTTTCT 


TGTTTTCTTA 


TCTTGTCCAC 


10860 


TCGAAACAAG 


TATTCTAAGA 


ATTTGATTAT 


TTTTGAAAGT 


ACTTTTAATA 


TACTTGATAT 


10920 


AGTTAAAAAA 


GATTTGAAAC 


TAAATTCCAA 


ATTAGAAAAA 


GACTTGAAAT 


ACTAAAAAAA 


10980 


AAAAAGTATA 


CTCTAATTGA 


AAACGGTAAC 


AAAACTAATT 


TAGACAATGA 


AATATAGAGT 


11040 


ATTTCTCTCT 


TAAAAGTTTT 


TGGTGAAACG 


AGATGTAGAA 


AGGAGATTTA 


GCCAAAGAGT 


11100 


CTATTAGTGC 


TAGAATAATA 


GATTAGAATT 


ATTTTAGAAA 


AACGAAGTGA 


CCAGCTTATA 


11160 


AATTCAAGTC 


CCCAAATAGA 


TTCATACTAG 


TATCTTTTGC 


AAAAAATAAA 


GGGCGACTTC 


11220 


CTTCATGAAT 


ATCAATTTCA 


TCTATAAGGA 


AGGTAGCTAA 


TTGAACTAAC 


TTATTTATTC 


11280 


TCTTTCTCGC 


TAGAAAAATC 


AGACCTCCTT 


GTGAAGATTG 


AGGAGATACT 


TAATGAAAAT 


11340 


CAAAGAAGAA 


ACTAGCAAGC 


TAGTACCAGA 


TTGCCCAAAA 


CACCGCTTTG 


AGCTTGTAGA 


114O0 


TAAGACTGAC 


CTATATAATC 


CAAGGTGAAG 


CGACTGTGGT 


TTGAAGAGAT 


TTTCAAAGAG 


11460 


TATAGGCTAG 


AGAGTAGTGT 


TTTTATGTCC 


TTCTAGTAGA 


AAATGCTAGA 


CAGAAGAATG 


11520 


GGCAACTTGG 


ATAGGAAAAA 


TACATTGAGA 


AAGGAGGTTA 


GAAGAGATGA 


TTATTACAAA 


11580 


AATTAGCCGT 


TTAGGAACTT 


ATGTGGGAGT 


AAATCCACAT 


TTTCCAACAT 


TAATACATTT 


11640 


TCTAGAAAAA 


ACAGGACTAC 


AAAATTTAAC 


AGAAGGTTCG 


ATTGCTATCG 


ATGGTAATCG 


11700 


ATTCTTTGGG 


AATTGCTTTA 


CTTATCTAGC 


AGATGGTCAA 


GCAGGGGCTT 


TCTTTGAAAC 


11760 


CCACCAAAAA 


TATTTGGATA 


TTCATTTAGT 


TTTCGAAAAC 


GAACAAGCCA 


TGGCTGTTAC 


11820 


ATCGCCGGAA 


AATGTAAGCG 


TTACCCAAGA 


ATATGATGAA 


GAGAAAGATA 


TTGAATTATA 


11880 


CACAGGGAAA 


CTGGAACAGT 


TGGTTCATTT 


GAGAGCTGGC 


GAATGCCTCA 


TCACTTTTCC 


11940 


AGAAGATTTA 


CATCAACCCA 


AGGTTCCTAT 


AAATGATGAA 


CCTCTCAAAA 


AAGTTGTCTT 


12000 


TAAAGTTGCG 


ATTTCTTAAT 


GTAGAAAGAG 


AAGAACGATG 


AAAAAAATGA 


GAAAGTTTTT 


12060 


ATCTCTAGCT 


GGAATTGCGC 


TAGCGGCTGT 


TGCCTTGGTA 


GCTTGTTCAG 


GAAAAAAAGA 


12120 


AGCTACAACT 


AGTACTGAAC 


CACCAACAGA 


ATTATCTGGT 


GACATTACAA 


TGTGGCACTC 


12180 


CTTTACTCAA 


GGACCCCGTT 


TAGAAAGTAT 


TCAAAAATCA 


GCAGATGCTT 


TCATGCAAAA 


12240 
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GCATCCAAAA 


& /"V* A & A A *Pf" A 


iLf ATTnAAAC 
AlaAl 1 \J/W\V» 


ATTTTCTTGG 


AATGACTTCT 


ATACTAAATG 


12300 


GACTACAGCT 


TvAGCAAATG 




AnATATCACT 
nwn 1 /\ X wnw * 


ACAGCTCTTC 


CTAACCAAGT 


12360 


AATGGAAATG 


GTCAACTCAG 


ATGCTTTOGT 


*pf*f*GPTAAAT 

1 1 AAA ft 


GATTCTATCA 


AGCGTATTGG 


12420 


ACAAGATAAA 


IT iAAVAiAAA 


L» 1 wLL I 1 AAA 


TCAACGAAAA 

J VJAnV*Wnnnn 


ATCGGAGATG 


ATTACTACTC 


12480 


TGTTCCTCTT 


tm m wxw^ ft ^ ft 

TATTCACATG 


ft ft ft / »»p^* A^ 
UntnAu 1 wA 1 


Vj A u\*u i ft nun 


ACAGATTTGT 


TAAAAGAACA 


12S40 


TAATATTCAG 


tfl iCCTAAAA 


LTIubuA 1 ^A 


AfTfTATGAA 
Aw i w l ni vmv* 


GCTTCTAAAA 

WW 4 A W ft *V M W I 


AATTGAAAGA 


12600 


AGCTGGAGTT 


TATGGCTTGT 


CTGTTCCGTT 






CAAgACGTTT 


12660 


CTTGAACTTC 


TACGTACCTA 


-allM ■ rf^Ml^ ^ ft 

rTGGTGGAGG 


AAViCL Ivl In 


AfAAAAGATC 

AW AAAAVAA 4 W 


ttaaaccajca 


12720 


CTTGACAAGC 


CAACTTGCTC 


AAGATCGTAT 


TAAA1 nw 1 Mt 


Ui 1 AAA 1 1 \J I 


ATAAACAJLAT 
n 1> /\nnunnn & 


12780 


CTCACCTCAA 


GATTCTTTGA 


AC TTTAATGT 


B*!**!/^ K ft #» ft ft 

CCTTCAAUAA 


Vit 1 Aw W 1 1 Vj 1 


TfTATTAACC 
i V>_ 1 A IUwWM 


12840 


AAAAACAGCA 


TTTGACTTTA 


ACTCTGGCTT 


h «V ft I*/*/?/* ft 

CCATATCGGA 


A A A A'Prt 


LLAALAu 




TCAATTGATT 


GATTCGATTG 


ATGCTTATCC 


fn ■> muma ft ft ft ft 

TATTCCAAAA 


ft*TV*A A Af*A*"^ 
ATUUvttiAu 1 


L> 1 A_iA 1 AAAUA 




CCAAGGAATT 


GAAACCTCAA 


ACATTCCAAT 




A A A A a*prriA 

AaAAA 1 1 wAA 


A A f ATt^ff*Afi A 
AALA * V-L A«A 


13020 


AGTTGCTAAA 


GCATTCTTAG 


AAGCACTTTA 


n* ft ft T»f ft ft r* ft ft 
TAATCAAbAA 


r* a r**p a fv**p*r a 

UAL 1 Awvj Lin 


AA 1 Iw^l I VjM 


13080 


TTCAACTCCA 


GTAGGTATGT 


TGCCAACTAT 


TAAGGGGATT 




LAuLiW 1 A 1 AA 


13140 


AGAAAATGAA 


ACTCGTAAGA 


AATTTAAACA 


tn^m^mtm^ m. ft Aft ft 

TGCTGAAGAA 


ft A*r*TA<*^^R 
V* I AA 1 1 A(. I \B 


AAGCTGTTAA 


13200 


AAAAGGTACT 


GCTATTCCTT 


ft #1%^ ft ft ft ft rttfO^ 

ATGAAAATGG 




w AAvjw i \#w l A 


• Vj ft ft \jnw ft nn 


13260 


CCAACACATT 


ATTGAACAAA 


«m^www^^ ft ft ^* ft 

TGTTCCAAGA 


TA » 11 AwA 


A ATftfl AAC AG 
AA 1 Aw 


ATCCTATGAA 
n ft ww * n * wnn 


13320 


AGCAGCAAAA 


GAAGCAGAAA 


AACAATTAAA 


TGATTT ATTT 




AGTAGATGTA 


13380 


AAAGACTAGA 


AAATAGGTGG 




*mAAAAf"2f*"Tf* 

IWWWttfk i W 


TACCCCAATC 


TTGTAAAAGA 


13440 


AGGGAGAAGG 


ft ^ ft ft #V*^*^**fMf*ft 

AGAATGGTTA 


& ft/" ft ft i^^T & & 
AAuAAtuTAA 


ill aa*» i wvjw 


TGGATATTTG 


TTTTGCCAGC 


13500 


TATGATTATC 


GTAGGATTAC 


TCTTTGTTTA 


TCCCTTTTTC 


TCGAGTATTT 


TTTATAGCTT 


13S60 


TACCAATAAG 


CATTTGATTA 


TGCCTAATTA 


TAAATTTGTT 


GGTTTGGCTA 


ACTATAAAGC 


13620 


TGTGCTATCA 


GATC C CAACT 


TCTTTAATGC 


GTTCTTTAAT 


TCAATTAACT 


CGACCGTTTT 


13680 


CTCATTAGTT 


GGTCAAGTTT 


TAGTAGGGTT 


TCTATTGGCT 


TTAGCTCTTC 


ACAGAGTACG 


13740 


CCACTTCAAG 


AAATTATATA 


GG AC ATT ATT 


GATTGTTCCT 


TGGGCATTTC 


CTACCATCGT 


13800 


TATTGCCTTC 


TCTTGGCAGT 


GGATTCTAAA 


CGGGGTTT AT 


GGCTACTTAC 


CTAATCTAAT 


13860 


CGTAAAATTA 


GGTTTAATGG 


AACATACACC 


TGCATTTTTG 


ACAGATAGTA 


CATGGGCATT 


13920 


CCTATGTTTG 


GTGTTTATCA 


ACATTTGGTT 


TGGAGCACCA 


ATGATTATGG 


TTAATGTGCT 


13980 
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TTCACCTTTG 


CAAACAGTAC 


CAGAAGAACA 


ATTTGAGGCT 


GCTAAGATAG 


ATGGTCCTTC 


14040 


AAGTTCGCAG 


GTGTTCAAGT 


TTATCCTCTT 


TCCACATATT 


AAAGTGGTTG 


TAGGACTTCT 


14100 


AGTTGTTTTG 


AGAACTCTAT 


GGATCTTTAA 


TAACTTTGAC 


ATTATCTACC 


TCATTACTGG 


14160 


TGGTCCACCA 


GCCAATGCTA 


CAACGACGCT 


TCCAATTTTT 


GCTTACAACC 


TGGGCTGGGG 


14220 


AACTAAATTG 


TTGGGTCCTG 


CTTCAGCAGT 


TACAGTACTG 


CTCTTTATCT 


TCTTGGTGGC 


14280 


CATTTGCTTT 


ATCTACTTTG 


CTATCATCAG 


TAAGTGGGAA 


AAGGAGGGTA 


GAAAATAATG 


14340 


AAGAAGAAAT 


CCACTATTTA 


TTTAGATATT 


CTCTCACATG 


TACTTTTAGT 


TGGTGCGACC 


14400 


ATCCTTGCAG 


TTTTCCCATT 


GGTATGGATT 


ATCATATCTT 


CTGTCAAAGG 


GAAAGGCGAA 


14460 


TTAACTCAGT 


ATCCAACACG 


ATTTTGGCCT 


GAACACTTTA 


CATTAGATTA 


TTTCACTCAT 


14520 


GTTATCAACG 


ATTTGCACTT 


CATTGATAAC 


ATTCGAAACA 


GTTTAATCAT 


TGCCTTGGCT 


14580 


ACAACCCTTA 


TTGCGATTAT 


TATTTCTGCT 


ATGGCAGCCT 


ATGGTATTGT 


TCCATTCTTT 


14640 


CCTAAATTGG 


GAGCAATCAT 


CTCGAGACTA 


CTCGTCATTA 


CCTACATTTT 


CCCACCAATT 


14700 


TTCTTAGCAA 


TTCCCTATTC 


AATTGCCATT 


GCTAAAGTTG 


GGTTAACAAA 


TAGTTTATTT 


14760 


GCCTTGATGA 


TGGTTTATCT 


ATCTTTTAGT 


GTTCCATATG 


CAGTTTGGCT 


CTTAGTTGGA 


14820 


TTTTTCCAAA 


CAGTTCCAAT 


TGGAATTGAA 


GAACCGGCTA 


GAATTGATGC 


TCCAAATAAA 


14880 


TTTGTTACGT 


TTTATAAAGT 


TGTGCTACCG 


ATTCTAGCAC 


CAGGTATTGT 


AGCAACAGCT 


14940 


ATTTATACAT 


TTATCAATGC 


TTGGAATGAA 


TTCCTGTATG 


CCTTGATTTT 


GATTAACAAT 


15000 


ACAGGAAACA 


TGACAGTAGC 


AGTAGCCCTT 


CGTTCACTTA 


ATGGTTCAGA 


AATACTACAC 


15060 


TGGGGAGATA 


TGATGGCAGC 


GTCTGTTATT 


GTAGTTCTTC 


CATCAATTAT 


TTTCTTCTCT 


15120 


ATCATCCAAA 


ATAAGATTCC 


AAGTGGATTA 


TCAGAAGGAT 


CTGTGAAGTA 


GACGAAAGAA 


1S180 


GGAAAAAAAT 


GAATAAAAGA 


GCTCTTTATT 


CAAAACTAGG 


AATTTCCGTT 


GTAGGCATTA 


15240 


GTCTTTTAAT 


GGGAGTCCCC 


ACTTTGATTC 


ATGCGAATGA 


ATTAAACTAT 


GGTCAACTGT 


15300 


CCATATCTCC 


TATTTTTCAA 


GGAGGTTCAT 


ATCAACTGAA 


CAATAAGAGT 


ATAGATATCA 


15360 


GCTCTTTGTT 


ATTAGATAAA 


TTGTCTGGAC 


AGAGTCAGAC 


AGTAGTAATG 


AAATTTAAAG 


15420 


CAGATAAACC 


AAACTCTCTT 


CAAGCTTTGT 


TTGGCCTATC 


TAATAGTAAA 


GCAGGCTTTA 


15480 


AAAATAATTA 


CTTTTCAATT 


TTCATGAGAG 


ATTCTGGTGA 


GATAGGTGTA 


GAAATAAGAG 


1S540 


ACGCCCAAAA 


GGGAATAAAT 


TATTTATTTT 


CCAGACCAGC 


TTCATTATGG 


GGAAAACATA 


15600 


AAGGACAGGC 


AGTTGAAAAT 


ACACT AGT AT 


TTGTATCTGA 


TTCTAAAGAT 


AAAACATACA 


15660 


CAATGTATGT 


TAATGGAATA 


GAAGTGTTCT 


CTGAAACAGT 


TGATACATTT 


TTGCCAATTT 


15720 


CAAATATAAA 


TGGTATAGAT 


AAGGCAACAC 


TAGGAGCTGT 


TAATCGTGAA 


GGTAAGGAAC 


15780 
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ATTACCTCGC 


AAAAGGAAGT 


ATTGATGAAA 


TCAGTCTATT 


TAACAAAGCA 


ATTAGTGATC 




AGGAAGTTTC 


AACTATTCCC 


TTCTCAAATC 


CATTTCAGTT 


AATTTTCCAA 


TCAGGAGATT 




CTACTCAAGC 


TAACTATTTT 


AGAATACCGA 


CACTATATAC 


ATTAAGTAGT 


GGAAGAGTTC 


tea £A 
X 3 y DU 


TATCAAGTAT 


TGATGCACGT 


TATGGTGGGA 


CTCATGATTC 


TAAAAGTAAG 


ATTAATATTG 




CCACTTCTTA 


TAGTGATGAT 


AATGGGAAAA 


CGTGGACTGA 


GCCAATTTTT 


GCTATGAAGT 




TTAATGACTA 


TGAGGAGCAG 


TTAGTTTACT 


GGCCACGAGA 


TAATAAATTA 


AACAATAGTC 


16140 


AAATTAGTGG 


AAGTGCTTCA 


TTCATAGATT 


CATCCATTGT 


TGAAGATAAA 


AAATCTGGGA 


16200 


AAACGATATT 


ACTAGCTCAT 


GTTATGCCTG 


CGGGTATTGG 


AAATAATAAT 


GCAAATAAAG 


16260 


CCGACTCAGG 


TTTTAAAGAA 


ATAAATGGTC 


ATT ATT A i I T 


AAAACTAAAG 


AAGAATGGAG 


16320 


ATAACGATTT 


CCCTTATACA 


GTTAGAGAAA 


ATGGTGTCGT 


TTATAATGAA 


ACAACTAATA 


16380 


AACCTACAAA 


TTATACTATA 


AATGATAAGT 


AT G AAGTTTT 


GGAGGGAGGA 


AAGTCTTTAA 


16440 


CAGTCGAACA 


ATATTCGGTT 


GATTTTGATA 


GTGGCTCTTT 


AAGAGAAAGG 


CATAATGGAA 


16S00 


AACAGGTTCC 


TATGAATGTT 


TTCTACAAAG 


ATTCGTTATT 


TAAAGTGACT 


CCTACTAATT 


16560 


ATATAGCAAT 


GACAACTAGT 


CAGAATAGAG 


GAGAGAGTTG 


CGAACAATTT 


AAGTTGTTGC 


16620 


CTCCGTTCTT 


AGGAGAAAAA 


CATAATGGAA 


CTTACTTATG 


TCCCGGACAA 


GGTTTAGCAT 


16680 


TAAAATCAAC 


TAACAGATTG 


ATTTTTGCAA 


CATATACTAG 


TGGAGAACTA 


ACCTATCTCA 


16740 


TTTCTGATGA 


TAGTGGTCAA 


ACATGGAAGA 


AATCCTCACC 


TTCAATTCCG 


TTTAAAAATG 


16800 


CAACAGCAGA 


AGCACAAATG 


CTTGAACTGA 


GAGATGCTGT 


GATTAGAACA 


TTCTTTAGAA 


16860 


CCACTACAGG 


TAAGATAGCT 


TATATGACTA 


GTAGAGATTC 


TGGAGAAACA 


TGGTCGAAAG 


16920 


TTTCGTATAT 


TGATGGAATC 


CAACAAACTT 


CATATGGCAC 


ACAAGTATCT 


GCAATTAAAT 


16980 


ACTCTCAATT 


AATTGATGGA 


AAAGAAGCAG 


TCATTTTGAG 


TACACCAAAT 


TCTAGAAGTG 


17040 


GCCGCAAGGG 


AGCCCAATTA 


GTTGTCGGTT 


TAGTCAATAA 


AGAAGATGAT 


ACTATTGATT 


17100 


GGAAATACCA 


CTATGATATT 


GATTTGCCTT 


CGTATGGTTA 


TGCCTATTCT 


GCGATTACAG 


17160 


AATTGCCAAA 


TCATCACATA 


GGTGTACTCT 


TTGAAAAATA 


TGATTCGTGG 


TCGAGAAATG 


17220 


AATTGCATTT 


AAGCAATGTA 


GTTCAGTATA 


T AG ATTTGG A 


AATTAATCAT 


TTAACAAAAT 


17280 


AAAGGAGAAA 


AACATGGTTA 


AATACGGTGT 


TGTTGGAACA 


GGGTATTTTG 


GAGCTGAATT 


17340 


GGCTCGCTAC 


ATGCAAAAGA 


ATGATGGAGC 


AGAG ATT ACT 


CTTCTCTATG 


ATCCAGATAA 


17400 


TGCAGAGGCG 


ATTGCAGAAG 


AATTGGGACC 


AAAAGTAGCA 


AGTTCCTTAG 


ATGACTTGGT 


17460 


TTCTAGCGAT 


GAAGTAGATT 


GTGTTATCGT 


CGCAACTCCA 


AATAATCTTC 


ATAAGGAACC 


17520 
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GCTTATTAAG 


GCTGCACAGC 


ATGGTAAAAA 


TGTTTTCTGT 


GAAAAACCAA 


TTGCCCTTTC 


17560 


TTATCAAGAT 


TGTCCCGAGA 


TGGTAGATGC 


GTGTAAAGAA 


AACAATGTAA 


CCTTTATGGC 


17640 


AGGACATATT 


ATGAATTTCT 


TTAATGGTGT 


TCATCATGCA 


AAAGAACTCA 


TTAATCAAGG 


17700 


AGTTATCGGA 


GACCTTCTAT 


ATTGTCATAC 


AGCTCGTAAT 


GGTTGGGAAG 


AACAACAACC 


17760 


GTCAGTATCA 


TGGAAAAAAA 


TTCGTGAAAA 


ATCAGGTGGT 


CACTTGTATC 


ACCACATCCA 


17820 


TGAATTGGAT 


TGCGTTCAAT 


TCCTTATGGG 


GGGCATGCCT 


GAAACTGTAA 


CCATGACAGG 


17880 


TGGAAATGTG 


GCCCATGAAG 


GTGAACATTT 


CGGTGATGAA 


GATGATATCA 


TTTTTGTCAA 


17940 


TATGGAATTT 


TCTAATAAGC 


GTTTTGCCTT 


GTTACAATGG 


GGTTCAGCTT 


ATCGTTGGGG 


18000 


TGAACATTAT 


GTCTTAATCC 


AAGGAAGCAA 


ACGTGCCATC 


CGCTTAGACT 


TATTCAACTG 


18060 


TAAACGAACT 


CTTAAGCTAG 


ATGGGCAAGA 


AAGCTATTTC 


TTGATTCACG 


AATCGCAAGA 


18120 


AGAAGATGAT 


GATCGGACTC 


GTATCTATCA 


TAGTACAGAG 


ATGGATGGAG 


CAATTGCTTA 


18180 


TGGTAAACCA 


GGTAAACGTA 


CTCCATTATG 


GCTATCATCT 


GTCATTGATA 


AAGAAATGCG 


18240 


CTATCTGCAT 


GAGATTATGG 


AAGGAGCTCC 


AGTATCAGAA 


GAATTTGCAA 


AACTTTTGAC 


18300 


AGGTGAAGCT 


GCCCTAGAAG 


CAATTGCTAC 


TGCAGATGCT 


TGTACCCAGT 


CTATCTTTCA 


18360 


AGATCGCAAA 


GTAAAATTGT 


CAGAAATTGT 


AAAATAAATT 


TTGCTATTCT 


CCTATTTATA 


18420 


GGTCGACTTC 


CTCCTCTGAA 


ACTACTTTTA 


GAGGAGCTGT 


TTGACTTTGC 


TAGTTTTTGA 


18480 


AACTGAAATC 


TATTATACTA 


CAAACTATTG 


AAAGCGTTTT 


AATTTTAAGG 


TATAATAATC 


18540 


TCATAGAAAT 


AAAGAAAACG 


AGGAAAGAGG 


ATGCCACAGA 


TTAGCAAAGA 


AGCCTTGATT 


19600 


GAGCAAATCA 


AAGATGGAAT 


CATCGTTTCT 


TGTCAGCCTC 


TTCCTCATGA 


ACCGCTTTAT 


18660 


ACAGAAGCGG 


GAGGGGTGAT 


TCCCTTGCTG 


GTCAAAGCGG 


CTGAGCAAGG 


TCGAGCAGTC 


18720 


GGTATCCGAC 


CAAACAGTGT 


TCGCGATATC 


AAGGAAATTA 


AGGAAGTCAC 


TAAACTTCCA 


18780 


ATCATTGGGA 


TTATCAAACG 


TGATTATCCA 


CCTCAGGAAC 


CCTTCATCAC 


GGCTACTATG 


18640 


AAAGAAGTTG 


ATGAATTGCC 


AGAACTGGAC 


ATCGAGGTGA 


TTGCTCTGGA 


TTGTACCAAG 


18900 


CGTGAACGCT 


ACGATGGTTT 


GGAAATTCAA 


GAGTTCATTC 


GTCAGGTTAA CGAGAAATAT 


18960 


CCTAATCAGC 


TTTTGATGGC 


TGATACTAGT 


ATCTTCGAAG 


AAGGGCTAGC 


AGCTGTAGAA 


19020 


GCAGGAATTG 


ACTTTGTCGG 


AACAACCTTA 


TCAGGCTACA 


CATCCTACAG 


TCCAAAAGTA 


X9080 


GACGGTCCAG 


ATTTTGAATT 


GATTAAGAAA 


CTCTGTGATC 


CTGGTGTAGA 


TGTCATTGCA 


19140 


GAAGGAAAAA 


TTCATACACC 


AGAACAAGCC 


AAACAAATCC 


TTGAATATGG 


AGTGCGAGGC 


19200 


ATCGTTGTTG 


GTGGCGCCAT 


TACTAGACCA 


AAAGAGATTA 


CAGAACGCTT 


CGTTGCTAGT 


19260 


CTTAAATAAC 


ATGTGAGGGG 


GAGTTTTATG 


TTTAAAGTTT 


TACAAAAAGT 


TGGAAAAGCT 


19320 
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TTTATGTTAC 


CTATAGCTAT 


ACTTCCTGCA 


GCAGGTCTAC 


TTTTGGGGAT 


TGGTGGTGCA 


X9360 


CTTTCAAACC 


CAACCACCAT 


AGCAACTTAT 


CCAATACTAG 


ACAATAGTAT 


TTTTCAATCA 


19440 


ATATTCCAAG 


TAATGACCTC 


TCCAGGAGAG 


GTTGTATTCA 


CTAATTTGTC 


ACTACTTCTC 


19500 


TGTGTGGGAT 


TATGTATTGG 


CTTAGCCAAA 


CGAGATAAAG 


GAACCGCTGC GTTAGCAGGA 


19560 


GTAACTGGTT 


ACTTACTTAT 


GACTGCAACG 


ATCAAAGCTT 




TTTTATGGCA 


19620 


GAAGGATCTG 


CAATTGATAC 


TGGAGTTATT 


GGAGCATTAG 


TTGTCUGAAT 


AGTTGCCGTA 


19680 


TATTTGCACA 


ACCGATATAA 


CAATATTCAA 


TTACCTTCCG 


CTTTAGGATT 


CTTTGGAGGT 


19740 


TCACCCTTCG 


TTCCTATTGT 


TACATCGTTC 


TCTTCTATCT 


TGATTGGCTT 


l\. » _ 1 


19800 


GTTATTTGGC 


CACCTTTCCA 


ACAACTTCTT 


GTTTCTACAG 


GTGGATATAT 


TTCTCAGGCG 


19860 


CGTCCAATTG 


GAACTTTTCT 


ATATGGATTT 


TTAATGAGAC 


TTTCTGGAGC 


ft ft /*^MHA H 

AGTAGGCTTA 


19920 


CATCATATAA 


TTTACCCTAT 


GTTTTGGTAT 


ACTGAACTTG 


GTGGTGTTGA 


AACTGTTGCA 


19980 


GGACAAACAG 


TGGTTGCAGC 


TCAAAAAATA 


TTTTTTGCTC 


AATTAGCCGA 


TTTCGCCCAT 


20040 


TCTGGATTAT 


TTACAGAAGG 


AACAAGGTTT 


TTTCCAGGTC 


GTTTCTCAAC 


AATGATGTTC 


20100 


GGTTTACCGG 


CTGCCTGTTT 


AGCGATCTAC 


CATAGTGTTC 


CTAAAAATCG 


TCGTAAAAAA 


20160 


TACGCGGGTT 


TGTTTTTTGG 


AGTTGCTTTA 


ACATCTTTTA 


TT ACCGGT AT 


TACAGAACCA 


20220 


ATTGAATTTA 


TGTTTCTATT 


CGTCAGTCCG 


CTTCTATATG 


TTGTTCACGC 


ATTCCTTGAT 


20280 


GGTGTTAGCT 


TCTTTATTGC 


AGACGTCTTA 


AATATTTCAA 


TAGGAAACAt. 


A I I I n.A(A*n 


20340 


GGTGTAATCG 


ATTTCACTTT 


ATTTGGAATT 


TTCCAGGGGA 


ACGCTAAGAC 


CAATTGGCTT 


20400 


CTTCAGATTC 


CATTTGGACT 


TATTTGCAGT 


GTTTTGTATT 


ATATTATTTT 


TAGATGGTTC 


20460 


ATTACTCAAT 


TCAACCTTCT 


AACGCCAGGG 


CGAGGAGAAG 


AAGTAGATTC 


TAAAGAAATT 


20520 


TCTGAATCCC 


CAGATTCAAC 


TTCAAATACT 


GCAGATTATT 


TAAAACAGGA 


TAGCCTACAA 


20580 


ATTATCACAG 


CCTTGGGTGG 


ATCAAATAAT 


ATAGAAGATG 


TAGATGCTTG 


TGTGACACGT 


20640 


TTACGTGTAG 


CTGTAAAAGA 


AGTTAATCAA 


GTTGATAAAG 


CACTTTTAAA 


ACAAATTGGT 


20700 


GCAGTTGATG 


TCTTAGAACT 


GAAGGGTGGC 


ATTCAAGCAA 


TCTATGCAGC 


AAAAGCAATC 


20760 


TTATATAAAA 


ATACTATTAA 


TGAAATTTTA 


GGTGTAGATG 


ATTAAGTACT 


TACTGACTTA 


20820 


ATAAAAAACA 


GAGGAGAGTG 


ATGGATGAGT 


AGGATGAAAT 


GAAATCGCAT 


ACAAGAAATA 


20880 


AAGAACTCAT 


TATCCAAGTT 


GGATACGCTT 


ATTACATAGG 


AGAATACAAA 


TGAAATTTAG 


20940 


AAAATTAGCT 


TGTACAGTAC 


TTGCGGGTGC 


TGCGGTTCTT 


GGTCTTCCTG 


CTTGTGGCAA 


21000 


TTCTGGCGGA 


AGTAAAGATG 


CTGCCAAATC 


AGGTGGTGAC 


GGTGCCAAAA 


CAGAAATCAC 


21060 
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TTGGTGGGCA 


TTCCCAGTAT 


TTACCCAAGA 


AAAAACTGGT 


GACGGTGTTG 


GAACTTATGA 


21120 


AAAATCAATC 


ATCGAAGCCT 


TTGAAAAAGC 


AAACCCAGAT 


ATAAAAGTGA 


AATTGGAAAC 


21180 


CATCCACTTC 


AAGTCAGGTC 


CTGAAAAAAT 


CACAACAGCC 


ATCGAAGCAG 


GAACAGCTCC 


21240 


AGACGTACTC 


TTTGATGCAC 


CAGGACGTAT 


CATCCAATAC 


GGTAAAAACG 


GTAAATTGGC 


21300 


TGAGTTGAAT 


GACCTCTTCA 


CAGATGAATT 


TGTTAAAGAT 


CTCAACAATG 


AAAACATCGT 


21360 


ACAACCAAGT 


AAAGCTGGAG 


ACAAGGCTTA 


TATCTATCCC 


ATTAGTTCTG 


CCCCATTCTA 


21420 


CATGGCAATG 


AACAAGAAAA 


TGTTAGAAGA 


TGCTGGAGTA 


GCAAACCTTG 


TAAAAGAAGG 


21480 


TTGGACAACT 


CATGATTTTG 


AAAAAGTATT 


GAAAGCACTT 


AAACACAAGG 


GTTACACACC 


21540 


AGGTTCATTG 


TTCAGTTCTG 


GTCAAGGGGG 


AGACCAAGGA 


ACACGTGCCT 


TTATCTCTAA 


21600 


CCTTTATAGC 


GGTTCTGTAA 


CAGATGAAAA 


AGTTAGCAAA 


TATACAACTG 


ATGATCCTAA 


21660 


ATTCGTCAAA 


GGTCTTGAAA 


AAGCAACTAG 


CTGGATTAAA 


GACAATTTGA 


TCAATAATGG 


21720 


TTCACAATTT 


GACGGTGGGG 


CAGATATCCA 


AAACTTTGCC 


AACCGTCAAA 


CATCTTACAC 


21780 


AATCCTTTGG 


GCACCAGCTC 


AAAATGGTAT 


CCAAGCTAAA 


CTTTTAGAAG 


CAAGTAAGGT 


21840 


AGAAGTGGTA 


GAACTACCAT 


TCCCATCAGA 


CGAAGGTAAG 


CCAGCTCTTG 


AGTACCTTGT 


21900 


AAACCGGTTT 


GCAGTATTCA 


ACAATAAAGA 


CGACAAGAAA 


GTCGCTGCAT 


CTAAGAAATT 


21960 


CATCCAGTTT 


ATCGCAGATG 


ACAAGGAGTG 


GGGACCTAAA 


GACGTACTTC 


CTACACGTGC 


22C20 


TTTCCCAGTC 


CCTACTTCAT 


TTGGAAAACT 


TTATCAAGAC 


AAACGCATCG 


AAACAATCAG 


22080 


CGGCTGGACT 


CAATACTACT 


CACCATACTA 


CAACACTATT 


GATGGATTTG 


CTGAAATGAC 


22140 


AACACTTTGC 


TTCCCAATGT 


TGCAATCTGT 


ATCAAATGGT 


GACGAAAAAC 


CAGCAGATGC 


22200 


TTTGAAAGCC 


TTCACTCAAA 


AAGCGAACGA 


AACAATCAAA 


AAAGCTATGA 


AACAATAGTC 


22260 


CTTAGTTATT 


CTATAAAAAG 


TAGTTTTTTA 


AAGAACCTAA 


GAGTGTATAC 


CCCCTTTTCC 


22320 


CTCTACACAG 


ATAGTGTAAG 


AAAAGGGGCC 


TTTTCTTTAA 


AATGTAAGAA 


ACTGTCACGA 


22380 


AATTAAAATG 


AAGTTCTTAC 


ATAAGCGAAT 


CATAAAAAAT 


TTCATTTTGA 


ttttaaaaoa 


22440 


GTTCAAGAAA 


GTCAAAAAAT 


TATTCTATTT 


GAAAGAGAGG 


TGCCGACTGT 


GAAAGTCAAT 


i2500 


AAAATCCGTA 


TGCGGGAAAC 


AGTGATTTCC 


TACGCTTTCC 


TAGCACCAGT 


ATTATTCTTC 


22560 


TTTGTCATCT 


TTGTGTTGGC 


TCCGATGCTG 


ATGGGCTTCA 


TTACAAGTTT 


CTTTAACTAC 


22620 


TCAATCACTA 


AATTTGAGTT 


TGTAGGCTTG 


GATAACTATA 


TCCCTATGTT 


TAAAGATCCT 


22680 


GTCTTTACAA 


AATCTCTGAT 


TAACACAGTT 


ATTTTGGTTA 


TTGGATCTGT 


ACCACTTCTT 


22740 


CTTCTATTCT 


C ACT CTTTGT 


AGCATCTCAG 


ACCTATCATC 


AAAATGTCAT 


TGCCAGATCC 


22800 


TTCTACCGTT 


TCGTCTTCTT 


CCTTCCTGTT 


GTAACGGGTA 


GTGTTGCCGT 


GACAGTTGTT 


22860 
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TGGAAATGGA 
ATCATCAGCC 
ATTATTCTCT 
AATATTGACA 
TTTTGGAAGA 
ACAATTAACT 
TACTCAACAA 
GGCTATGCCA 
CAATTTAAAG 
ACAGAAAAAA 
ACTGTGCTGT 
GATACAATTG 
CAACTCATCG 
GTAACCATGT 
CCTTTCTATG 
CAAGTTGTCC 
TCGGCAGTTA 
AGTGAAAATA 
CGTACCTTCT 
TTTACCTTCA 
AACAATTTGA 
GGTTTGATTA 
TTCCAAAAAT 
AAAATCTCTT 
CTTCCTAGTT 
AAGTATATTT 
TGCAGTTACT 
TATCAGAAAC 
AGGGATTCAT 



TTTATGACCC 
AAAACATTTC 
TGACCACTTC 
ATTCACTGGT 
TTAAATGGCC 
CATTCCAGTG 
GTACCTTGAT 
ACACAATTGC 
TACTTGGAAA 
AACCATTAAC 
TCATCTTTCC 
TTATTCCTCC 
TGCAGAACCC 
TCTTAGTTTG 
GTCAACGCAT 
TTGTACCATT 
TCTTGCCTTT 
TCCCTACAGA 
GGAGTGTAGC 
TCAATACTTG 
CCATCTCACT 
TGGCAGGAGC 
CCTTCACACA 
CAAACTACGT 
TGTTCTTCAA 
TATAGATTTA 
TTATGAACTT 
GAAGGAAAGA 
CCTAATTTAG 



ACTATCAGGG 
TTGGTTGGGA 
AGTTGGTCAG 
TGAAGCGGCG 
AAGCCTTCTT 
TTTCGCCTTG 
GTACTACCTT 
TGTCTTCTTG 
CGACGTAGAA 
AGCCTTTACT 
ATTCTACTGG 
TCAGTGGTTC 
TGCCTTGCAA 
TGCAACCTCA 
TCTATTTGCT 
GGTACGTATC 
GATTGGATGG 
GTTGCTTGAA 
CTTCCCGATT 
GAATGACTAC 
TGGGGTTGCG 
TGCCCTTGCT 
GCGTATTACT 
CAGCTTCACC 
TTTTCATTGA 
GAGAATATAG 
TTGTCAGACA 
GTATGATTTT 
ACAAGGCTAT 
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ATTCTAAACT 
GATAAAAACT 
CCCATCATCC 
CGTGTTGATG 
CCAACAACTC 
ATTCAGCTTT 
TACGAAAAAG 
GCACTCATGA 
TACTAAAGAA 
CTTATTTCAA 
ATTTTGACAG 
CCTAAAATGC 
TGGATGTGGA 
TCTCTAGCAG 
ATCTTT ATCG 
GTCAACTTCA 
CCATTCGGTG 
TCAGCTAAAA 
GTGAAACCAG 
TTCATGCAAT 
ACCATGCAGG 
GCTGTTCCAA 
ATGGGACCGG 
TTGCCATACT 
GTATAGGAAA 
AGGTTATAAC 
CTTATAAACT 
TGACGATTTG 
CGACTATCTC 



TTGTCCTTAA 
GGGCATTCAT 
TTTATATCGC 
GTGCAACTGA 
TTTATATTCC 
TGACATCTGG 
CCTTCCAATT 
TTGCTATCGT 
AGGAGACAGC 
CAATCATTTT 
GGGCATTCAA 
CAACCATGGA 
ACTCAGTATT 
GTTATGTATT 
CTGCTATGGC 
TGGGAATCCA 
TCTTCCTCAT 
TCGACGGTTG 
G G TTTGCAGC 
TGGTAATGTT 
CTGAAATGGC 
TCGTCACAGT 
TCAAAGGATA 
TAAGTATTGC 
ATCAATCTAT 
TGTCTACAAA 
TAAGAATGCT 
AAAAACATCA 
TACCAACATC 



GTCCAGCCAC 
GGCGATTATG 
TGCCATGGGG 
GTTTCAAGTT 
AATCATCACA 
TCCTCCAAAC 
GACAGAATAC 
AAGCTTTGTT 
TATGCAATCT 
GCTCTTGTTG 
ATCACAACCT 
AAACTTCCAA 
TATCTCATTG 
GGCTAAAAAA 
GCTTCCAAAA 
TGATACTCTC 
CAAACAGTTC 
TGGTGAGATT 
CCTTGCAATC 
CACTTCACGT 
AACCAACTAT 
CTTCCT AGTC 
ATACTCTGCG 
CTGCGGTTAG 
CAAGATACAG 
ATGGAGGGTA 
TTTAGTTAAC 
CCTTTTACAA 
GTAAGGATTC 



22920 

22960 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23S20 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24S40 

24600 



OMcnnrirv ,w»n 0010011 a? i -> 



WO 98/18931 



PCTAJS97/19588 



172 



TTTCGAATTA 


GGAAAGTATG 


ATATTCATGG 


AGATAAAGTC 


TTTCTAGTTG 




24660 


TGTCCTCAAT 


CAAGCTGAAA 


ATGATCAATT 


TGAGTATCAT 


AAGAACTATG 


CAGAii i^jtA 




TTTGCTGGTA 


GAAGGACATG 


AATATTCGAG 


CTACGGTTCA 


CGTATCAAAG 


ACGAGGCAGT 


"> a t on 


ACCATTCCAC 


GAAGCGAGTG 


ACATTGGCTT 


TGTTCATTGT 


CATGAACACT 


ACCCACTCTT 




GTTGGGTTAT 


CACAATTTTG 


CGATTTTCTT 


CCCAGGTGAG 


CCACATCAGC 


CAAATGGTTA 


^ JAMA 

24900 


TGCAGCCATG 


GAAGAAAAGG 


TTCGAAAATA 


TCTCTTTAAA 


ATTTTGATTG 


ATTAAAAATA 


24960 


GGATGAATTG 


TTTTTTTGTA 


AAGCTTTGAT 


AATACTCTAC 


CATGAAATTG 


ATCITTGTGA 


25020 


GGTAGAGAAA 


TGAGAATAAA 


ATATTTAAAA 


ATTGGTATCT 


TCTAAGTATC 


CTGCAAGAGC 


25080 


TAGTTTCTTA 


GATGGACAGG 


GGATTACAGT 


TGATGAGATG 


GCTTGGATAA 


TTAGGGGCAT 


25140 


TGTGAATGCA 


TTGATTGGTA 


GATACATAAA 


ATTAGGTACT 


TATGCCGCTA 


AGTATGGTAT 


25200 


TAGTATGGCA 


CGCTCCATCT 


TAAGTAGGGT 


AGCTGCAACT 


GCAGCAGCAA 


GAGTAGGATT 


25260 


ACTGACCAAG 


ATTTCTGGAT 


GGATTTTACG 


AGTAGCTGTG 


AATGTAGCPG 


ATGTATATGG 


25320 


TAATTTTGCC 


AACAATATTG 


CTGCAGCTTG 


GGATGCATAT 


GATAAAATTC 


CTAACAATCG 


25380 


TCCTATAAAC 


TTTTAAAATG 


CGAGAATGAA 


AGCACTTTGT 


ATTTTTTTAT 


TGAATATGTT 


25440 


AGCTTGGACA 


GTGCTTGCAA 


TGATAATTCG 


TGGAGGGCTA 


CATGGATTTG 


ATAGGCATAC 


25500 


TTGGAGTACT 


ATTTTAATTG 


CGTCGCTGTT 


CGGGGTATAT 


GATTATAAGC 


CCATACATAA 


25560 


AAATAGAAAA 


AAGTCCAAAA 


GAAAAAATAG 


ATTTGTTCAT 


GGTAGGGACT 


TATGAAAGCT 


25620 


TTACTGACAA 


AAAAGAAAAC 


AGTTTACAAA 


GAAAAATGAT 


GGAGGAGCAA 


ACATGGCACA 


25680 


AAAAGGAGTA 


AGCCTTATCA 


AGGCAGCATT 


TGATACAGAT 


AACTTTCTCA 


TGCGTTTTAG 


25740 


TGAGAAGGTC 


TTGGACATCG 


TGACAGCCAA 


TCTTCTTTTT 


GTCGTCTCTT 


CTTTACCCAT 


25800 


CGTGACGATT 


GCAGTGGCTA 


AAATCAGCCT 


CTACGAGACC 


ATGTTCCAAG 


TTAAGAAGAG 


25860 


CAGACGGGTG 


CCTCTTTTTA 


AAATCTATCT 


AAGATCTTTC 


AAGCAAAATC 


TGAAACTAGG 


25920 


TCTTCAGCTG 


GGTTTAATGG 


AGTTAGGAAT 


TGTGTTTCTT 


ACCCTT TCAG 


ATCTCTATCT 


*» C BOA 


TTTCTGGCGT 


CAAACACCTC 


TGCCCTTCCA 


ATTGCTGAAA 


GCCATTTGTT 


TAGGTATTCT 


26040 


GATTTTTCTT 


ACTATCCTGA 


TGCTGGCTAG 


TTACCCTATC 


GCGGCACGTT 


ATGACCTATC 


26100 


TTGGAAAGAA 


ATTCTTCAAA 


AAGGATTGAT 


GTTGGCTAGT 


TTTAACTTTC 


CTTGGTTCTT 


26160 


CCTCATGTTA 


CCCATTCTTG 


TCCTCATTCT 


GATGGTTCTT 


TATCTCTCCG 


CCTTCAGTCT 


26220 


ACTCTTAGGT 


GGCTCAGTCT 


TCCTACTTTT 


TGGGTTTGGA 


CTATTGGTCT 


TTATCCAGAC 


26280 


TGGATTGATG 


GAGAAAATTT 


TCGCAAAATA 


CCAATAGGAG 


CTTTATTTCT 


GAAACTACTT 


26340 


TCAAAGGCTC 


CAAACGCTAT 


TCTATAAGCG 


AGAAACTAAA 


ATCGG 




26385 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2716 base pairs 
(8) TYPE: nucleic acid 
(C) STRAND EON ESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CCTGCCCGCA TTGCCCTACG CATTAAGTAA ACATATAAAA GCATGTGAGA GACTGTTGGA 60 

AAAGCGAGGA AATTTCCCCT CTTTTCCTCT AGTCTCTCCT TTCTTTTGCT GATTTTATTC 120 

AAAGAAAATG ATATAATAGT AGTTATGGAG AAAAAGAAAT TACGCATCAA TATCTTGACT 180 

TCAAGTGAGA AAGTAGCAGG ACAGCGAGTT TCAGGTGCTT ACCGTGAATT AGTTCGTCTT 240 

CTTCACCGTG CTCCCAAGGA CCAATTGATT GTTACAGAAA ATCTTCCAAT CGAGGCAGAT 300 

CTGACTCACT TTCATACGAT TGATTTTCCC TATTATTTAT CAACCTTCCA AAAGAAACGC 360 

TCAGGGAGAA AGATTGGCTA TGTGCATTTC TTGCCAGCTA CACTTGAGGG AAGTTTGAAA 420 

ATTCCATTTT TCTTAAAGGG AATTGTGAAA CGCTATGTAT TTTCTTTTTA CAACCGGATG 4 80 

GAGCACTTGG TTGTGGTCAA TCCTATGTTT ATTGAGGATT TCCTAGCAGC TGGTATTCCA 54 0 

CCTGAAAAAG TGACCTATAT TCCTAACTTT GTCAACAAGG AAAAATCGCA TCCTCTACCA 600 

CAAGAAGAGG TAGTCAGACT GCGCACAGAT CTTGGTCTTA CTGACAATCA CTTTATCGTA 660 

GTAGCTGCTC CGCAAGTTCA GAAACGTAAA GGGATTCATG ACTTTATCCG TCTGGCTGAG 720 

GAATTGCCTC AGATTACCTT TATCTGGGCT GGTCGCTTCT CTTTTGGTGG TATGACAGAT 780 

GCTTATGAAC ACTATAAGAA AATTATGGAA AATCCCCCTA AAAATTTGAT TTTTCCAGGC 840 

ATTGTATCGC CAGAGCGGAT GCGCGAATTG TATCCTCTAG CGGATCTTTT CTTGTTGCCT 900 

AGTTACAATG AGCTCTTTCC TATGACTATT TTAGAAGCTG CGACTTGTGA GGCTCCTATT 960 

ATGTTGCCTG ATTTAGATCT CTATAAGGTC ATTTTGGAGG GAAATTATCC GGCGACAGCG 1020 

GGTACAGAAG AGATGAAAGA GGCTATTTTG GAATATCAAG CAAATCCTCC TCTCTTAAAA 108 0 

GATCTCAAAG AAAAGGCTAA GAATATTTCC AGAGAGTATT CTGAAGAGCA TCTGTTACAA 1140 

ATCTGCTTGG ACTTTTATGA GAAACAAGCC GCTTTAGGGA GAAAGTAAAA AGTGAGGTAA 1200 

TCTATGCGAA TTGGTTTATT TACAGATACC TATTTTCCTC AGCTTTCTGC TGTTGCGACC 1260 

AGTATTCGAA CCTTCAAAAC AGAACTTGAA AAGCAGGGAC ATGCTGTTTT TATCTTTACG 1320 

ACGACAGATA AGGATGTCAA TCGCTACGAA GATTGGCAAA TTATCCGCAT TCCAAGTGTT 1380 



Ducnnrtn. -urn oot pckji ao i ^ 



WO 98/18931 



PCT/US97/19588 



CCTTTCTTTC 


CTTTTAAGGA 


TCGTCGCTTT 


174 

GCCTACCGAG 




1440 


ATTGCTAAAC 


AGTATCAGCT 


AGATATTATC 


CATACTCAGA 


CAGAA l i l l w l ^» 1 * * ** 


1500 


TTGGGGATTT 


GGATTGCGCG 


TGAATTGAAA 


ATTCCAGTCA 


TCCATACCTA TCAtAW.ww 


1560 


TATGAAGACT 


ATGTCCATTA 


TATTGCTAAG 


GGGATGTTGA 


TCCGGCCG AG TATWj i\,aau 


1620 


TATCTGGTTA 


GAGGTTTCCT 


GCATGATGTG 


GATGGGGTTA 


TTTGCCCTAG TGACATTviTC 


16B0 


CGTGACTTGC 


TATCTGATTA 


TAAGGTCAAG 


GTTGAAAAAC 


GGGTCATTCC TACTCGGAtt 


1740 


GAATTAGCCA 


AGTTTGAGCG 


TCCGGAAATC 


AAGCAGGAAA 


ATTTGAAAGA ACTGCGTAGT 


1 BOO 


AAACTAGGGA 


TTCAAGATGG 


TGAAAAGACG 


TTGCTTAGTC 


TTTCGAGAAT CTCCTATGAA 


I860 


AAAAATATTC 


AAGCAGTTTT 


AGCAGCCTTT 


GCTGATGTTC 


TGAAAGAGGA AGACAAOGTT 




AAACTGGTAG 


TAGCTGGGGA 


TGGCCCTTAT 


CTGAATGACC 


TCAAAGAGCA AGCCCAGAAC 




PTJ^SAGATTC 


AAGACTCAGT 


CATCTTTACA GGGATGATTG 


CTCCTAGTGA GACGGCTCTT 




TACTATAAAG 


CGGCGGATTT 


CTTCATTTCG 


GCATCGACAA 


GCGAAACGCA AGCTTTGACC 


o i /> ft 


TACTTGGAAA 


GCTTAGCCAG 


TGGAACACCT 


GTCATTGCTC 


ACGGAAATCC TTATTTGAAC 


•4 1 £ ft 


AACCTCATCA 


GTGATAAAAT 


CTTTGGAACC 


TTGTACTATG 


GAGAACATGA TTTGGCTGGT 


2220 


CCTATTTTCG 


AAGCCCTGAT 


TGCAACACCA 


GACATGAACG 


AGCAT AC CTT ATCAGAGAAA 


1 "1 O A 

2280 


TTGTATGAGA 


TTTC AGCTG A 


GAACTTTGGG 


AAACGAGTGC 


ATGAGTTTTA TCTGG ATGCC 


2340 


ATTATTTCAA 


ATAACTTCCA 


GAAAGATTTG 


GCTAAAGATG 




2400 






TCAGCAACAG 




TACCTGTAAA AGGATCTAGA 


^ A £ ft 

Z4QU 


TTTAAGACAG 


TTTTGTATCT 


GTGGTTGCTG 


CGCATGTTGA 


AGGCTTCAAA 


AACACAGTTG 


ATCAGTATGA 


GAGACTATTG GAAAGACCAT 


2520 


GAAGAATAGA 


AAGAGGAACA 


GCTATGAAAA 


AAACAATTAA 


TCAGAAGCCG TCGTGATAAA 


2560 


AAGATTGCGG 


GTGTTTGTGC 


TGGGGTGGCC 


CATTATCTGG 


ATATGGATCC GACTATCGTT 


2640 


CAAGTCATTT 


GGGGTGTTCT 


TACTTGCTGT 


TACGGAGCTG 


GAATTGTAGC TT AC ATT ATT 


2700 


TTATGGATTA 


TCGCGA 








2716 


(2) INFORMATION FOR SEQ ID NO: 5 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13926 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDN ESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCTTTT GCCTTATTCA ACACATGAGG GCCATCAGGA ATGATCTGAA ACTGCGAATC 
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TGTTAACAGT 


CTATGGAGAG 


CTTTCATAGA ACTAAGATTC GGTTTATCTT 


TGCTGCCACA 


120 


AATTAGTAAG 


GTTGGATAAG 


CGTAAGTTCC 

WW 4 A A w w 


TGCTATATCC 


GTTAAATCAA 


GTGTCTTCAA 


180 


CTCCTCAGAA 


ACTCCGACCA 


T AAG ACTGTT 


GTCTGCTCCC 


TGTTTTTCAA 




240 


GGGAACTAGT 


TTAAAAATCA 




ATAAAATAGG 


ATATTCCCTG 


CTAATTTAAG 


300 


CGGGCATCCT 


GACAGAATCA 


J\j\kj\m 1 w>V»#WMi» 


ATTTGGTAAA 


TCGTAACTGG 


AAAGTTCTAG 


360 


TGTCAGCGCA 


GCACCTAAGG 


ALAa 1 w> w«*\ 1 


CAAAACAAAA 


GG'ITCTGTCT 


CTTGAGCTAG 


420 


GTGCTGATAA 


ACTCGCTCTP 


TAGCTTGTTb 


M 1 MV* 1 1 ** w 1 *% 


ACTCCAGAAG 


GAAATAACTC 


480 


GATAGCCTCA 


GAAGGATAAT 




ATTCCGAACT 
nl l ww\J#%«w a 


TCTTTCCAAG 


ACTCTGCTGA 

#*W 4 W A **W * W*» 


540 


CTGCCCTAAC 


CCATGCAAAA 


ATATTAATT 1 


riTPTACTTC 


TCCTCAAGGC 


*PT AATTC AT A 


600 


CAAGCCTCTC 


ACTGCATTAC 


AGCCGTAAAT 


AGCTTCTGCT 




PTCWAAGtTT 

W l\lUL/IIWIU 4 


660 


CAAGACTTTC 


TCTTCTACCT 


GTCCTGTTTC 




TCACCGTAAA 




720 


GATTCCAAGT 


CGGATAGGCG 


GT GTGT AG AG 


III 1 wwAuww 


t\ 4 4 4 4 W^wrM 


wUMAl 1 4 ww 


780 


TATACAGCTT 


TCAAGCAGTT 


CTCCTGACTT 




/\ 4 W 4 i W A W A a 


VV I 4 w 1 V— V— 1 MVf 


840 


GCTCAAATGC 


GGTCGCTGAG 


TGGTTTTAAA 


fTAGGTAAAG 


GATTGATTCA 


IV AGf AGtTTC 


900 


CTGAACACAG 


ACTTGGGCCT 


GACAAAAGt- I 




GCGGTTAATA 


fTTGACGATT 


960 


GACTTCTATC 


TCTCCAGATT 


TGCTAAGGwT 




CGGTAATCTC 


OA 1 4 rtV*W> 4 4 W 


1020 


ACAATCCTGA 


CACTCTTCCT 


CAATCTTGTG 


1 WWW/WlW 1 W * 


TCTGCATCAA 


AAGGAAAACC 


1080 


AAAATAACSA 


CTAGCTTTTC 


Tw>Auw>wl 1 l\i 


f* AG ATGTTGT 


TCTTCAAACA 


* W* • * V* * * * 


1140 


TTGGCTGATT 


TTTCCAGTTG 


TnA 1 1 4 * v» 


GAACGGAGCT 


TGTTTACGAT 


AGAGAACTGC 


1200 


TGCCTTTTGA 


TGAACCTCTC 


Vjw 1A1 1 w,/%V*#% 


TTCCCATGTG 


CTATCCCAAG 


TAATCCCTCC 


1260 


GCCAACTCCA 


TAAATGGCTT 


GACCTTTGTG 


AAGTTGAATG 


CTAv.uAAl\>w 


CCACATTAAA 


1320 


AATCCCTCCT 


CCATTTGGAA 


GCAAGAGACC 


AATCCTTCCA 


CAGTAGACTC 


CACGCGGTTG 


i ion 


AGGCTCCAAG 


TCCTTGATAA 


TCTCCATTGT 


CGCAATTTTC 


GGTGCACCCG 


TTATGGAACC 


1440 


ACAAGGAAAG 


AGTGAGCGGA 


AGATTTCAAC 


AAGGTCCACA 


TCCTCTCGCA 


ACTGACTCTT 


1500 


GATGGTCCAA 


GTCAT CTGCC 


AAACAGTTGA 


ATACTGCTCT 


ACCTGACACA 


GACGCTCCAC 


1S60 


GTGCTCGCTC 


CCAACTTCAG 


AAATACGCTT 


CATATCATTG 


CGCAAGAGCT 


CCACAATCAT 


1620 


CATATTTTCA 


GAGCGATTTT 


TGCGATCCTG 


TTCCAACCAA 


CTGGCCTGTT 


CAAGATCTTC 


1680 


TTGGTCAGTT 


ACCCCACGCT 


GAGTCGTCCC 


CTTCATTGGT 


CGTGTTGTCA 


ACTCGCGATC 


1740 


ATTTTGCTCA 


AAAAAGAGCT 


CTGGGCTCAT 


CGAAATCACT 


GTCATCTCGT 


CATGTTCCAC 


1800 
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ATAGCCATTG 


TAGCCCCCCT 


CCTGCTCTAC 


CACCATACGA 


TTGTAGATGG 


CAAAAGGATT 


1860 


CCCATTTAAC 


TTTTGCTTAA 


GTTGGACGGT 


GTAGTTGACC 


TGATAGGTAT 


CTCCCTGCCC 


1920 


TAAATGATCG 


TGAATTTGGG 


CAATGGCCTT 


TTCATAGTCT 


GCTGCAGACG 


TTACTTCCTG 


I960 


CCAATTTCAG 


GGCAAATCAA 


TATCCTCATA 


AGTCAGAGGA 


ATAGGGGAAG 


TTTCTACGAT 


2040 


ATCATCAACA 


GTAAAGTAAA 


CCAGCTACTC 


TCCCAGTAGG 


GGATCCTTGT 


GAACTGCTAA 


2100 


TTTTTCCTCA 


AAAGCAGGTG 


CAGCCTCGTA 


GCTGACATAC 


CCCACCACAT 


AATAACCTTC 


2160 


CTCTTGGTAG 


CTTTCCACTT 


GTGCCAGCAA 


ATCTGCCACT 


TCTTCTACAT 


TTCTCGTTTT 


2220 


CAACTCTTTA 


ATAGGCTGGG 


TAAAGGTATA 


TCTCTCCCCC 


AAAGTCCTAA 


AATCAATCAC 


2280 


TGTTTTTCTA 


TGCATACCTT 


AAGTATAGCA 


TAAAATAAGA 


AAACCCTCAT 


CCGCAAAGCA 


2340 


GATGAGAGAT 


TTCAATTATT 


TAAAGATTGA 


AGTTTTAAAG 


CTATTTCTTT 


GTTCAAGAAG 


2400 


TTTCTTATAA 


ACAGcrrcrr 


TTAATTTAAC 


TGTATTATTC 


ATAGATACTG 


TTTTATTACC 


2460 


CTTTGCTTCT 


TGTTTAAGAG 


TTTCGGCATC 


TTTTTTAACA 


GCTTCTTTAA 


ACAATGTCAG 


2520 


TAAATGATCG 


TATGATGAAA 


CGGAAGAACC 


ATTTACTTCG 


AATGTTGTTA 


ATCCTTTCGT 


2S80 


TGCTTTATCT 


TTAACTTCTT 


TGAAGTAACC 


TTTTTTAAAT 


TCTTCAATAG 


TATTAAATGT 


2640 


ATTGTTAGAT 


ATTTT CTTGA 


TAATATATTC 


ATCACTTAGA 


ACAGACTCAC 


CATCTGTTTT 


2700 


AGATTGTTGT 


TTATATTTAT 


TTGAAGCATA 


ACCTAAGAAC 


CCATTTTCGT 


ATCCGTAGTA 


2760 


ACCCCATAAT 


CTAAAACCAT 


TATGTTTGAA 


TGAAACAGCT 


CCAGGAGCAC 


CTTTACTAGT 


2820 


ATTACCTCCG 


TAGATACCGG 


TCATCATTCT 


AACACCTACA 


TAAGGTGATT 


GATCGTTATA 


2880 


GCTAATTGCT 


TCGGGTTTAT 


AGATACCATT 


ACCTGGATTG 


CCATTAGTCA 


TTAATTGTTG 


2940 


ATCAACTAAA 


TCATTAACAG 


ATTGAATATT 


TAATTCATTT 


ttctcttctt 


GACTTAGATT 


3000 


TCGAATTTTA 


TCCCATTGAT 


TTAATTTATT 


GTTATCACGG 


TATTCTCTAT 


CTATTTTTTT 


3060 


GAACCATGCA 


CTATTTAAAT 


CTTTATTTTC 


TTGAGAAATC 


ACAGATTCAG 


CCTCAATTTC 


3120 


ATCAAGAAGA 


CTTAAAGTCT 


CATTATAACC 


CTTCATATAT 


CTATTAATAT 


CTTCTCGTGT 


3180 


TTTTAGAGTT 


TTTGGATCTG 


TAATATACCA 


CTGATTCCCA 


TCATTTTTGC 


GTTTAAATAC 


3240 


CATATTAATA 


CCTAAAGAAC 


CAAACTCATC 


AAATCCACTA 


CCAGTAACAG 


GACTTTGTAG 


3300 


CATACCCTGA 


CCATATGCTT 


CAGCATCAGT 


ACCTTCACGG 


TGTCCAAAGC 


CACCTAAGTA 


3360 


AATCGCACGG 


TCGTTGACGT 


GTGTTGTTTC 


ATGTGTGTAA 


ACTGAAATAC 


CGTATTCACC 


3420 


AACCATTTCT 


AAATGAACAT 


ATTTTACATC 


AGTTCTAATA 


TC AT C AG ACT 


TAGGATATAT 


3480 


AGCAGCATAA 


GCTCCTGTTC 


CATTATAATT 


ATAATACTTA 


TCCATAGGAC 


CAAAGAATTC 


3540 


TCTAAGAGGA 


CTATATACTT 


TGTCGGTATT 


ATAGCGGCCA 


TATTTTTCAA 


CCCATCCACC 


3600 
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AGCAGCGTTA 
GTTATCAGAC 
GTTAACGATA 
GGCAATTGCA 
GACTATAGTA 
TTTAATTTCT 
AATAAACCAA 
TCCATCTAAA 
ACGTGAACCT 
AACTTTTTCA 
AAATTTAACT 
TTTCTCGACT 
TTCGTTTTGA 
TTCTTCTAAA 
CTG AT A CAT C 
AAGAGTACTG 
AGTATATTCT 
TGTACCCTCC 
ATTGTTGTTC 
TAATTTATTA 
ATTTTTATAG 
TTCATTAAGT 
AGTAGTAACT 
AAATGATCTA 
AGCATAATCA 
AGCATAGCTG 
AACTTCTACT 
TTTTGTACCA 
GGCGCGTGTT 



TAACCTTCCC 
GCTAGACGAT 
TCTTTAATAT 
TTATAATTTG 
TTTCTAAGGT 
TCCACCTCAG 
TCGTTCATAT 
TTACCTGATT 
TTAATGTTGT 
CCATAGAAAT 
CCATAGTAAC 
TTATCACGAA 
ACTAATTTTG 
TATAAATCTT 
GTTTGAGACT 
TTATCTTTTT 
TTTACATTAC 
GCATAGTGAA 
ATCGCGGTAA 
CCTTGATATA 
GCTTGGTTAT 
TTTGCAAGAC 
TTAGCATCAG 
TTACCTGACG 
GAATCATCAA 
ACAGAATTAC 
GGATTTGAAA 
ATAAGATGGT 
TT G TTTCCTG 



AAATAGGAAT 
ACCAGAAATC 
CTTCTAATGA 
AAATTAAATA 
GACTTCGTTT 
AAGCGCGTTC 
TGTCTATATT 
TAGTATATTT 
TCTCTTTAGA 
CTGGTTTGAA 
GATTTAGGTA 
TCATTTGACG 
TGATTAGGTT 
TGATTGCATT 
GAAGCTCTAC 
CAACGATATT 
TTAAACCTTC 
CAATAATTTT 
CAGAAAGAAC 
CAATATAATC 
ATTCAGCGTT 
GAGATAGATC 
CTTGTTCTTT 
AATATCCCTC 
CGTCGTTAGA 
TTACCGTACC 
CATCTATTTT 
CTAGAGTTAA 
TAATAGTAGC 



177 
AACAGCATCT 
ATAATACTTT 
T TTTTTACCT 
AAGATGTGCT 
TAAATTATCG 
TGCGATGTAG 
TGTGAACAAT 
AGCCAATACT 
ACCGATTTCA 
TACCATTAAT 
AGTTAAACCT 
AGCAGCTGCA 
TGTTAAGTTT 
AACTCTATAG 
TGATTCTAAA 
AGGTGTATAT 
ACTGCTACAA 
ATTAGCTTCA 
TTCTTTAGTA 
TTTATTGTAG 
ATAATCTTGA 
ACTTTCTAAT 
ACTTAATTTA 
TACCGCATAT 
GCCGAATAAC 
TACAGCCCAA 
ACCTTTTACA 
TCCATAATCT 
ATCAACATAT 



CTTACTACTC 
CTATAACCAT 
AATCCCTCTG 
TTATCAATAT 
AATGCACGAT 
ACATCGTCTT 
TGTCTATTAT 
TGACCGAATG 
ATTAATCTGT 
TCTTTAATAT 
AGTAATAAAG 
GAATCATTTA 
TCTTTAACAT 
TCACCTAATC 
ATAGATTTTA 
TTAATTCCTA 
GACAAGTTAA 
TCTAGGTTTG 
TTTAGATGGT 
AATGGTATTA 
ATACTAGAAT 
TTATCAGCAG 
GTAAATGTTT 
AAATCTTTTA 
TCCTCTCCAC 
GTCTTACTTG 
ACCGACTCAG 
ACTTTAGGAA 
CCTTTTCTAA 



GTTGTTTAAC 
CTGCAGCTTT 
CACTACCAAA 
TCAGTAGTGG 
GTTTAGAATT 
CTGTAGCATC 
AATTTAAAAA 
CGTCGAATGT 
CTAATACGCT 
TAACATCACC 
CTC CT TT G TT 
GTTGATGTTC 
CTGTGAAGCT 
GATTTAGATG 
TATCATTAAC 
ACTCAGTTAT 
ACTAATCTTT 
TGATAAACTC 
GTTCTTTATT 
ATTTTTCAAG 
AGGCTTTTTC 
TAATATTGAA 
TAGATTTCCT 
TATGAGCACT 
GGATAATCTT 
CTATTGCTCC 
TTAGGAGAGC 
CTAACAAGCT 
CAATTCCTCT 



3660 

3720 

3780 
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3900 
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4020 
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ATAGTTTGTA 


CCTGCAATTC 


CCCCTGTATG 


AGAGCCATTT 


CCACTTCTAG 


AGTGTAGTTT 


5400 


GCCAAAGAAA 


GCAACATTTT 


CAATACGAGT 


TCCATCATTC 


ATATTATTTA 


CAAATCCAGC 


5460 


AACATTATTA 


CGACCTGAAA 


GTGTGCCTGT 


AATTTTGACA TTTCTAATAA 


CTGAAGAACC 


5520 


TT TCATAGT A 


TTGGCTAATG 


ATGCAATATT 


ATCTTGACCA 


GAACGTTCTA 


TCTCTACATT 


5580 


TTCAAAATTC 


ACATTATTTA 


TCGTTGCGTT 


TGTTATCACA 


TTAAATAATG 


GATGTTCCAA 


5640 


TTCAGTAATA 


GCAAATTGTT 


TTCCTTCAGA 


ACTTAAAAGT 


TTTCCTGTGA 


ATTCTTTAGT 


5700 


GATATATGAT 


TTTCCATTAG 


GAACAACATT 


TCTAGCGCTC 


ATTGATTGTC 


CCAGACGATA 


57 60 


TTCTTTTGAA 


GGATCGTTTT 


GAATAGCTTC 


CACTAATTCT 


TTGAAATTAT 


AATATACATT 


5820 


ATCTTCGTGG 


ACTTTAGGTT 


TTTCAATATA 


GTGAACGTAT 


TCTTCTTCAA 


ATTTATTATC 


5880 


AGCAGTTCTA 


GAGACTAAAT 


TGTCTGCGAT 


TGCTGTAACT 


TTATATACAG 


GTGTTCCGTT 


5940 


AACCGTACTT 


TCTTCTATAT 


TTTTAACAGC 


TAGTAATGTA 


GTTTTCTGAT 


TATTTGAAGT 


6000 


TATTTTTAAA 


TAATAATTGC 


TCTTATCATC 


AGGAATAGTT 


CTTATCAGTG 


ATTCATTAGT 


6060 


TTCTTTTCCA 


TTTTCGTATT 


TGATTAAATC 


TGTACGTTTA 


ATATTTTTAA 


GCTCAACTTT 


6120 


TTTAAGATCT 


AATTGAATAT 


TTTGATTTTC 


TAGACTTTCA 


CTTTCTTCAC 


CGTTACCTCT 


6180 


CTCCTAAATC 


ATACTTGTAG 


ATAGGGTGTA 


TTCTTTGTAG 


TACTCTAGGT 


TCTTAAATGC 


6240 


AGCGCTTATA 


GTTTCTGTTG 


TTACCTTGTC 


ATCTCTAAGG 


ACTACACTAT 


TAATAACTTC 


6300 


TTCTCCTTTT 


TTCAATTCAG 


CTGTGATTGA 


TTTGATTTTT 


GTrrrcTTTT 


GATTTTCTAG 


6360 


AGTATACTTA 


GCAACACCTT 


CACGTTCCAA 


TATTTTCTTA 


TCGCTACTAG 


TCAATGTTAA 


6420 


TATTGGCTTT 


TCAGATAATT 


CAACCAATTT 


TTCAATAGTT 


GCACTTAATT 


TTTCAACAGC 


6480 


TTCCTTAACT 


TCACTTTCTT 


TAGCATCTCT 


ATTAGCTGCA 


ACTTTTTCAG 


CCTTTGTAAC 


6540 


TTCAGTTTGG 


AGGTTTTGCC 


AACTTCTATC 


ACTGTAATGT 


TCTTTTACCT 




6600 


ATCTGCAATC 


GTATTGTTTA 


ATTCAGTTTT 


ATCAACGTTT 


AGAGCGTCAA 


TAGCCGTTTT 


6660 


AAGTTTATTT 


GTCTCGCTAT 


TTACCTCAGG 


CTCTTTTACA 


GGCTCTGAAG 


CATACACACC 


6720 


TTTTGCAGTT 


TCTAAAACAG 


GTCCAAGAGC 


ATTGTAACTT 


GCTGTAGAAT 


AATCAGTAGG 


5780 


AGAAACTGAA 


CTAGCTTTAT 


CAATTTGATT 


ATTTAACTCA 


CTTTTATCAA 


CTGGTTCTTT 


6840 


AGTACCAATA 


CCCTTTATTT 


TATCTTCTGG 


TTTCGGTGTT 


TCCTCTACAG 




6900 


TTCAGGAACT 


TCTGGTTGCT 


TTTCTGGCTC 


AACTGGTGCC 


GTTGGTGCCT 


CTTCGTCTTC 


6960 


TCTTGGCGCG 


ACTGGTTCAC 


CTGCTTGTTC 


AACTTTTGGT 


TCCTCTGTTG 


GTTCTGTTTG 


7020 


TTTTTCTACA 


GCAGG CCTTT 


CAACTTTTGG 


TTGTTCAATA 


CATTGATTAA 


CAGTCTCCTC 


7080 


TTTTGGTTCT 


ACAGTTTCTT 


CAGCCTTCGT 


ATCTGGAGTT 


GACTCTTCTT 


GTTTCGGTGT 


7140 
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TTCCTCTACA 


GCCTTCTCTT 


CTTCAGGACC 




111 1\. l v*v»w * 


WVjAW X vAj 


7200 


CTTTTCGTCT 


TCTCTTPCCG CCACTGCTTC 


ACCTGCTTGT 


4 WAAL 1 1 i 1VJ 


Al t lUMW 


7260 


TGCTTTGTCT 


GATGGTTGAC 


TTTCTGGCTT 


AAW l\»wTAWT 


TTTTCCXCTG 


GTTTTGACTC 


7320 


AACTTCTCCA 


Wv» 1 AW A * * * 


CAACTGGAGC 




GAATCTTCTT 




t JOU 


TACTTTAGGA 


nuvivi * w & w%* * 


CAGTAGCTTT 


TACwTwCGAT 


TT'PGGTTCTT 


CCmXjGACT 


r m 4U 


TTCTTCTGTT 


irt*r ft tfift^Pftf"VP 
1 1 nuu 1 * * 


W A 4 \m 4 4 4 4 \*%* 


AGCTTCCTCT 


GTCTCTACTA 


CTTGGTTTTC 


/ 3UU 


TGTCCTAGCT 




TTGTTJVTTGA 

4 4V#4 * # ■ * 4 W»» 


TTGAGGAGTC 


TCAACTTCGA 


ft ^ ft / "V^^ ft ^* 


73O0 


CTCTCCAGGT 


I I luv. i UMU\J 


114W44W4 


AACAGTGTCC 


ft ft ft ft 

AAGCCAAGCG 


TTTTGAGGAT 


1C1A 


GTCACCTGAT 


AV*A 1 nAWtftn 




GCCCTCCATT 


TCAACAAvAC 


CCTCTCGACT 


ican 
7 DoU 


AGCCAGCGCT 




rTfWtTTTAr 
W 1 wOU 1 l» 1 /w. 


AGCCCCTGCA 


CTAGGAAGAA 


CTACCAATCC 


7740 


CATAGCTCCA 


Aw fMiAnAUA 


www 1 nuww A 


TTTCTTTCTC 


TTGTAGATTA 


ft ft ft ft ft ^^^^n 

AAAGCAAGCT 


7800 


CCCAACAGTC 




AACCTCTCAA 


AACAGATGCT 


TCTGTCCCTG 


MMtA -ft ft 

TTTGAOGCAA 


7860 


CTGATCTTTT 


I VjA J A\.nWv>A 


AACCATATAC 

noV. 4 4 raw 


AACTTCATTC 


CTCTCAGCCT 


TTCCTGTCTG 


7920 


AATTAAATCT 


1 i AuV. 1 


GTCAAATAAT 


CTCTTTATTT 


ft s** ft «v* ft **^/^ ft *P 

ACATAGTGAT 


AGCTGGCTGw 


loan 

'SHU 


GTCCACTACA 


& aftrr Afire & 


TCAAAAGGCT 


TCCAAGAAAT 


ACAGAGCCTA 


C AACTCC CTT 


O Ait A 


AATCTTACGA 


A I Iwwvinvi 


fiflTPTTTTTT 

UU 4L444444 


AAACACTTTT 


& M/w/*wiMftin ft 

ATCTCCTTTA 


TTCATTCTCA 


8 100 


AAACTTCCTA 


A 1 AULA IVii i 


GCGGATAGTG 


CGCACGCGCA 


CwTwCGATTA 


ATTTTubACu 


a 1 cn 
olou 


ACTAGCCAGT 




GGGCATGACC 


AATCTCTCTC 


> k k ft T» ft 

aaaatagggg 


GAATCGviAAt. 




CTGAACATGC 


TTGACATGCA 


TGCCAATTGC 


AGTGTCTCCG 


ft fV* ft ft ft ff*^ 

ATATwwAATw 






CTTGATAAAT 


TCAACCTCAA 


CTGCATCCTG 


^ft»l» ft ft ft /■^H*^ A 

CATAAACTTA 


AAUuv. 1ULU\ 


AW 1 uUvV.V.Un 


6340 


ACCTCCTGCA 


TGAAGAGTAG 


GATGGACACT 


bALAnl I I WW 


Af* A.C f" AAACT 
nwnLLnnHV. 1 




8400 


CTGACGTTCA 


ACAACGAGAG 


CCCCATTGAC 


ATGCTCACAA 


CCTTGAACTG 


CTAAATGGAT 


8460 


ACCTCTACTA 


CCTAGAATAT 


CCAACATAGT 


CTCCACTATC 


AGCTCACCAA 


TCTCTTGACT 


8S20 


GCATTCTTTC 


CCAATATGAC 


CACCTAGCAC 


CTCACTAGAA 


GATAGACCTA 


AAACAAAAAG 


8580 


GGCCCCCTGC 


TTCAAATTGG 


TCTTTTCTAA 


AACATCTTCC 


ACTACCTGAC 


GTGTTTCTCT 


8640 


TTGAATCTGT 


GTCTCGTTCA 


TCTCTGTTAC 


CTCTGTTGTC 


ACTCTTCTAT 


CATACCGTTT 


8700 


TTTCTTGTTT 


TTAGCAAGAT 


AGACAACCTA 


GAAAGTTTGC 


CCAATTACGC 


ATAAAACTCC 


8760 


CAGAATTGAC 


TGGCACTTAG 


CTAGTTTCTA 


TTCTATTTAT 


ATATATTTCA 


ACTTTCGTCC 


8820 


CTTTTTGGGG 


TCTAGAATCA 


ATCTTCATAT 


GGTAATTGGC 


TCCAAAATGA 


AGTT TGAGCC 


8880 
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CTTGATCGAC 


ATTTTCAACA 


CCAACTCCCC 


CACGTTTGAG 


TTOACSTTCA 


CTACTATCAC 


8940 


CAGCATCTTG 


GAAGCCAACG 


CCATCATCCT 


CAATACGGAT 


GACCAATCCC 


GAATCCTGTT 


9000 


TCTGGACAGA 


AAGTTTAATA 


TGGCCCTGAC 


CTTCCTTTTC 


CTTAATGCCA 


TGGTAAAGAG 


9060 


CATTTTCTAC 


AAGGGGTTGT 


AGGACCAGCT 


TGGGTAAGAC 


TAAATTATCA 


AAGGCAACAT 


9120 


TTTCATTAAT 


TTCGTATTCC 


AGCTTATCTC 


CATAGCGTTG 


TTTCPGGATA 


AACAGATACT 


9180 


GGCGGACATG 


ATTCATTTCG 


TCAGAGAGAC 


AAATCAAGTC 


CTTCCCTTGA 


TTGAGCGCCA 


9240 


AGCGGAAATA 


GCTTGCCAAG 


CACTTGGTCA 


CCTGCACCAC 


TCGCTGACTA 


TCATGAAATT 


9300 


CAGCCATCCA 


GATGATGGTG 


TCCAAAGTGT 


TATAGAGGAA 


ATGTGGATTA 


ATCTGGCTCG 


9360 


AAAGGGCTTG 


AACTTGGTAC 


TGACGGGTCG 


TTTCTTCCTG 


G CT ACGAAT A 


GCTACCATCA 


9420 


ACTGATCAAT 


CTGATCCAAC 


ATAGCATTAA 


ATTGGCCAGT 


TACTTCTCTC 


AGTTCATAGG 


9480 


CACCAACTTC 


CTTGGCACGA 


AGATTTTGAG 


CACCAGAAGC 


AATTTCCAAC 


ATGGTTTCTC 


9S40 


TCAAATCCTT 


CAAAGGAGCA 


ATCCAGCGTT 


TAAGACTGAA 


CCACACTAAG 


CAGAGACAGA 


9600 


CAAGAAGAGA 


TGTGACACTG 


GCCCCAAGCA 


AGGTCCACAA 


GAGCTGACTC 


CGAACCTGGT 


9660 


CTAACTTTTC 


CAATGATGAC 


ACGCCAAGCA 


CCGTCCAATC 


AGTTCCTGCA 


ATCTTCTCTT 


9720 


GACTGACGTA 


GGATTTGTGA 


CCAGGAGTAT 


AACCCTGACC 


TCTATCGATG 


TAGGGTTTCA 


9780 


TACCCTCCAT 


TTTGCTAGAC 


GAACTATAAA 


CTCTCTGTTG 


AGGATGCTAG 


AC AAATTCAT 


9840 


GGTTTTCATT 


GATAATGAAG 


GCAAACCCCT 


CCTGCCCCAA 


CTGGAGTTGA 


TTGAGATAGG 


9900 


CTTCCACAGT 


TTCATAACAA 


ATATCCAAAC 


GAAGCACACC 


AAGATTGGCT 


CCCTTTGCAT 


9960 


CAACAAGTTC 


TTGAGTGACA 


GAAATGACCC 


ACTGACTATC 


TGATTTACGA 


GCTGGAGTCA 


10020 


AAACAGGCAT 


AGCTCCCTGA 


TGAATGGCCT 


TTTGGTACCA 


ATCCTCAGCC 


ATCATATCAG 


10080 


AGGAAGTTTT 


CATCTGCACA 


CTCTCATCTG 


TAGAAATGAC 


CTGACCAGAT 


TTGGTCACCA 


10140 


GCACAACACT 


TTTCAAGTCC 


TTATCTGACT 


TCAAGATGCT 


CAAAAACAAA 


TCTCCGATTC 


10200 


CCTCGACCTT 


GTCTTCACTG 


GGATTCTCAG 


CATAGGCCAG 


AACATCCCTC 


TGCTGGGTCA 


10260 


AACCAGTCGA 


GGTGGTTTCT 


AGTTTTTTGA 


TATAAGACTG 


AATAAAGTGG 


CTAGTCTGGC 


10320 


TG AT GGTCGT 


TTGGCTGTTG 


CCCTCAATGG 


TGGCCTCAAT 


GGCTGAAGAA 


CTTGATTGAT 


10380 


AGTAGAAAGT 


TCCAACCAGA 


GCTAGGAGAA 


TGAGAAAGAC 


CAGAAAGA7G 


GAAATAACCA 


104<0 


TTCTAACTAA 


AAGAGAAGAA 


CGCTTCATCG 


GTCTTCTCCC 


TTCTTAAACT 


GACGAGCTGT 


10500 


CACACCTGCA 


ATCTGCTTAA 


AACGTTGGGT 


AAAATAGTTC 


ATATCTTCAA 


AACCAACCTT 


10560 


CTCTGCGATC 


TCATAAATCT 


TCAGATCTGT 


ACTTAAAAGC 


AAGAGCTTGG 


CTTGTTTAAC 


10620 


ACGTTCTCTC 


ACCAGATAAT 


CCTGAAAAGG 


CAAGCCCAAC 


TCTTTCTTAA 


TCAAGGAACT 


10680 
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CAGATAGGTC 1 




CTAAGTCACT 


GGCTAAAGAC TTTAAACTAA 


ATTGGCTATC 


10740 


AGCCAGATGA 


r* a cwjc n 1 "T"T 

UAL I i 1 4 


TCTGGGCCAT 


GTTTCCTTCA AACCTATTAC 


TCAATAAATC 


10800 


TTGTAACTGC 


TCTTCTTTCT 


CTTCCTTGTC 


TAGTTTTTGT TTGATTTTCC 


CCAACATTTC 


10860 


CTCAATATCC 




ACCCTTTGAG 


CAGGTAGTCG TCCACACCTA 


GTTTGACAGC 


10920 


AGACAAGGCA 


n* a ft TV* ft ft ft A*P 


CATCGTAACC 

l»A 1 \^ * #ft*-*wft» 


TGTTAAAAAG ACCAAATGAA 


CCTGAGGATA 


10960 


GGTTTCTCGT 


ACCAGACTGG 




GCCATTTAGA TGACGCATGT 


TGATATCGGT 


11040 


TAAAATGATA 


TCTGGCACCT 


OL 1111 «V>A A 


CAATTCCCAA CCCTGCCTTC 


CATTTTCAGC 


11100 


CTGACCGATG 


ATTTCCATAT 


cc^ & f^nc*t*fic 


TACATTGACC AGTTTAGTCA 


AACCTTGTCT 


11160 


TACCAGATAT 


TCATCTTCTA 




TCTCTAGGTC ATGCTCTGCT 


CCTTTACCAC 


11220 


TTACTAGTAT 


CAGTATAGCA 


AAA! 1\.I\.W1 


CTAACTGCTT AGCAAAGACC 


TCTT AT ACTC 


11280 


AATAAAAATC 


AAAAAGTAAA 




AGCCACAGGT TTCTCAAAGT 


ACCGCTTTGA 


11340 


GGTTGTAAAT 


AAAACTGACG 


AAGTtwM. Iv. 


AAAGTATAGC TTTGAGGTTG 


TAGATAAAAC 


11400 


TCACGAACTC 


GATAACCCTA 


& T & ClWt* AA 


GGCGACGCTG ACGTGGTTTG 


AAGAGATTTT 


11460 


CGAAGAGTAT 


TAATCAACAT 


ft »Tf*rirTii 
AATL 1 nw 1 


ATAAGCGTAc CTTTTTCTTC 


CATTTGGTCT 


11520 


TTGGGAATAA 


AGCGGATAGA 


CAGGCTnl Iw 


ATACACTAAC GTAAGCCGCC 


CTTGTCCTGT 


11580 


GGACCATCCG 


TAAAGACATG 




GAATCTCCTA CTCGCCTCCG 


CACTTCCATA 


11640 


CGCGTCATAT 


TGTAGGACTT 


nlLl 1 * * vj 


TAGGTCACAA CATCTCGACT 


GATGGGTTGC 


11700 


CTAAAACTAG 


GCCAGCCACA 




AATTTGTCTT TTGATGAAAA 


GAGAGCTTCC 


11760 


CCACTTGCTA 


TATCCACATA 


r.ATACCCGAT 


TCAAATTTAT CCCAGTAACC 


GTTTGAGAAA 


11820 


GCTCGTTCTG TTTGATTTTC 


CTGGGTAACT 


GCATACTCCT CAGGTGACAG 


GGTCTTTTTC 


11880 


AATTCCTCAT 


CACTTGGTTT 


TGGATATTTG 


CTGGCATCAA TGACAGGATA 


GGCCGCCTGA 


11940 


TTAACATTGA 


TATGGCAGTA 


GCCAl l iwA 


TTTTTCTTGA GATAGTCTTG 


ATGGTAATCC 


12000 


TCAGCCACCA 


CAAAATTCTT 


CAAGTTTTCC 


TTTTCAACTG CTAGAGGTTG 


ATCGTATTTC 


12060 


TTACCCACCT 


CATCAAAGAC 


TTGCTTAATC 


ACTTCCAAAT CCTTGTCATC 


TGTGTAATAA 


12120 


ACACCAGTAC 


CGTACTGGGT 


CCCCACATCA 


TTTCCTTCTT TATTTTTGCT 


GCTTGGATTG 


12180 


ATAATGCGGA 


AATAGTGAAG 


CAGGATTTCC 


TTGAGAGAAA TTTGCTTGGC 


ATCATAGGTG 


12240 


ACATGGACCG 


TTTCTGCATG 


ACCTGTTTGG 


TTAATCAATT CGTACTTGGT 


TGTTTCTCCT 


12300 


CTACCATTTG 


CATAGCCTGA 


AACCGCATCC 


GTCACCCCGG GAACACGTGA 


> GAAATATTCC 


12360 


TCCACTCCCC 


AGAAACAACC 


TCCAGCTAGA 


TAAATTTCGT GCAAGTCTGC 


: GTCTTTACTA 


12420 
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ATTTCTGTTT 


TTTTCACTGC 


TTTTCCTCCT 


TGGCTAACTG 


CCGCCTTTTC 


AATTTGCGAG 


12480 


GCATCTCTCT 


GCCCTGCATT 


TCGTATCAAT 


AGAACATAGA 


AACCGGTTAT 


GGCTAGAAAA 


12540 


AATACTCCTA 


GCAACAAGAA 


GATTTTTAAC 


TTATCATTCA 


TAAGACGCCT 


CCTAGGCTAA 


12600 


TTCCTTCAAA 


GTTTGCAAAA 


TTGCATCTTT 


TTCCATGAAT 


CCTGGATCTG 


TTTTGACCAG 


12660 


CTTGCCTTCT 


TTGTCTATAA 


AGGCTTGGGT 


TGGGTAAGAA 


CGGACACCAT 


AAGTTTCCAA 


12720 


AAGTTTGCCT 


GATGGGTCAA 


CTAGGACTGG 


GAGATTTTTA 


TAATCCAATC 


CCTTATACCA 


12780 


ATTCTTAAAG 


TCCGCTTCAG 


ATTGCTCTCC 


CTTATGTCCT 


GGTGACACTA 


CTGTCAAGAC 


12840 


CACATACTCA 


TCACCAGCTT 


CTTTAGCAAT 


CTCATCCGTA 


TCTGGAAGAC 


TAGCCAGACA 


12900 


GATGGAACAC 


CAAGAAGCCC 


AGAATTTGAG 


ATAGACTTTC 


TTGCCCTTGT 


AATCAGATAA 


12960 


ACGGTAGGTC 


TTGCCATCTA 


CTCCCATCAA 


TTCAAAATCA 


CCCACCTCTT 


TCCCTTTAGC 


13020 


TGCGCTTGTT 


TTACTAGCTG 


TCTGCTCCGT 


CTTCATTTCA 


TCTTTCGTTT 


GGTGTTCACT 


13080 


AGTCACGGAC 


TTGCCTGAAC 


AAGCCGTCAA 


ACAAAGGAGC 


GAACCTGCTC 


CAAGAACACA 


13140 


TCTTTGCCAT 


TTTTTCATAT 


TGATATTCCT 


TTCCATTTTA 


TTCAAATAAT 


TGACTTAAAA 


13200 


TTGAAGCATT 


TCCAAACAGA 


ACCAAGAAGC 


CCATCACAAT 


AATGAGAAAA 


CCACCCACTT 


13260 


TTTTGAGGAT 


TCCGAGATAG 


GGATGAAGTT 


TTCGGAAATG 


TTTCAAAACA 


TAACTAGAGG 


13320 


TCAGAGCTAG 


AAGCAACAAT 


GGTAGCGCCA 


AGCCCAGCCT 


ATACACCAAC 


ATGAGACCAG 


13380 


CTCCCTCCCA 


AGCTCCTGAA 


CCACCTGAAG 


CCGCCAAGGC 


CAAAACACAC 


CCCAGAACCG 


13440 


GCCCCACGCA 


AGGCGTCCAA 


GCAAAACTAA 


ACCTCAAGCC 


CAATAAAAAT 


GCCTGACTAT 


13500 


AGCCCTTACC 


ATTTTGCCCC 


TGTCCTTGCA 


GTTGTAGCCT 


CTTTTCCTTA 


TAAAGCCCCT 


13560 


TAAAGTGTAG 


AATCTCCATT 


TGGTGCAAAC 


CAACAAGGAT 


AATAATTGCC 


CCAGTAAGAT 


13620 


ATTGGAACCA 


AGAAGCATAA 


AGCAAATCGC 


CTAAAAAACC 


AGCTCCATAG 


CCCAACAAAA 


13680 


TAAATATAAA 


GGAAATTCCT 


GCTATAAAGG 


CCAGAGTTCG 


TAATAAACTA 


GTAACTGAGA 


13740 


TTGAAAATTT 


GCCGCTAGAA 


GCCTGAGCAC 


CATCCTTATC 


ATCTAGTAAC 


ACTCCTCTAT 


13800 


AGACCGGTAA 


CAAAGGTAAG 


ATACAAGGAG 


AAAAGAAGGA 


TAGAATCCCT 


GCCAAAAAGA 


13860 


CACTTAGAAA 


AAAGAAAATA 


TGACCCATAA 


AGTTCCTCCT 


ATCATTTTAT 


TGATAGATTT 


13920 


ATTATA 












13926 


(2> INFORMATION FOR SEQ ID NO: 6: 











(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 20199 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
<D> TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CCCAGCACAA 


AAATGGCATT 


TGGAGATAA I 


Vw>nnn 1 WW » r% 


AAAAAACTAT 


CTTTGAGAAA 


60 


ATAACCTTGT 


TTATCGTCAT 


TATCATGCTA 


V* 1 amWiau * * 


TATTGGGAAT 


TTTTGCAACT 


120 


CCAATTGGTG 


CCCTCAGTAA 


TCTATAAAAT 


AGATTv. nnOn. 


&&.ATTTACTC 
nnn a i i *»v* * 


ACTGGGATTT 


180 


CCCAGCCCTT 


TTTTAAAGTG 


AGAAGAAATA 


ATGAu I Al w a 


A A » A nxji^\ * *^ w 


AGCTAAGATT 


240 


AAGCTCAAGG 


CTGGTAATGG 


TGGCGATGGT 


ATGCTTW.V. 1 


^•*Pf™^*P<' * t 'I' PIT & 

4 4 WW A V* * 


AAAATATGTC 


300 


CCTAATGGAG 


GCCCTTGGGG 


TGGTGATGGT 


GGTUjTwAVi 


W.AA 1 n-> 1 \M 1 


( 11 ( «"1 T^f3^ A 

V* 1 1 VaW ft A w 4 #» 


360 


GACGAAGGAC 


TACGTACCTT 


GATGGATTTC 


CGCTACAATC 


Oil. A I A 1V.AA 


W\tL A V»»n 1 1 Vi 


420 


GGTGAAAAAG 


GGATGACCAA 


AGGGATGCAT 


GGTCGTGGTG 


CTvAuwALv. 1 


1 /\V*AV» 1 A 


480 


GTACCACAAG 


GTACGACTGT 


TCGTGATGCG 


GAGACTGGCA 




nun ft 1 A V»n A i 


540 


GAACATGGGC 


AAGAATTTAT 


CGTTGCCCAC 


GGTGGTCGTv; 




Ann L n a 1 ^ V» A 


600 


TTCGCGACAC 


CAAAAAATCC 


TGCACCGGAA 


ft *#f 1/ *^ft*f" i h It ft 

ATCTCTGAAA 




ACCTC!ACCAA 
nww a wx»\n*nn 


660 


CGTGAGTTAC 


AATTGGAACT 


AAAAATCTTG 


GCAGATGTCt* 


f TTT &f^T l<^n 


ATTCCCATCT 


720 


GTAGGGAAGT 


CAACACTTTT 


AAGTGTTATT 


ACCTCAGCTTA 


AuLU 1 a/vv\ a 


TCCTGCCTAC 


780 


CACTTTACCA 


CTATTGTACC 


AAATTTAGGT 


ATGGTTCGCA 


UULAA 1 ^nww: 


a unn A WW AAA 


840 


GCAGTAGCCG 


ACTTGCCAGG 


TTTGATTGAA 


GGGGCTAGTC 


* arPTT.TTflf! 
nn\*v# a 1 I 


TTTGGCAACT 

ft A 4 ftf *^ ft 


900 


CAGTTCCTCC 


GTCACATCGA 


GCGTACACGT 


GTTATCCTTC 




TATGTCAGCT 


960 


AGCGAGGGCC 


GTGATCCATA 


TGAGGACTAC 


UTAwU I A 1 *-n 


A * nnnunv*v« * 


GGAGTCTTAC 


1020 


AATCTTCGCC 


TCATGGAGCC 


TCCACAGATT 


ATTGTAGCTA 


ATAAGATGGA 


CATGCU I WHj 


1080 


AGTCAGGAAA 


ATCTTGAAGA 


CTTTAAGAAA 


AAATTGGCTG 


AAAATTATGA 


TGAATTTGAA 


1140 


GAGTTACCAG 


CTATCTTCCC 


AATTTCTGGA 


TTGACCAAGC 


AAGGTCTGGC 


AACACTTTTA 


1200 


CATCCTACAG 


CTGAATTGTT 


AGACAAGACA 


CCAGAATTTT 


TCCTCTACGA 


CGAGTCCGAT 


1260 


ATGGAAGAAG 


AAGCTTACTA 


TGGATTTGAC 


GAAGAAGAAA 


AAGCCTTTGA 


AATTAGTCGT 


1320 


GATGACGATG 


CGACATGGGT 


ACTTTCTGGT 


GAAAAACTCA 


TGAAACTCTT 


TAATATGACC 


1380 


AACTTTGATC 


GTGATGAATC 


TCTCATCAAA 


TTTGCCCGTC 


ACCTTCGTGC 


TATGGGGGTT 


1440 


GATGAAGCCC 


TTCGTGCGCG 


TGGAGCTAAA 


GATGGGGATT 


TGGTCCGCAT 


TGGTAAATTT 


1500 


GAGTTTGAAT 


TTGTAGACTA 


GGAGACTGGT 


ATCGGAGATA 


AACCCATATC 


TTTCCGAGAT 


1560 


GCGGATGGTA 


ATTTTUTTTC 


CGCCGCACAC 


GTTTGGAATC 


AAAAGAAATT 


GGAAGAACTA 


1620 
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TTTAATCGTC 


TCAATCCAAA 


TCGTGCCTTC 


AGATTGGCAC 


GAACTAAAAA 


GGAAAATCCA 


1680 


TCTCAGTAAA 


GAAGCTAAAA 


AATCCCGTGC 


CTCATCAGAC 


ACGGGATTTT 


CTCGTACGAC 


1740 


AGGCATGTAT 


AGCAAACTGA 


ATCTGGAATA 


GCACAGCATA 


TCTTCTAAAA 


TATAGTAAAA 


1800 


TGAAATGAGA 


ACAGGACAAA 


TCGATCAGGA 


CAGTAAAATC 


GATTTCTAAC 


AATGTTTTAT 


1860 


AAGCAGAGAT 


GTACTATTCT 


AGTTTCAATC 


AACTATATTG 


TTATAAATTG 


ATTTGAATTT 


1920 


CAAAATTAAA 


TTGTTTCATT 


CTTATTTCAA 




TATATCTGAT 


GTCAAAGTTC 


1980 


TCGCCGAGTC 


AAATAGCGAT 


TCCCAAGCCT 


GACTATCGTG 


AGGTAGCGGA 


TTAAAATGGT 


2040 


CTGGGGATAG 


ACCGTTTTAA 


GTCTGACCCT 


GGAAATAAGA 


ATTGTCAGAA 


GAAGGGATAG 


2100 


CGAAATCGTG 


GCTCTACGAA 


CAGGAACGTG 


ATAATAAGGC 


GTATATAGCG 


GATAAGAGGG 


2160 


CATCAAACTC 


TAAAGTCCAA 


AAAGGTAGTC 


GTAACCTATA 


TGCGTAAATC 


ACGAGAGTAA 


2220 


TTGAATTCGT 


ACTAAGATTT 


TCTATTTTCA 


CTGTAACCTT 


TTAACGCCCT 


TATATCTTGT 


2280 


ATACACGAGC 


AAAGATGTAC 


GACTTATCCC 


GTGAGGTCTA 


TCACTATAAA 


GAGAAAACGA 


2340 


CAGATAGAAG 


TGATCCTGAG 


TCACGGTTAT 


CTGTCTGATA 


GGACGGTATG 


TATAAAACGC 


2400 


TTCTGTGAAC 


TGAGAGAAGG 


GGGAGAAGTT 


CTTGCTAAAA 


TTTAGTTGAA 


CAGCCGTATT 


2460 


CCGATACTTA 


GATAAGAGAT 


CTAGTCTTAG 


CTCCTACTCA 


GTTTTAGGCG 


ATAAAAAAGG 


2520 


GGCAATAGCG 


ATTCGAGAAA 


GATTATACTC 


TTCGAAAATC 


TCTTCAAATC 


ACGTCAATAT 


2560 


CGCCTTGTCG 


TATGTGTAGG 


ATACTGACTA 


CGTCAGTTCC 


ATCTACAACC 


TCAAAACAGT 


2640 


GTTTTGAGCA 


ACcTGCGCCT 


AGTTTCCTAG 


TTTGATCTTT 


GATTTTCATT 


CACTATTAGT 


2700 


AATTCAGTTA 


CTAACTCGTC 


AACTCTGATT 


TATCCAATAA 


AATTGAAAAG 


GATGCAAAAA 


2760 


AGGATAAATT 


TATGATATAC 


TTTATTTTGA 


AGACCTTATT 


AGAAATCTTG 


AAAGAGTATT 


2820 


GAAAACTTAG 


AATGAGAAAA 


ATTGTTATCA 


ATGGTGGATT 


ACCACTGCAA 


GGTGAAATCA 


2880 


CTATTAGTGG 


TGCTAAAAAT 


AGTGTCGTTG 


CCTTAATTCC 


AGCTATTATC 


TTGGCTGATG 


2940 


ATGTGGTGAC 


TTTGGATTGC 


GTTCCACATA 


TTTCGGATGT 


AGCCAGTCTT 


GTCGAAATCA 


3000 


TCGAATTGAT 


GGGAGCTACT 


GTTAAGCGTT 


ATGACGATCT 


ATTGCAGATT 


GACCCAAGAG 


3060 


GTGTTCAAAA 


TATTCCAATG 


CCTTATGGTA 


AAATTAACAG 


TCTTCGTGCA 


TCTTACTATT 


3120 


TTTATGGGAG 


CCTCTTAGGC 


CGTTTTGGTG 


AAGCGACAGT 


TGGTCTACCG 


GGAGGATGTG 


3180 


ATCTTGGTCC 


TCGTCCGATT 


GACTTACACC 


TTAAGGCGTT 


TGAAGCTATG 


GGTGCCACTG 


3240 


CTAGCTACGA 


GGGAGATAAC 


ATGAAGTTAT 


CTGCTAAAGA 


TACAGGACTT 


CATGCTGCAA 


3300 


GTATTTACAT 


GGATACGGTT 


AGTGTGGGAG 


CAACGATTAA 


TACGATGATT 


GCTGCGGTTA 


3360 


AAGCAAATGG 


TCCTACTATT 


ATTGAAAATG 


C AG CCCGTG A 


ACCTGAGATT 


ATTGATGTAG 


3420 
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CTACTCTCTT 


fl A AT VAT ATG 


CCTCCCCATA 


TCCGTGCCGC 


AGGAACTAAT 


ATCATCATTA 


3480 


iTuAluulul 


i u/vW\uA 1 1 n 




GTCATCAGGT 


GATTCCAGAC 


CGCATTGAAG 


3540 


CJGGAACATA 


TATATCTTTA 




TTGGTAAAGG 


AATTCCTATA AATAATCTTC 


3600 


UTAL uAALA 






AGTTGGAAGA 


AATGGGAGTG 


AGAATGACTG 


3660 


MS ft flftMMftA ft ft ft 

TATCTGAACA 


CAGCATTTTT 






GAAAGCAATC 


AATATTAAGA 


3720 


CAGCTCCTTA 


CCCAGGCTTT 


GCAACTGAiT 


Tf" f* & & & A f"*f* 


CCTTACCCCT 


CTTTTACTAA 


3780 


GAGCGAATGG 


TCGTGGTACA 


ATTGTCGATA 


LuAl I lALun 


AAAACGTGTA 


AATCATGTTT 


3840 


TTGAACTAGC 


AAAGATGGAT 


GCGGATA n i 




TGGTCATATT 


TTGTACACGG 


3900 


GTGGACGTGA 


TTTACGTGGG 


GCCAGTGTTA 


AAOCuAWLbA 


CTTAAGAGCT 


GGGGCTGCAC 


3960 


TAGTCATTGC 


TGGGCTTATG 


GCTGAAGGTA 


AAACTGAAAT 


TACCAATATC 


GAGTTTATCT 




TACGTGGTTA 


TTCTGATATT 


ATCGAAAAAT 


TACGTAATTT 


AGGAGCGGAT 


ATTAGACTTG 


A AAA 


TTGAGGATTA 


AACCGTAGAG 


GTGTTTATGA 


ATATTTGCAC 


CAAATTACCA 




A 1 dA 


TTTTTGAAAC 


GGATCGCTTG 


TATTTGCGTC 


CTTTCTTTTT 


TACTGATAGT 


CAGGACTTCC 




GCGAGATAGC 


TTCAAATCCA 


GAAAATCTTC 


AATTTATTTT 


CCCAACGCAG 


GCAACTCTGC 


A 5 AO 


AAGAAAGTCA 


ATATGCACTG 


GCCAATTACT 




CCCTTTGGGA 


GTCTGGGCAA 


4320 


TTTGTGACCA 


GAAAAATCAA 


CAAATGATTG 


GTTCTATTAA 


ATTTGACAAG 


TTAGATGAAA 


4 380 


TCAAAAAAGA 


AGCTGAGCTT 


GGCTATTTTT 


TGAGAAAAGA 


TGCTTGGTCC 


CAACGATTTA 




TGACAGAGGT 


TGTTAGAAAA 


ATTTGTCAGC 


TTTCTTTTGA 


GGAATTTGGC 


TTAAAACAAT 


4500 


TATTTATCAT 


TACCCACCTT 


GAAAATAAAu 


rnuL^AAAU 


AGTTGCTCTT 


AAw uOM i 


4360 


TTAGTTTGTT 


CCGTCAGTTT 


& & err* a a 
AAGGGAAGTu 


at/vstt ir At** 


AAGAAAAATC 


CGGGATTATC 


4620 


TTGAATTTCG 


GTATGTAAAA 






ATCAGGAAAT 


TCTAAGCTAT 


4680 


TTGGAGGAAT 


TACCAGTAGG 


TAAAAGGGTC 


AGTGTTCGTA 


GCATTTCGAA 


TCATCTAGCA 


4740 


GTTAGTGATG 


GAACAGCCTA 


TCGCGCTATT 


AAAGAAGCTG 


AAAACCCTGG 


AATTGTGGAG 


4800 


ACCCGTCCTA 


GAAGTGGAAC 


AATTCGTGTT 


AAATCCCAGA 


AACTTGCTAT 


AGAGAGATTA 


4860 


ACGTTTGCTG 


AAATTGCAGA 


AGTGACTTCT 


TCTGAGGTTC 


TGGCTGGGCA 


AGAAGGTTTA 


4920 


GAGAGAGAAT 


TTAGTAAGTT 


TTCAATTGGT 


GCCATGACTG 


AACAAAATAT 


CTTGTCTTAC 


4980 


CTTCATGATG 


GGGGGCTCTT 


GATTGTCGGA 


GACCGAACCC 


GTATTCAGTT 


GCTAGCCTTG 


5040 


GAAAATGAAA 


ATGCAGTTCT 


GGTTACAGGG 


CGATTTCAGG 


TTCATCATGA 


TGTGCTTAAA 


5100 


CTGGCCAATC 


AAAAAGGGAT 


TCCTGTTCTA 


AGAAGTAAGC 


ATGATACCTT 


TACCGTCGCG 


5160 



oMcrwirv -wn 



an man ao i -v 
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ACCATGATCA 


ATAAAGCCTT 


GTCAAATGTC 


CAAATCAAGA 


CTGATATTCT 


CACAGTTGAG 


S220 


AAACTTTATC 


GCCCTAGTCA 


TGAGTATGGT 


TTTCTGAGAG 


AGACAGATAC 


AGTTAAAGAT 


5280 


TATTTGGACT 


TGGTTCGTAA 


GAATCGTAGC 


AGCCGTTTCC 


CTGTTATCAA 


TCAACATCAG 


5340 


GTCGTTGTTG 


GTGTTGTAAC 


CATGAGAGAC 


CCTGGTGATA 


AATCACCAAG 


CACGACAATT 


5400 


GATAAGGTTA 


TGTCTCGTAG 


TCTATTTTTG 


GTTGGATTAT 


CGACAAATAT 


TGCCAATCTG 


5460 


AGTCAACGGA 


TGATCGCAGA 


AGACTTTGAA 


ATGGTACCAG 


TTGTTCGAAG 


CAATCAAACT 


S520 


TTGCTTGGCG 


TTGTGACCCG 


ACGAGATGTC 


ATGGAGAAGA 


TGAGCCCTTC 


CCAAGTTTCG 


5580 


GCTCTACCAA 


CTTTTTCTGA 


GCAGATTGGA 


CAAAAGCTCT 


CTTATCACCA 


TGATGAAGTA 


5640 


CTCATTACAG 


TGGAACCCTT 


TATGCTAGAA 


AAAAATGGAG 


TTTTGGCTAA 


TGCTGTATTC 


5700 


GCAGAAATTC 


TGACCCACAT 


GACCCGATTT 


ACTTGTTAAT 


ACTCGTCGCA 


ATCTCATTAT 


5760 


CGAGCAGATG 


CTGATCTACT 


TTTTGCAGGC 


TGTTCAGATA 


GATGATATAT 


TGCGCATTCA 


5820 


CCCACCGATT 


ATTCATCATA 


CGAGACGGTC 


AGCTATAATT 


GATTACGATA 


TTTATCATGG 


5860 


TCACCAGATT 


GTTTCAAAAG 


CAAATCTGAC 


TCTTAAAATT 


AATTAGAAAC 


TAGGAGAAAA 


5940 


GATGATAACA 


TTAAAATCAC 


CTCGTCAAAT 


CGAACCTATG 


GACAAGCCTG 


GTGATTTTCT 


6000 


AGCAACTATT 


CATATAGGCT 


TACGTGATTT 


GATTAAGCCA 


GGCGTAGATA 


TGTGGGAAGT 


6060 


TGAAGAATAT 


GTCCGCCGTC 


GTTGTAAACA 


AGAAAATTTC 


CTTCCACTTC 


AGATTGGGGT 


6120 


TGACGGTGCC 


ATGATGGACT 


ATCCTTATGC 


TACCTGTTGC 


TCTCTTAACG 


ATGAAGTGGC 


6180 


TCACGCTTTC 


CCTCGTCATT 


ATATCTTGAA 


ACATCCTCAT 


TTGCTCAAAG 


TTGATATGGT 


6240 


TTTGGGAGGT 


CCCATTGCTA 


AATCTGACCT 


AAATCTCTCA 


AAATTAAACT 


TCAACAATGT 


6300 


TGAACAAATG 


AAAAAATACA 


CTCAGAGCTA 


TTCTGGTGGT 


TTACCAGACT 


CATCTTGGGC 


5360 


TTATGCTGTT 


GGTACACCGT 


CCGAACAAGT 


CAAAAACTTG 


ATGGATGTAA 


CCAAACAAGC 


6420 


TATCTACAAG 


GGTATTGAGC 


AAGCTCTTGT 


TGGAAATCGT 


ATCGGTGATA 


TCGGTGCGGC 


6480 


TATTCAAGAA 


TACGCTGAAA 


GTCGTGGTTA 


CGGTGTAGTG 


CGTGATTTGG 


TTGGTCATCG 


6540 


TGTTGGCCCA 


ACTATGCACG 


AAGAACCAAT 


GGTTCCTAAC 


TATGCTATTG 


CAGGTCGTGG 


6600 




CGTGAAGGAA 


TGGTCTTAAC 


CATTGAACCA 


ATGATCAATA 


CAGGCGATTG 


6660 


GGAAATTGAT 


ACAGATATGA 


AAACTGGTTG 


GGCGCATAAG 


ACCATTGACG 


GTGGATTGTC 


6720 


ATGTCAGTAT 


GAACACCAAT 


TTGTCATTAC 


GAAAGATGGA 


CCTGTTATCT 


TGACTAGCCA 


6780 


AGGTGAAGAA 


GGAACTTATT 


AATAAAAAGT 


GAAAAGACTA 


CTGGAAGTTT 


ATTTTGATAA 


6840 


AAAATCCACT 


AGATCTTTTC 


ATAATAAAAC 


GCATTGTATC 


AAGTGTTAGG 


GGCTGATATC 


6900 


ATGCGTTTTT 


CTGCTTTTAA 


GATTTTTTCC 


AACTCTGTTT 


GTAAGCGCAT 


CATAACAAAG 


6960 
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GCTCTAGGAT 


TCAGGGCTCT 


CCTCCTATAT 


ACTATTAGTA 






' U* w 


ATTTTAGTGT 


CGCAGTCTAT 


TGTTCCTGTA 


GAGATTCCAC 




TTTTGATTCT 


'WOW 


AAAAAGAGAA 


ATGGAATTCT 


GTTTAATGTT 


CGTATTGCCA 


ATCTTAAATT 


i Al. 1 1 l l 


"7 1 d A 
/ JL4 U 


TATTATACTT 


CCTGCGAAAC 


AAAATATGGT 


ATAGTAGTTC 


*f»»*^^ft 


UAAliCAAwTA 




AACAACTAAC 


TGATGCACGA 


TTTAAGCGTC 


TTGTTGGTGT 


TCAuCltiAw V- 


ACTTTTGAAG 




AGATCTTACC 


TGTATTAAAA 


ACAGCTTATC 


AACTTAAACA 


CGCAAAAGGT 


GGACGAAAAC 


7320 


CTAAATTAAG 


CCTAGAAGAC 


CTTCTTATGC 


CCACTCTTCA 


ATACTGCGAG 


AATATCGAAC 


7380 


TTATGAAGAA 


ATTGCGGCTG 


ATTTTGGTAT 


TCACGAAAGC 


AACTTTATCC 


GTCGGAGCCA 


7440 


ATGCGTTGAA 


ATAACTCTTG 


TTCAAAGTGG 


TTTTACCCTT 


TCAAGAACTC 


CTCTCAGTTC 


7500 


TGAGGACACG 


GTAATGATTG 


ATCCGACGGA 


AGTAAAAATC 


AATCGCCCTA 


AAAAAACAAT 


7560 


TAGCGAATGA 


TTCTGGTAAA 


AAGAAATTTC 


ACGCTATGAA 


GGCTCAAGCG 


ATTGTCACAA 


7620 


GTCAAGGGAG 


AATTGTTTCT 


TTGGATATCG 


CTGTGAACTA 


TAGTCATGAT 


ATGAAGTTGT 


7680 


TCAAAATGAG 


TCGTAGAAAT 


ATCGAACAAG 


CTGGTAAAAT 


CTTGGCTGAC 


ACTGGTTATC 


7740 


AAGCGCTCAT 


GAAGATATAT 


CCTCAAGCAC 


AAACTCCACC 


TAAATCCAGC 


AAACTCAAGC 


7800 


CGCTAACAGC 


TGAAGATAAA 


GCCTATAACC 


ATGCGCTATC 


TAAGGAAAGA 


AGCAAGGTTG 


7860 


AGAACATCTT 


TGCCAAAGTA 


AAAACGTTTA 


AAATATTTTC 


AACAACCTAT 


CGAAATCATC 


7920 


GTAAACGCTT 


CGGATTACCA 


ATGAATTTGA 


GTGCTGGTAT 


TATCAATCAT 


G AACT AGGAT 


7980 


TCTAGTTTTG 


CAGGAAGTCT 


ATTGAGGTAT 


TGAGCTAGTT 


TATGAAAAAA 


TTGGGTGAAA 


8040 


AGTCGAGTGT 


TTTAGAAACC 


CACAGTCTAC 


TATTCTAGTT 


TCAATCCACT 


ATATTTTGCT 


8 LOO 


ACTCCCCGTA 


AAGTTTCTAT 


TTTCCCTGAT 


TTCTGATATA 


ATAGAAATAT 


TGAV.HUAAU 


B i An 
a i ou 


AGTAAGGAAG 


AGAAGATGAA 


CGCATTATTA 


AATGGAATGA 


ATGACCGTCA 


GGCTuAGCCW 


ft"**>n 


GTGCAAACGA 


CAGAAGGTCC 


CTTGCTAATC 


ATGGCAGGGG 


CTGGTTCTGG 


AAAGACTCGT 


8280 


GTTTTGACCC 


ACCGTATCGC 


TTATTTGATT 


GATGAAAAGC 


TGGTCAATCC 


TTGGAATATC 


8340 


TTGGCCATTA 


CCTTTACCAA 


CAAGGCTGCG 


CGTGAGATGA 


AAGACCCTCC 


TTATAGCCTC 


8400 


AATCCAGCGA 


CTCACGACTG 


TCTGATTGCG 


ACCTTCCACT 


CCATCTGTCT 


GCCTATTTTG 


8460 


CGTCGCGATC 


CGCACCATAT 


TGGCTACAAT 


CGTAATTTTA 


CAATTGTGGA 


TCCTGGTGAA 


8520 


CAGCGAACGC 


TCATGAAACG 


TATTCTCAAA 


CACTTCAACT 


TGGACCCTAA 


AAAATGCAAT 


8580 


GAACGAACTA 


TTTTGGGGAC 


CATTTCCAAT 


GCTAAGAATG 


ATTTGATTGA 


TGATGTTGCT 


8640 


TATGCTGCCC 


AAGCTGGCGA 


TATGTATACG 


CAAATTGTGG 


CCCAGTCTTA 


TACAGCCTAT 


8700 
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CAAAAAGAAC 


TTCGTCAGTC TGAATCCGTT GACTTTGATG 


ATTTGATTAT 


CCTGACCTTG 


8760 


CGTCTCTTTG 


ATCAAAATCC 


TCATGTTTTG 


ACCTACTACC 


AGCAAAAATT 


CCAATACATC 


8820 


CACCTTGATC 


AGTACCAAGA 


TACCAACCAC 


GCTCAGTACC 


AATTGGTCAA 


ACTCTTGGCT 


8880 


TCCCGTTTTA 


AAAATATCTG 


TGTGGTTGGG 


GATGCGGACC 


AGTCTATCTA 


CGGTTGGCGT 


8940 


GGTCCTGATA 


TGCAGAATAT 


CTTGGACTTT 


GAAAAGGATT 


ACCCCAAAGC CAAGGTTGTT 


9000 


TTGTTGGAGG 


AAAATTACCG 


CTCAACCAAA 


ACCATTCTCC 


AAGCGGCCAA 


CCAGGTTATT 


9060 


AAAAATAATA 


AAAATCGCCC 


TCCTAAAAAT 


CTCTGGACTC 


AAAACGCTGA 


TGGGGAGCAA 


9120 


ATCGTTTACT 


ATCGTGCCCA 


TGATGAGCTG 


GATGAGGCTG 


TATTTGTAGC 


CAGAACCATC 


9180 


CATGAACTTA 


GTCGCAGTCA 


AAACTTCCTT 


CATAAGGATT 


TTGCAGTTCT 


CTATCGGACT 


9240 


AATGCCCAGT 


CCCGTACAAT 


TGAGGAAGCC 


CTGCTCAAGT 


CTAACATTCC 


TTATACCATG 


9300 


GTTGGCGGAA 


CCAAATTCTA 


CAGCCGTAAG 


GAAATTCGCG 


ATATTATTGC 


TTATCTCAAC 


9360 


CTTATTGCTA 


ATTTGAGTGA 


CAATATTAGT 


TTTGAGCGTA 


TTATCAACGA 


GCCTAAACGT 


9420 


GGAATTGGTC 


TACGTACACT 


TGAGAAAATC 


CGTGATTTTG 


CAAATTTGCA 


AAATATGTCT 


9480 


ATGCTGGATG 


CTTCTGCTAA 


TATTATGTTG 


TCTGGTATCA 


AGGGTAAGGC 


ACCCCAATCT 


9540 


ATCTGG G ATT 


TTGCCAATAT 


CATGCTTCAT 


TTGCGGGAGC 


AGCTAGACCA 


CTTAACCATT 


9600 


ACAGAGTTGC 


TTCAGTCCGT 


CCTAGAAAAA 


ACAGGTTATG 


TCGATATTCT 


TAACTCCCAA 


9660 


GCGACTCTAG 


AAAGCAAGGC 


ACGGCTTGAA 


AATATCGAAG 


AGTTTCTTTC 


TGTTACGAAG 


9720 


AACTTTGATG 


ACACCACGGA 


TCTGACAGAA 


GAGGAAACTG 


GTCTGGACAA 


ACTGAGTCGT 


9780 


TTCTTAAATG 


ACTTGCCTTT 


GATTGCCGAC 


ACAGATTCAG 


GTAGTCAGGA 


GACATCAGAA 


9840 


GTGACCTTGA 


TGACCCTGCA 


TGCTGCCAAA 


GGTCTCGAAT 


TTCCAGTTCT 


CTTTTTGATT 


9900 


GGGATGGAAG 


AAAATGTCTT 


TCCACTTAGT 


CCTGCGACTG 


AAGATTCAGA 


TCAATTAGAA 


9960 


GAAGAGCGCC 


GTCTAGCCTA 


TGTAGGTATC 


ACGCCTGCAC 


AGAAAATTCT 


CTATCTGACC 


10020 


AATGCCAACT 


CACGCTTGCT 


TTTTGGTCCT 


ACCAATTATA 


ACCGTCCGAC 


TCGTTTTATT 


10080 


AACGAAATCA 


GTTCAGACTT 


GCTTGAGTAT 


CAAGGTCTGG 


CTCGTCCTGC 


AAATACAAGC 


10140 


TTTAAGGCAT 


CATATAGCAG 


TGGTAGTATT 


TCCTTTGGTC 


AAGCTATGAG 


TTTGGCTCAG 


10200 


GCTCTTCAAG 


ACCGTAAACG 


CGGTGCTGCC 


CCAAAATCAA 


TCCACTCAAG 


CGGTCTTCCA 


10260 


TTTGGTCAAT 


TTACAGCTGG 


CGCAAAACCA 


GCATCTAGCG 


AGGCAAATTG 


GTCCATTGGT 


10320 


GATATTGCTC 


TCCACAACAA 


ATGGGGAGAG 


GGAACCGTTC 


TGGAAGTTTC 


AGCTAGCCGT 


10380 


GCTAGGCAGG 


AATTGAAAAT 


CAATTTCCCA 


GAAGTAGGTT 


TGAAAAAACT 


TTTAGCCAGT 


10440 


GTGGCTCCAA 


TTGAGAAAAA 


AATCTAATTT 


TCCATCCTTC 


TCACGAATAA 


TAAAGTGAGG 


10500 
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AGGATTTTTA TCTAC ACT AT TTCATTCCAA GAAGATTCAC TATTACCAAG ACAAAGCCTG 
GCCAACGAAG GAGTTGAAGC GCTTAGTAAC CAACACTTGC TAGCTATTTT ACTCAGGACA 
CGAACACGTC AAGCTAGCGT TTTTGAAATT GCCCAAAAAC TCTTCAACAA TCTTTCAAGC 
CTAACGGATT TGAAAAAAAT GACCCTGCAG GAATTGCAGA GTTTGTCTGG TATTGGGCGT 
GTTAAGGCCA TAGAATTACA AGCTATGATT GAACTGGCGC ATCGTATTCA CAAACACGAC 
ACTCTTGAAA TGGAAAGTAT TCTCAGCAGT CAAAAGTTGG CCAAGAAGAT GCAGCAGGAA 
TTAGGGGATA AAAAACAAGA CCACCTGGTG GCACTCTATC TCAATACTCA AAATCAAATC 
ATCCATCAGC AGACCATTTT TATCGGGTCT GTAACTCCTA CTATCCCTCA ACCGCGAGAG 
ATTCTTCACT ATGCAATCAA GCATATGGCG ACTTCTCTTA TCTTGGTCCA CAATCATCCT 
TCAGGAGCGG TAGCCCCTAG CCAAAATGAT GATCATCTCA CTAAACTTGT TAAAGAACCC 
TGCGAATTGA TGGGGATTGT TCTCTTGGAC CATTTGATTG TCTCTCATTC TAATTACTTT 
ACTTATCGTG AAAAGACAGA TTTAATCTAA AGTTCATTAA CGACATAGTC AAAGAGTTTT 
TTATCTTTGG GACGATTTTC AAAAACAAGT TCTGGATGCC ATTCCACACC GAGAAAGGCG 
ACATCATCCG TACTCATGAC AGCCTCAATG ATACCATCTT TAGGATCATG AGCCACAACT 
TTTAAATTTG GTGCTAAGTC CTTGATGCTC TCGTGGTGGA AGGAGTTGAT ATGAGAGA TT 
TCTCC AT AG A TTTCTTGGAG AACGGTATCT GGTTCTGTTA CCAAGCGTTG AGTTGTGTAC 
TCAACAGAAG AATCCTGCCA ATGGTCTTCC ATATCTTGGT ACAAAGTTCC ACCCATGGC A 
ACGTTAAAGA GTTGCGTACC ACGGCAGACA GAGAAAATGG CCTTTTTCTG TTTAATAGCT 
TCCTTGATGA GGGCCAGTTC GAACATATCT CTTTGAAGGT GATACTCATC ACTATCAATG 
GTTTTCGGTT CGCCATAAAA TTTTGGATCC ACATTTTGCC CACCTCTCAA GATCAGCTTG 
TCAATCAAAC TCATATAGTG GCAGGCCATT TCTTGATCAC CAATCGGTAG GATGATGGGA 
ATCCCTCCAG CATCTTTAAC CCCTTCAACA AAGCCTTTTG CTGCCTAGCT CATCATGATG 
TCATCATCTG GATGAGTTTT TTCGTTTCCT GTAATCCCAA TAACTGCTTT TTTCATAAAA 
TGATTTTCGC TTTCTAATCC TCTTTTCGCA TGAAGTAGAG CAGGGTTTGG AGTTCACTTG 
TCAAATCGAC ATACTGAACG ACCACGTCTT TTGCTAAATG CAGATGGACT GGTGAAAAAC 
TGACAATTCC TTTCACACCA GCATCAACCA AGAGATTAGC AACCTCTTGT GACTTGACGC 
TGGGAACAGT TAGGATAGCA GTCTTCACAT CAGCATCCTT GATTTTATCC TTGATCTGAG 
AAATCCCGTA AATGGGAATC CCGTCAGGAG TTTGGGTACC GACTTCAGGA TGCTCCTCTA 
GGTCAAAGGC CATGATAATC TTCATCTTGT TACGTTCGTG GAAGCGGTAG TGGAGAAGCG 
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CATGGCCCAT 


ATTTCCAATA 


CCAACCAGCA 


TGACATTGCT 


AATAGACTTG 


TCATTGAGCA 


12300 


AATCGGCAAA 


AAATGTCATT 


AGTTTTTTGA 


CATCATAGCC 


AAAACCACGA 


CGACCAAGTT 


12360 


CACCAAAATA 


GGAAAAATCA 


CGACGTACGG 


TCGCTGAATC 


AATACCGATA 


GCCTCTGCAA 


12420 


TTTGCTTAGA 


GTTGGCACCT 


TCAATCTTTT 


CTGCATGAAA 


TCTCTTAAAA 


ATTCGATAGT 


12480 


AGAGAGAGAG 


TCTTTTTGCT 


GTAGCTTTTG 


GAATAGCAAA 


CTGTTTATCT 


TTCACAAAAT 


12540 


CACAACCTTT 


CTATTCTTCT 


ATTTTATAGA 


AACATTCTGA 


AAAAATCAAC 


AAAAATAAGA 


12600 


AAAAACTAAG 


AAAAATCTTA 


GTTTTGATGT 


AAAAAATCTG 


CATGAGATAG 


AAAACGGTAG 


12660 


AGGTCTCCCA 


CCAGCCCCTG 


ATAAACTTTT 


TTGCCCCTAA 


AAGTCAGAGA 


ACTCACATAA 


12720 


ACTGTATCTG 


GTAAGGTTAC 


ACATCCTGAC 


AAAGTCAACA 


TGAGAGCCTC 


ATGATCCTCA 


12780 


TACTTGAGAG 


TACGCTCTAC 


ATGATAGCAG 


TCCTTATAGG 


TCAGTTCAAA 


CATTTTGGCT 


12640 


CTATCTTTCC 


GATTTTGTAA 


AGACACCACG 


TTCTACCAAG 


CTATCCATGA 


GGAAGTAGAA 


12900 


TTTTTCCTGA 


TGAATATGGT 


GGTCTTCTGA 


TTTGAAAATA 


TCAACTAGAC 


GAAGGCCAAA 


12960 


CTTGTCAGTG 


ATATTGATTT 


TAGCCCCTCT 


AAGTTCCTTG 


TTAATGATGA 


TTTTGAGTTG 


13020 


CAAGCCTTCA 


CCGCTGTTTG 


GCACTTTTTC 


CAAAAGGCGA 


GTCAGTTCAT 


AGTTACCAAC 


13080 


CTTAGTTTCA 


AAAAAGCTGT 


TATCTTTGAG 


GGTGAATTTT 


TTAACAGAAG 


GGCTAAGAGT 


13140 


CTAATCCTAA 


CGACAATTTT 


TTAACTGAAT 


GATTTTTTCA 


AATGCCATAT 


GGCTAACCTC 


13200 


CGATAATTTC 


TTTTAAGGTT 


TTTGCGAGGG 


TTTGTAGGTC 


TTCAACGGTA 


TTTTGTGGCG 


13260 


ACAAACTGAT 


GCGAAGGGAT 


TCCTTCAAGC 


GTTCTGAATT 


TGCGCCATAC 


ATCGCTTCAA 


13320 


GAACATGGCT 


GGATTGGACA 


ACGCCTGCAC 


TACAGGCTGA 


GCCAGTAGAG 


ATTGAAATTC 


13380 


CAGCTAAATC 


TAGCCGAACG 


AGTAAGAGGT 


CATTTTTCTG 


ACCAGGAAAT 


CCAATATTGA 


13440 


GAACATAAGG 


GAGATGATGT 


TTTCCTCTAT 


TCAGGTAATA 


CTGAATGCCC 


TCCAGCTCTC 


13S00 


CCAGAAAGGC 


AGTTTCTAGA 


TTTTGTACAT 


GTTGAAAATG 


TTCTTCTTGT 


TTTTCTAGGT 


13560 


CTTCTTTTAG 


GGCTGCAACC 


ATGCCTACAA 


TGGCAGGCAG 


ATTTTCAGTT 


CCTGCACCTT 


13620 


TTTTCTGTTC 


CTGGTCTCCG 


CCATGTAGAT 


AGGAATCAAA 


GTCCATGCTA 


GATGCCTAGA 


13660 


GAAAACCGAT 


TCCCTTAGGA 


CCATGGAATT 


TGTGGGCAGA 


AGCAGTGAGA 


AAATCAATGC 


13740 


CCAATTCTTC 


TGAATGAATT 


GGGATTTTAC 


CAATAGCCTG 


AACTGCATCA 


ACATGATAGG 


13800 


CAGCAGGCTG 


TTGCTTGACT 


ATTTGGCCAA 


TTTCAGCGAT 


GGGCAGTAGG 


TTTCCTGTCT 


13860 


CATTATTGAC 


AAACATGGTA 


GAAACCAAAA 


TCGTATCGTC 


ACGTAAAGCC 


TTTTGAATTT 


13920 


GCTGGGCTGT 


CATTTCTTGA 


TTTTCTGGCT 


GGATAATGGT 


TGCTTCAAAC 


CCAAAGTCTT 


13960 


GAACCAAGTA 


ATCAATTGTT 


TCAAGGACAG 


CATCGTGCTC 


GATGGCAGTT 


GTGATGATAT 


14040 
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Ul t 1 1 WW 1 * w 


TTCTTGGTGA CGAAGACAGT 


AGCCAATGAT 


GGTACTATTA 


TTGCCTTCAC 


14100 


w wA w W n w n 


AGTGAAAAAG ATATGTTGAC 


GTTTTGTCCT 


TAGTAACTGG 


GCTAGTTCCT 


14160 


GAw^Wjti 1 w 


TCGCAAGAGT TTGCCAGCTT GACGACCATG 


ACCATGAATA 


CTAGAAGGAT 


14220 




TTCTTGCATA ACCTTGGTCA TAGCTGAAAT 


AGCAACTGCT 


GACATAGGAG 


14280 


TLVil l\A.WJl- 


ATTGTCCAAA TAAATCAAAG 


AATCACCTTA 


TTTCTTTTTA 


TTGTAGGCAA 


14340 


AGAuTviUuL I 


r» » rwtTTT rTTTCCTC AA 

UAU tuu ill* W A * * WW * 


TACGGACGAT 


AGCATCACCA 


ATTAACTCAC 


14400 


TAGCAGTGAT 


UTAuCAiAWn 1114 *#»ww*-»w 


rrn'mciTT 


TGTTGCTACT 


GAATCAGTCA 


14460 


CAAGAAi 1 it 


iMMntfiT^fc ftTATTGTCAA 
111 AA 1 A 1 « ^ w*" 1 *w*w*-*« 


GAAGCTCAGC 


AGCTCCCTCG 


ACGAAGAGAC 


14520 


CGTGGCTAwA 


AA(~ Aww A A A/* Ml l * w * vi * 


CTCCTTCACG 


TTCAACGATT 


TTAGAACCTT 


14580 


CAGAGAAGGT 


*rr*rrrrfiTA TTTAAAATAT 


CATCAATCAA 


GATAGCTTTC 


TTACCTTCAA 


14640 


CATCACCAAT 


& af*P*T TCGTTACGAG 

AAiAlAAU^l l ww *j-»ww#-»w 


TTGCATCGTC 


TTGAGGGTAG 


TCGATAATCC 


14700 


CGATAGGAGC 


Rirr».&rAT&T T*f"A.GCCAGGC 
ATUWin 1 A 1 1 w/%w w w*%www 


TACGCGCACG 


TTTGACACCT 


GAATTTTTAG 


14760 


GGCTAACGAC 


a. & c* & ariTrr 4APT AAGCA 

AACAALA 1 w A uftAV.unnwwn 


ATCCTTTATC 


GCAGTAATGT 


TTTGCGAATA 


14820 


GGGGAACAGT 


rxiiinATTA TCCACTGGAA 


TATCAAAGAA 


ACCTTGAACC 


TGAACGGCAT 


14880 


GCAAATCAAG 


trTrartnATA CGATCAACTC 


CAGCCTTAAC 


CAGC AT ATTG 


CCAACTACTT 


14940 


TTGCTGTAAG 


TGGCTCACGA GGACAAGCAA 


TGCGGTCTTG 


ACCTGCATAG 


CCAAAATATG 


15000 


GAAGGACAAw 


CTTGATACTG TGGGCACTTC 


CACCCACACA 


AGCATCGACC 


ATGATTAACA 


15060 


ATTCCATTAu 


GTwO I IUl I w «w/\www*w»w 


TTGTTGATTG 


GATGATGTAA 


ACATCATAAC 


15120 


CACw*«SACACT 


TTCTTCGATA TTTACTTGGA 


TTTCTCCGTC 


TGAAAATTGA 


CGTGATGATA 


15180 


rrrrrrrrAafi 


TGGGACACCA ACAGCTTGCG 


CAATTTTTTG 


TGCAATCTCT 


TCGTTAGAGT 


15240 


luAw luWWnA 


AAGTTTCATG TTTTTTCTAT 


CTGACATTAT 


AGACCGTCCT 


CTGTAAACTT 


15300 


TATAAATCCT 


AGTTATATTT ACCTTACATA 


TATGAACTGG 


GATTTGTCTA 


TTTTTATCTT 


15360 


TTCTATTTTA 


CCAAAAAATG GAGATTATTT 


CAGCTATTTT 


TCATACTTTT 


GACAAATCCA 


15420 


ACCAATTTTG 


AAGGAGCTTT ttgataggaa 


ATCTGATTTT 


TCTCTAAAAA 


TTGTCGAAAA 


1S480 


TCCTGTTTGC 


CTTGCTCATG ATTTTCCACT 


TCAAGCTCCA 


ATTCCTAATC 


TCTTATATCA 


15540 


AAGTATCGCC 


TCTGATCCAG TGCCATGAGA 


CCAATAGCTG 


TTTTCATTTC 


ATAGCGAAGC 


15600 


GTTGTTAGAC 


AACCAAGAAC CTGCCAGTTC 


TTACTTTGGA 


TACCATGTTT 


CGCCAATTCA 


15660 


TCCAGTACTA 


GCCCTTGACG AAGTTCTTCC 


TTACTCAGAT 


AGTTCTCAGC 


ATCTTTTAGT 


15720 


TGCAATTTTT 


GGTTGTATTC CATGTTTCCA 


ACACTCTGCG 


GGACTTTGAG 


; TGTCAACTCA 


15780 
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GCCCAGTCTT CAAAGGTTCC AATCCGCATA CCGACTTTCT TTTCfCwCAG TTCAAAATCA 15840 

t 

GCCCTGTCGA TGTAGTAATT TCTTTGAAGA ACAGGACTGA CACCTGTGAA CTGCT C TTTT IS 9 00 
ACACGATTGT ATTCATCTTT TTTCAATAGT GTTTTCAATT CAATTTCTAA ATGTTTCATT 15960 
TTTCTTACCT TTTTTTATCG TTGAAAGCGG ATTTATGGTA TAATAAGCAT TGTATTTATT 16020 
GTATATGAAT CTGGAGAAAA AATCAAAGAT ATTTTTGACG GATAATATGA GAACAAGGGA 16080 
GAATATATGA CCTTAGAATG GGAAGAATTT CTAGATCCTT ACATTCAAGC TGTTGGTGAG 16140 

TTAAAGATTA AACTTCGTGG TATTCGTAAG CAATATCGTA AGCAAAATAA GCATTCTCCA 16200 

ATTGAGTTTG TGACCGCTCG AGTCAAGCCA ATTGAGAGCA TCAAAGAAAA AATGGCTCGT 16260 

CGTGGCATTA CTTATGCGAC CTTGGAACAC CATTTGCAGG ATATTGCTGG CTTACGTGTG 16320 

ATGCTTCAGT TTGTAGATGA CGTCAAGGAA GTAGTGGATA TTTTGCACAA GCCTCAGGAT 16380 

ATGCGAATCA TACAGGAGCG AGATTACATT ACTCATAGAA AAGCATCAGG CTATCGTTCC 16440 

TATCATGTGG TAGTAGAATA TACGCTTGAT ACCATCAATG GAGCTAAGAC TATTTTGGCA 16500 

GAAATTCAAA TTCCTACTTT CGCCATCAAT TTCTGGGCAA CGATAGAACA TTCTCTCAAC 16560 

TACAAGTACC AAGGGGATTT CCCAGATGAG ATTAAGAAGC GACTGGAAAT TACAGCTAGA 16620 

ATCGCCCATC AGTTGGATGA AGAAATGGCT GAAATTCCTG ATGATATCCA AGAAGCCCAG 166 80 

GCACTTTTTG ATCCTTTGAG TAGAAAATTA AATGACGGTG TAGGAAACAG TGACCATACA 16740 

GATGAAGAAT ACAGGTAAAC GAATTGATCT GATAGCCAAT AGAAAACCGC AGAGTCAAAG 16800 

GGTTT T GTAT GAATTGCGAG ATCGTTTGAA GAGAAATCAG TTTATACTCA ATGATACCAA 16860 

TCCGGATATT GTCATTTCCA TTGGCGGGGA TGGTATGCTC TTGTCGGCCT TTCATAAGTA 16920 

CGAAAATCAG CTTGACAAGG TCCGCTTTAT CGGTCTTCAT ACTGCACATT TGGGCTTCTA 16980 

TACAGATTAT CGTGATTTTG AGTTGGACAA GCTAGTGACT AATTTGCAGC TAGATACTGG 17040 

GGCAAGGGTT TCTTACCCTG TTCTGAATGT GAAGGTCTTT CTTGAAAATG GTGAAGTTAA 17100 

GATTTTCAGA GCACTCAACG AAGCCAGCAT CCGCAGGTCT GATCGAACCA TCGTGCCAGA 17160 

T A TTGTAATA AATGGTGTTC CCTTTGAACG TTTTCGTGGA GACGGGCTAA CAGTTTCGAC 17220 

ACCGACTGCT AGTACTGCCT ATAACAAGTC TCTTGGCGGT GCT G TTTTAC ACCCTACCAT 17280 

TGAAGCTTTG CAATTAACGG AAATTGCCAG CCTTAATAAT CGTGTCTATC GAACACrGGG 17340 

CTCTTCCATT ATTCTGCCTA AGAAGGATAA GATTGAACTT ATTCCAACAA GAAACCATTA 17400 

TCATACTATT TCGGTTGACA ATAGCGTTTA TTCTTTCCGT AATATTGAGC GTATTGAGTA 174 60 

TCAAATCGAC CATCATAAGA TTCACTTTGT CGCGACTCCT AGCCATACCA GTTTCTGGAA 17520 

CCGTGTTAAG GACGCCTTTA TCGGCGAGGT GGATGAATGA GGTTTGAATT TATCGCAGAT 17580 
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CAACATCTCA 


AGGTTAAGAC 


CTTCTTAAAA AAGCACCAGC 


TTTCTAAGGG 


ATTGCTGGCC 


17640 


AAGATTAAGT 


TTCGAGGTGG 


AGCTATTCTG 


GTCAATAATC 


AACCGCAAAA 


TGCAACGTAT 


17700 


CTATTGCACG 


TTGGAGACTA 


CGTTACCATT 


GACATTCCCG 


CTGAGAAAGG 


CTTTGAAACC 


17760 


TTGGAGGCTA 


TTGAGCTTCC 


ATTAGATATT 


CTCTATGAGG 


ATGACCACTT 


TCTAGTCTTG 


17820 


AATAAACCCT 


ATGGAGTGGC 


TTCTATTCCT 


AGTGTCAATC 


ACT CT AAT AC 


CATTGCCAA? 


17880 


TTTATCAAGG 


GTTACTATGT 


CAAGCAAAAT 


TATGAAAATC 


ACCACGTTCA 


CATTGTTACC 


17940 




GGGATACTTC 


TGGCTTGATG 


CTCTTTGCCA 


AGCACGGTTA 


TGCCCATGCA 


18000 


TRATTAGACA 


AGCAGTTGCA 


GAAGAAATCT 


ATCGAGAAAC 


GCTACTTTGC 


TTTGGTTAAG 


18060 


fV?Al?ATGGAC 

UUflU *» i W*% 


ATTTGGAGCC 


AGAAGGGGAA 


ATTATTGCTC 


CGATTCCCCG 


TGATGAAGAT 


18120 


TCCATTATTA 


fTACACGAGT 


GGCTAAAGGC 


GCAAAGTATG 


CCCATACTTC 


ATACAAGATT 


18180 


Vj 1 AuL I IL1 i 


ATGGAAATAT 


TCACTTCGTC 


TATATTCACC 


TGCACACTGG 


TCGAACCCAT 


18240 




i i * *w 


TCATATCGGT 




TGGGAGATGA 


TTTCTATGGT 


18300 






TCAACGTCAG GCTCTGCATT 


GCCATTACCT 


ATCCTTTTAT 


18360 


LAII.LA1 111 


TACACCAAGA 


CTTG C AGTT A 


GAAACTCCCT 


TGCCGGATGA 


TTTTAGTAAC 


18420 


{_ n All Av_v„l_ 


ACTTATCAAC 


TAATACTCTA 


TAAAAACTGT 


CT C AGAGT AT 


AATTATTATC 


18480 




AAACTCATGG 


AAGTTTTTGA 


AACTCTCAAA 


GCCAACCTTG 


TTGCTAAAAA 


18540 




GTTCTCCCTG 


AAGGGGAAGA 


GCCTCGTATT 


CTTCAAGCAA 


CAAAACGCTT 


18600 


ACTTAAAACAA 


ACAGAAGTGA 


TTCCTwTTTT 


GCTTGGAAAT 


CCTGAAAAAA 


TTAAAATTTA 


18660 




GAAGGAATCA 


TGGATGGTTA 


TGAGGTCATC 


GACCCTCAAC 


ATTATCCTCA 


18720 


ATTTGAAGAA 


ATGCTTTCTG 


CCTTCGTGGA 


GCGTCGCAAG 


GGCAAAATGA 


CTGAAGAAGA 


18780 


TGTACGCAAG 


GTTTTGGTTG 


AACATCTCAA 


CTACTTTGGT 


GTGATGTTGG 


TTTACTTGGG 


18840 


CTTGGTTGAT 


VAiAn 1 wvj 1 1 


CAGGAGCGAT 


TCACTCAACA 


GCTTCAACAG 


TTCGCCCAGC 


18900 


TCTACAAATC 


ATCAAAACTC 


GTCCAAATGT 


AACTCGTACT 


TCAGGAGCCT 


TCCTCATGGT 


18960 


TCGTGGTACG 


GAACGTTACC 


TATTTGGAGA 


CTGTGCCATT 


AACATCAATC 


CAGATGCAGA 


19020 


AGCCTTGGCT 


GAAATTGCCA 


TCAACTCAGC 


AATCACAGCT 


AAGATGTTTG 


GCATCGAACC 


19080 


TAAAATTGCC 


ATGTTGAGCT 


ATTCTACTAA 


AGCTTCAGGG 


TTTGGTGAAA 


GCGTTGATAA 


19140 


GGTCCTTGAA 


GCAACTAAAA 


TTCCTCACGA 


CTTGCGTCCT 


GACCTTGAAA 


TCGATGGTGA 


19200 


CTTGCAATTT 


GATGCAGCCT 


TTGTTCCTGA 


AACTGCAGCT 


CTCAAAGCTC 


CTGGAAGTAC 


19260 


GGTAGCTGGT 


CAAGCAAATG 


TCTTCATCTT 


CCCAGGTATC 


GAGGCAGGAA 


ATATTGGTTA 


19320 
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CAAGATGGCT CAACGCCTGG GTGGCTTTGC GGCTGTAGGA CCTGTTTTGC AAGGTTTAAA 19380 

CAAGCCACTT AATGATCTTT CTCGTGGATG TAATGCAGAT GATCTTTACA AGTTGACCCT 19440 

CATCACAGCA GCTCAAGCAG TTCATCAATA GTGAAAACTA TAAAGTCATA TACTATCCTA 19500 

TACTGTAGTT ATGAAACTAT GTACGAAAAG CACTCCCATT AATTCCTGAG AACTAAATTA 19S60 

CTGATTGGTG TCAAAAAGGA AAACTTCCAA GCGATGATAT CCTCTCTATA CACGACCTAT 19620 

AGAAATCTGT AATATACATA TCCGTAAAAC GATAAATTCC CTTTTTGATT TTAAATGAGT 19680 

ATCAAAAGAG AATTTTTTGG CTCTTTGTCA ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC 19740 

GAGAAAGGAC AAATTTCATC CTTT C TTTTT TGATATTCAG AGCGATAAAA ATCCGTTTTT 19800 

TCAA CT TTTC AAAGTTCCGA AAACCAAAGG CATTGCGCTT GATAAGTTTG ATGAGATTAT 19860 

TGGTCGCTTC CAGTTTGGCG TTAGAATAGT GTAGTTGAAG GGCGTTGATA ATCTTTTCTT 19920 

TATCTTTGAG GAAGCTTTTA AAGACAGTCT GAAAAATAGG ATGAACCTGC TTAAGATTGT 19980 

CCTCAATAAG TCCCAAAAAT TTCTCTGGTT CCTTATTCTC GAACTGAAAA AGCAAGAGTT 20040 

GATAGAGCTG ATAGTGGTGT TTCAAGTCTT CCGAATACCT CAAAAGCTTG TTTAAAATCT 20100 

CTTTATTGCT TAACTGCATA CGAAAAATAG CACGATAAAA TCGCTTATCA CTCAGTTTAC 20160 

GGCTATCCTG TTGAATGAGT TTCCAGTAGC GCTTGATAG 2019 9 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACCCGATGTA TCACCGGATA TTTACTCTAT TTTTCAAACG ATCTTATACC CACAATAAAA 60 

GAAAAAAGAC CCTAAGGTCT CCTTTGCTTT TATTATTAAA CGCGTTCAAC TTTACCTGAT 120 

TTCAAACCAC GAGCTGAAGC CCAAACTTTT TTAGGTTTAC CATCGATAAG AACAGTAACT 180 

TTTTGAAGGT TTGGTTTTAC GGCACGTTTT GTTTGGTTCA TCCCGTGTGA ACGGTTGTTT 240 

CCTGATACAG TCTTACGACC TGTAAAGTAA CATACTTTAG CCATTCTGTT TTCCTCCTAT 300 

TAGATCTAAT ATAGCGGATG TCCTAGCACC ACATACCGTA CTATGTTATC ACATTTTCTT 360 

GTTTTTTGCA AGGGAATTGG AAGATTTTTT ATTTGTGTCT TAAATCAGGT CTTGCGTGAC 420 

ATTTcTCCTC TCCACATGCC ATCGTTGATT AACAGAACAC CAGAATTAAA ATTATGTGTA 480 

TAAAAATCAT CTCTAACTCC AGCTAAGGCT ATAGCCGTCA AGTCCAAATC CCACAGCTCA 540 
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TCTATCGATT 


TTCTTACAAC 


AATATCTGAA 


TCCAAATACA 


GTACACGAGA 


CTCGCTTACA 


600 


TACTTTGGAA 


TAAAATACCT 


AAAAAAGCCG 


CATATGAAAG 


TCCCTCAAAG 


GGGAGACGAT 


660 


AACCTTTCAG 


AATATTACTG 


TCAATCTAAA 


CATTCACAAT 


CTCACTATTC 


AAAGTCTCTA 


720 


^rf^TTTTTT^ 

W A W A 1 4 1 1 A 


CATCAATTGG 


AACCATTCTC 


GCGGAAGGTC 


ATCATTAAAA 


ACATAAAACT 


780 


TAAGATTATA 


ATGATGAACA 


CAAAGAGATT 


TTATTGTTGT 


TTCAACTTTA 


TCCATATAAG 


B40 




A£*PTAAGACA 


ATCGCTTTTT 


TCTCTTCTTT 


CACTTTTTAT 


CTCATTTCTT 


900 


TTTATTCCCA 


Tf* AT ATT ATT 
lv./\inA ArtA * 


CCCATCATAT 


GTTTCCCATC 


ATATGTTTCT 


ACGTAACCAT 


960 


T ATTTTCG CC 


TATTCGTTCG 


TAAAACCATA 


CCAGTGGAGA 


TTTTAGATGA 


AGTCCCATTA 


1020 


CGuTTTAUvi 


i 1 1 4 1 t\\—S\ A A 


ACGACACGGA 


GTTTTACAAA 


TCGATTTCAT 


TTGCCAAACG 


1080 


TACTTACTV»n 


uuLAu 1 *■ 


TACTTCGCCA 


AATAGCGACT 


AGCGTCCAAC 


AATTTGGAAC 


1140 


TTTACTTCCA 




CTGAGTCACA 


TCTTCTCCTC 


TAACTCTACG 


TCTGGATACT 


1200 


f^f /** A & & 


& fif fift A ft fl 


GCAAAGTCAT 


TTTCAAAGAG 


AAAGACTGGT 


TGGTCAAAAC 


1260 




*T A iftlT 1TTP. 
1 nnun i Al 


CGACTTGACG 


ACATCCGTTC 


ATCCAACTCC 


TCAGCCTTGA 


1320 






C CC ATTGGGT 


TCATAACTAC 


TTCCGCATTG 


TACTCCCCTT 


1380 


Ut-SViut-Oit i vj 


TTT A A AflAf'T 
111 nJv\\»i\^ l 


TCAAACTGGA 


CTTGACCTAC 


AGCGCCTAGC 


ATCTACTCAC 


1440 


CTGTTTGGTA 


A 1 1\> 1 A A 1 #W» 


ACCTGAACGG 


CTCCTTCTTG 


CACCAATTGC 


TCAATCCCCT 


1500 




A A 1 1 A V*^— A A W 


ATAACATTCT 


TAGCAGAAAC 


TTTCATCAAA 


ATCTCAGGTG 


1560 






AATTCAAACT 


TCTTTTTTCC 


AACCGTCAAG 


GTATCCCCAA 


1620 




ACCGGTATCG 


TAAACCCCGA 


TAATATCACC 


TGCCACCGCA 


TTGGTCACAT 


1680 




CTCCGCCATA 


AACTGGGTAA 


C ATT AG AT AG 


TTTAGCC-rC 


TTACCAGTAC 


1740 


GACGGAGATT 


GACACTCATG 


CCGCGCTCAA 


ATTCGCCAGA 


TACGATACGG 


ACAAAGGCAA 


1800 


TACGCTCACG 


GTGACGAGGG 


TCCATCTTviv* 


(.11 \jKjr\ lilt 


AAAGACAAAG 


CCTGAGAAAT 


1860 


CCTTGTCATA 


ACCATCCACA 


ATTTCACCGT 


CTGTTTTCTT 


GTGACCATGT 


GGTTCTGGAG 


1920 


CAAACTTCAG 


GAAGGTTTCA 


AGGAAGGTCT 


CCACACCAAA 


GTTTGTCAGG 


GCTGAACCGA 


1980 


AAAAGACAGG 


CGTCAATTCT 


CCAGCCAGAA 


TAGCTTCCTC 


TGAAAACTCA 


TTCCCGGCTT 


2040 


CATTTAAAAG 


CTCAATGTCA 


TCCTTGACTT 


GCTCGTAGAA 


AGCATTGCTA 


CCAAAGAGTT 


2100 


TGTCCCCCTC 


TTCTAGACTG 


GCAAAACGCT 


CATCCCCTTT 


GTAAACCTCT 


AAACCTTGGT 


2160 


TATAGAGGTC 


ATACAAGCCC 


TCAAAGGCTT 


TCCCCATCCC 


GATAGGCCAG 


TTCATAGGGT 


2220 


AGCTAGCAAT 


GCCCAAGATT 


TCTTCCAATT 


CTTGCAAGAG 


ATCCAAAGGC 


TCACGACCGT 


2280 
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CACGGTCCAG 


CTTGTTCATA 


AAGGTAAAGA 


CTGGAATGCC 


ACGATGTTTC 


ACAACCTCAA 


2340 


ACAATTTCTT 


CGTTTGAGCC 


TCGATCCCCT 


TCCCAGACTC 


CACGACCATG 


ACCGCAGCAT 


2400 


CCACCGCCAT 


CAACGTACGA 


TAGGTATCTT 


CTGAGAAGTC 


CTCGTGCCCT 


GGCGTGTCTA 


2460 


AGATATTCAC 


GCGCTTGCCG 


TCGTAGTCAA 


ATTGCATAAC 


AGATGAAGTA 


ACAGAAATCC 


2S20 


CACGTTGCTT 


CTCGATATCC 


ATCCAGTCAG 


ATTTAGCAAA 


AGTCCCTGTT 


TTCTTCCCTT 


2580 


TTACCGTACC 


ACCCTCACGA 


ATCTCACCCC 


CAAAGTAGAG 


TAACTGCTCA 


GTGATGGTTG 


2640 


TTTTCCCCCC 


CTCCGGGTCG 


GAGATAATGG 


CAAAGGTACG 


ACGTTTCTTA 


ATTTCTTCTT 


2700 


GAATATTCAT 


aacttctctt 


TCTTTGATTC 


TCTATTTTTC 


TTGTTTCAAT 


AGCTGAGAAT 


2760 


GATTTTTACA 


TTGGATTTTA 


CCATTCCTTT 


CAACACTCCA 


TTATATCCGA 


TTTTAGCATT 


2820 


TTTTTCAATT 


TCTATTTCTT 


TTCACTTCCC 


CCTCCCTTAT 


TTATAGGAAA 


ATATGCTAAA 


2880 


ATAGAACAGA 


CTAAAAATCA 


TCATTTCACG 


AAAGGATGCA 


AGATGAAAAT 


TACGCAAGAA 


2940 


GAGGTAACAC 


ACCTTCCCAA 


TCTTTCAAAA 


TTAAGATTCT 


CTGAAGAAGA 


AACTGCTGCC 


3000 


TTTGCCACCA 


CCTTGTCTAA 


GATTGTTGAC 


ATGGTTGAAT 


TGCTGGGCGA 


AGTTCACACA 


3060 


ACTGGTGTCG 


CACCTACTAC 


GACTATGCCT 


GACCGCAACA 


CTGTACTCCG 


CCCTGATGTG 


3X20 


GCCGAAGAAG 


GAATAGACCG 


TGATCCCTTG 


TTTAAAAACG 


TACCTGAAAA 


AGACAACTAC 


3180 


TATATCAAGC 


TGCCAGCTAT 


CCTAGACAAT 


GGAGGACATG 


CCTAATGACT 


TTTAACAATA 


3240 


AAACTATTGA 


AGAGTTGCAC 


AATCTCCTTG 


TCTCTAAGGA 


AATTTCTGCA 


ACAGAATTGA 


3300 


CCCAAGCAAC 


ACTTGAAAAT 


ATCAAGTCTC 


GTGAGCAACC 


CCTCAATTCA 


TTTGTCACCA 


3360 


TCGCTGAGGA 


GCAACCTCTT 


GTTCAAGCTA 


AAGCCATTGA 


TGAAGCTGGA 


ATTGATGCTG 


3420 


ACAATGTCCT 


TTCACGAATT 


CCACTTGCTG 


TTAAGGATAA 


CATCTCTACA 


CACGGTATTC 


3480 


TCACAACTCC 


TGCCTCAAAA 


ATGCTCTACA 


ACTATGACCC 


AATCTTTGAT 


GCGACAGCTG 


3540 


TTGCCAATGC 


AAAAACCAAG 


CGCATGATTG 


TCGTTGCAAA 


GACCAACATC 


GACGAATTTG 


3600 


CTATGGGTGG 


TTCAGGTGAA 


ACTTCACACT 


ACCGAGCAAC 


TAAAAACGCT 


TGGAACCACA 


3660 


GCAAGGTTCC 


TGGTGGGTCA 


TCAAGTGGTT 


CTCCCGCAGC 


TGTAGCCTCA 


CGACAAGTTC 


3720 


GCTTGTCACT 


TGGTTCTGAT 


ACTGGTGGTT 


CCATCCGCCA 


ACCTGCTGCC 


TTCAACGGAA 


3780 


TCCTTGGTCT 


CAAACCAACC 


TACGGAACAG 


TTTCACGTTT 


CGGTCTCATT 


GCCTTTGGTA 


3840 


CCTCATTAGA 


CCAGATTGGA 


CCTTTTGCTC 


CTACTGTTAA 


GGAAAATGCC 


CTCTTCCTCA 


3900 


ACGCTATTGC 


CAGCGAACAT 


GCTAAAGACT 


CTACTTCTGC 


TCCTGTCCGC 


ATCGCCGACT 


3960 


TTACTTCAAA 


AATCGGCCAA 


GACATCAAGG 


GTATGAAAAT 


CGCTTTGCCT 


AAGGAATACC 


4020 


TAGGCGAAGG 


AATTGATCCA 


GAGGTTAAGG 


AAACAATCTT 


AAACGCGGCC 


AAACACTTTG 


4080 
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AAAAAribOvi 


IuL I A 4 *■ v. 


GAAGAAGTCA GCCTTCCTCA CTCTAAATAC 


GGTGTTGCCG 


4140 


TTT ATT A CAT 


LA ILuV. 1 1 WA 


TCAGAAGCTT 


CATCAAACTT 


GCAACCCTTC 


CACGGTATCC 


4200 


GTTACGGCTA 




GATGCAACCA 


ACCTTGATGA 


AATCTATGTA 


AACAGCCCAA 


4260 


GCCAAGGl 1 i 


lTj^j l\*AAOA*j 


GTAAAACGTC 


CTATCATGCT 


GGGTACTTTC 


AGTCTTTCAT 


4320 


ft It ^0W* ft 

CAGCTTACTA 


TGATGCtT AC 


TACAAAAAGG 


CTGGTCAAGT 


CCGTACCCTC 


ATCATTCAAG 


4380 


ATTTCGAAAA 


AGTCTTCGCG 


GATTACGATT 


TGATTTTGGG 


TCCAACTGCT 


CCAAGTGTTG 


4440 


CCTATGACTT 


GGATTCTCTC 


AACCATGACC 


CAGTTGCCAT 


GTACTTAGCC 


GACCTATTGA 


4S00 


CCATACCTGT 


AAACTTGGCA 


GGACTGCCTG 


GAATTTCGAT 


TCCTGCTGGA 


TTCTCTCAAG 


4560 


GTCTACCTGT 


CGGACTCCAA 


TTGATTGGTC 


CCAAGTACTC 


TGAGGAAACC 


ATTTACCAAG 


4620 


CTGCTGCTGC 


TTTTGAAGCA 


ACAACAGACT 


ACCACAAACA 


ACAACCCGTG 


ATTTTTGGAG 


4680 


GTGACAACTA 


ATGAACTTTG 


AAACAGTCAT 


CGGACTTGAA 


GTCCACGTAG 


AGCTCAACAC 


4740 


CAATTCAAAA 


ATCTTCTCAC 


f?TACTTCTGC 


CCACTTTGGA 


AATGACCAAA 


ATGCCAACAC 


4800 


TAACGTGATT 


GACTGGTCTT 


TCCCAGGAGT 


TCTACCAGTT 


CTCAATAAAG 


GGGTTGTTGA 


4860 


TGCCGGTATC 


AAGGCTGCTC 


TTGCCCTCAA 


CATGGACATC 


CACAAAAAGA 


TGCACTTTGA 


4920 


CCGCAAGAAC 


TACTTCTATC 


CTGATAACCC 


CAAAGCCTAC 


CAAATTTCTC 


AGTTTGATCA 


4980 


ACCAATCGGA 


TATAATGGCT 


GGATTGAAGT 


CAAACTAGAA 


GACGGTACGA 


CCAAGAAAAT 


5040 


CGGTATCGAA 


CGTGCCCACC 


TAGAGGAAGA 


CGCTGGTAAA 


AACACCCATG 


GTACAGATGG 


5100 


CTACTCTTAT 


GTTGACCTCA 


ACCGCCAAGG 


GGTTCCCTTG 


ATTGAGATTG 


TATCTGAGGC 


5160 


AGATATGCGT 


TCTCCTCAAG 


AAGCCTATGC 


TTATCTGACA 


GCCCTCAAGG 


AAGTTATCCA 


5220 


GTACGCTGGC 


ATTTCTGACG 


TTAAGATGGA 


GGAAGGTTCG 


ATGCCTGTGG 


ATGCCAACAT 


5280 


CTCCCTTCGT 


CCTTATGGTC 




CGCTACCAAG 


ACTGAATTGA 


AGAACCTCAA 


5340 


CTCCTTCTCA 


AACGTTCCTA 


AAGGTCTTGA 


ATACCAAGTC 


CAACGCCAGG 


f*IV7a & 1TTPT 
lunAn i 


5400 


TCGCTCAGGT 


GGTCAAATCC 


GCCAAGAAAC 


ACCCCGTTAC 


GATGAAGCGA 


ATAAAGCAAC 


5460 


CATCCTCATG 


CGTCTCAAGG 


AAGGGGCTGC 


TGACTACCGC 


TACTTCCCAC 


AACCAGACCT 


5520 


ACCCCTCTTT GAAATTTCTG 


ACGAGTGGAT 


TGAGGAAATG 


CGGACTGAGT 


TGCCAGAGTT 


5580 


TCCAAAAGAA 


CGTCGTGCGC 


GTTATGTATC 


TGACCTTGGT 


TTATCAGACT 


ACGATGCTAG 


5640 


TCAGTTGACT 


GCTAATAAAG 


TCACTTCTGA 


CTTCTTTGAA 


AAAGCTGTTG 


CCCTAGGTGG 


5700 


TCATGCCAAA 


CAAGTCTCTA 


ACTGGCTCCA 


AGGGGAACTC 


GCTCAGTTCT 


TGAATGCTGA 


S760 


AGGTAAAACA 


CTGGAACAAA 


TCGAATTGAC 


ACCAGAAAAC 


TTGGTTGAAA 


TGATTCCCAT 


5820 



WO 98/18931 



PCTAJS97/19588 



CATCGAAGAC GCTACTATTT CATCTAAGAT TCCCaIgAAA GTCTTTGTCC ATCTAGCTAA 
AAATGGCGCT GCCGCGCCTC AATACCTGGA AAAAGCACGT ATCCTTCAAA TTTCAGATCC 
AGCTATCTTG ATCCCAATCA TCCACCAAGT CTTTGCCCAT AACCAAGCTG CTCTTOCCCA 
CITCAAGTCA GCCAAACGTA ACGCCGACAA GGCtTTACAG GATTCCTTAT GAAGGCAACC 
AAAGGCCAAG CCAACCCACA AGTTGCCCTT AAACTACTTG CACAGGAATT GGCCAAGTTG 
AAAGAAAACT AGACAGAACA AAACCAGCCC TAAGGTTGGT TTTTTCTTCT CTACCAACTC 
CCAATAACTA TTTTGGCTTT ATTTCCAGAG TATTTTATGG TAAAATGAAG AGTAATAATA 
TTTATTAAAG AGGTAAAAAC ATGATTGAAG CAAGTACCTT AAAAGCTGGT ATGACCTTTG 
AAACAGCTGA CGGCAAATTG ATTCGCGTTT TGGAAGCTAG TCACCACAAA CCAGGTAAAG 
GAAACACGAT CATGCGTATG AAATTGCGTG ATGTCCCTAC TGGTTCTACA TTTGACACAA 
GCTACCGTCC AGAGGAAAAA TTTGAACAAG CTATTATCGA GACTCTCCCA GCTCAATACT 
TGTACAAAAT GGATGACACA GCATACTTCA TGAATACAGA AACTTATGAC CAATACGAAA 
TCCCTGTAGT CAATCTTGAA AACGAATTGC TTTACATCCT TGAAAACTCT GATCTGAAAA 
TCCAATTCTA CGCAACTGAA CTGATCGGTG TCACCGTTCC TACTACTCTT GAGTTGACAG 
TTGCTGAAAC TCAACCATCT ATCAAAGCTG CTACTGTTAC AGGTTCTGCT AAACCAGCAA 
CGATCCAAAC TGGACTTGTC GTAAACGTTC CAGACTTCAT CGAAGCAGGA CAAAAACTCG 
TTATCAACAC TGCAGAAGGA ACTTACGTTT CTCGTGCCTA ATCTCTAGAA AGAGGTCATT 
CTATGGGAAT TGAAGAACAA CTTGGCGAAA TCGTTATCGC CCCACGTGTA CTTGAAAAAA 
TCATTGCTAT CGCTACTGCA AAGCTAGAGG GTGTTCACTC TTTTTCAAAC ACATCAGTC? 
CTGATACCCT TTCAAAACTT TCACTCGGCC GTGGCATTTA TCTTAAAAAC GTGGACGAAC 
AACTCACAGC AGATATCTAT CTCTACCTTG AGTACCGAGT AAAAGTTCCT AAGGTAGCGG 
TTGCTATCCA GAAAGCTGTC AAAGATGCCG TCCGTAATAT GGCTGATGTA GAACTCGCTG 
CTATCAATAT TCACGTTGCA GGTATCGTCC CAGATAAAAC ACCAAAACCA GAATTGAAAG 
ATCTATTTGA CGAGGACTTC CTCAATGACT AGTCCACTAT TAGAATCTAG ACGCCAACTC 
CGTAAATGCG CTTTTCAAGC TCTCATGAGC CTTCAGTTCG GTACGGATGT CGAAACTGCT 
TGTCGTTTCG CCTATACTCA TGATCCTGAA GATACGGATG TACAACTTCC AGCCTTTTTG 
ATAGACCTCG TTTCTGGTCT TCAAGCTAAA AAGGAAGAAC TAGATAAGCA AATCACTCAG 
CATTTAAAAG CAGGTTGGAC CATTGAACGC TTAACGCTCG TGGAGAGAAA CCTCCTTCGC 
TTGGGAGTCT TTGAAATCAC TTCATTTGAC ACTCCTCAGC TGGTTCCTGT TAATGAAGCT 
ATCGAGCTTG CAAAGCACTT CTCCGATCAA AAATCTGCCC GTTTTATCAA TGGACTGCTC 



5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 
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AGCCAGTTTG TAACAGAAGA ACAATAAGGC TCTTTCTCAA CTGTAGTGGG TTGAAAAAAA 7680 

CCTAAGCTCG AGAAACGACA AATTTCGTCC T T T C TTTTT T CATCTTCAAA GCGATAAAAA 7740 

TCCGTTTTTT GAAGTTTTCA AAGTTTCGAA AACCAAAGGC ATTGCGCTTG ATAACTTTGA 7800 

TG AG ATT ATT GGTCCCTTCC AGTTTGGCAT TAGAATAGTG TACTTGAAGG CCCTTGACAA 7860 

TCTTTTCTTT ATCTTTGAGG AACCTTTTAA AGACAGTCTG AAAAATACGA TGAGCCTGCT 7920 

TAACATTGTC CTCAATAAGT CCGAAAAATT TCTCTGGTTC CTTATTCTGG AAGTGAAACA 7980 

GCAAGAGCTG ATAGAGCTGA TAGTGGTGTT TCAAGTCTTG TGAATGGCTC AAAAGCTTGT 8040 

CTAAAATCTC TTTATTGGTT AAGTGCATAC GAAAAGTAGC ACGATAAAAT CGCTT ATCAC 8100 

TCAGTCTACG GCTATCCTGT TGAATGAGTT TCCAGTAGCC CTTGATATCC TTGTATTCAT 8160 

GGGATTTTCG ATGAAACTGA TTCATGATTT GGACACCCAC ACGACTCATG CCACGGCTAA 8220 

GATCTTGTAC AATGTGAAAG CGATCAAGAA CGATTTTAGC ATTCGGGAGT GAAACAGTCT 8280 

GGCAGACTGT TTCAGCCTGA GCCTAGGAAT TTGAAAGCGA AGCTGTTTAG CCAAGTCATA 8340 

GTAAGGGCTA AACATATCCA TACTAATAAT TTTCACGCGA CATCGGACAA CTCTATCGTA 8400 

GCGAAGAAAG TGATTTCCAA TGATAGCTTG TGTTCTACCC TCAACAACAG TGATGATATT 84 60 

GAGATTGTTA AAATCTTGCG CAATGAACCT CATCTTTCCC TTTCTAAAAG CAT ACT CATC 8520 

CCAAGACATA ATCTCAGGAA GACAAGAAAA ATCATGTTTA AACTGAAAAT CATTGAGCTT 8580 

ACGAATAACA GTTGAACTTG AGATGGAAAG CTGATGGGCA ATATCAGTCA TAGAAATCTT B640 

TTCAATCAAC TTTTGAGCAA TCTTTTGGTT GATGATACGA GGCATTTCGT CATTTTTCTT 8700 

GACGATAGAA GTTTCAGCGA CCATCATTTT TGAACAGTGA TACCACTTGA ATCGACCCTT 8760 

TCTAAGGAGA ATTCTAGTAG GCATACCACT CGTTTCAAGA TAAGGAATTT TAGAAGCTTT 8820 

TTGAAAGTCA TATTTCTTCA ATTGGTTTCC CCACTCAGGG CAAGATGGGG CGTCCTAGTC 8880 

CAGTTTGCCG ATGATTTCCT TGTGTGTATC CTTATTGATG ATGTCTAAAA TCTG GAT ATT 8940 

AGGGTCTTTA ATGTCTAGTA ATTTTGTGAT AAAATGTAAT TGTTCCATAT CAATCTTTCT 9000 

AATCAGTTGT TTTGT CGCTT TTCATTATAG GTCATATCGG ACTTTTTTTC TACAATAAAA 9060 

TAGGCTCCAT AATATCTATA GGGGATTTAC CCACTACAAA TATTATAGAG CCAACAATAA 9120 

AAAGAAAAAG TGTTTGATAG ATATCAAACA CTTTTTTCTT TGCCTCCCAC TATCTAAAAA 9180 

AATGATAATA GATATAATTG TAAACAAAAA TCCAGATAGG TTTTGCATGA TTGAGAAAGT 9240 

TAAAAAAACT ATGGCAGAGA ATCGTTAATC TCAGATTCTC GGTAGAACGA TAAACAAGGG 9300 

CAAAAAAGAA ACCAATCAGA CTATAATATA ATAAACTAAT TGCATCTCTG TGAGATACTA 9360 



DWCfWIfV ^A/d OQ1 QQ-51 AO | „. 
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An 1 UWUvv\U 


AfflATAG£"!AG 


200 
ATAGGATAAC 


ATCCAAATAG 


TACTTGGACT 


9420 


« r*r*r* ft ft & /* ft ft 
AGGGAAAGaA 


Va» 1a1 a A.A i A 


AAA 1 Av. v. w A V. 


TATCAAGACT 
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GGATAAATTC 


ATCAAAAATG 
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AGTCCAAGGT 


TGTCAAGCGA 


10980 


CTCTGCTCAC 


GCGCTTCTCT 


GACTATTTTT 


GCAATATTCA 


TCTAGCACTC 


CCTCCATGCA 


11040 


ATCTGACTAG 


GCTAGCAATC 


GTATCTGGTA 


AGTCTCTTCT 
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TGGATGTGGA 


GAAGTCATGT 


14640 


GGTAGGCATC 


ACAAGTATTT 


CCGTAACCAA 


CCACTTCAGC 


CAGGATAGTA 
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CCAAATTCTT 


TCATCGGAAT 
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GTAGCTTCCT 
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TGACAGGATA 


GCGGTCATAT 


CAGACTCAAT 
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ATTCCGACTA 
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CCAATCAACC 
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C AG CCTT AG A 


AGCAGCATAA 


TTAGCTTGAC 


CAATATTCCC 


CATCAAACCA 


ACAACACTAG 
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ACATATTAAT 


GATAGCACCT 


TCTCTGGCTT 


TCATCATCGG 


TTTCAAGACT 


CATTGTGTCA 
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TATTAAAGGC 


ACCAGTCAGA 


TTGACCTTGA 


GCACTTTTTC 


AAAATCTGCT 


i-CTvJl L CATCT 
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TGAGCATAAG 


AGTATCTTGG 


GTAATCCCTG 


CATTGTTGAC 


CAAAACATCT 


ACTGAACCCA 


t CACA 
13700 


CTTCTGCAAT 


AGCTTGATCA 


ATCATACGCT 


TAGCGTCTCC 


AAAATCTGAT 


ACATCTCCTG 


16020 


AAATGGGAAC 


CACCTTGATA 


CCATAGTTTG 


AAAACTCAGC 


GAGCAATTCT 


TCTGAGATTG 


16080 


CCCCACGACT 


CTTTAAGACA 


ATGTTGGCTC 


CTGCTTGAGC 


AAACTTGTGG 


GCGATGGCAA 


16140 


CACCAATTCC 


ACGACTCGAA 


CCTGTAATAA 


AGATATTTTT 


ATGTTCT AGT 


TTCATTTTTT 


16200 


TCCTTTCAAA 


AC TTCT ACTT 


ATTTTAGTCT 


ATTTTTCTAA 


AAGTGCTACT 


AAACTCGCTT 


16260 


GATCTTCCAC 


ATGAGCTAAG 


TCAGCAGTTT 


GATCAATTTT 


TTTAACAAAA 


CCTGACAAGA 


16320 


CTTTCCCCGG 


TCCAATCTCG 


ATAAAGTTGC 


TTATGCCTGC 


TTCTTGCATG 


ACCCCAATAC 


16380 


TTTCATAGAA 


ACGAACGGGT 


TCCTTGACCT 


GACGCGTCAA 


GAGCTGAGCA 


ATGTCCTCTT 


16440 
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TTTGCATCAC 


AGCAGCTTCT 


CTATTGCCGA 


CTAGGGGACA 


AGTAAAATCT 


CAAAAACTTA 


16500 


CCTGACCTAG 


AGTTTCAGCT 


AGTTTCTGGC 


TAGCAGGTTC 


AAGGAGAGCG 


GTGTGAAAGG 


16560 


GACCTGACAC 


CTTAAGAGGA 


ATCAAGCGTT 


TGGCACCTCC 


TTCTTGCAAA 


AGTTCAACCG 


16620 


CTCGATCAAC 


TGCAACCACT 


TCTCCAGCAA 


TGACGATTTG 


TGCAGGTGTG 


TTATAGTTGC 


16680 


CTGGAGTAAC 


CACTCCAAGT 


TCAGAAGCTT 


TTTGACAGGC 


TTCTTCAATG 


ACCTCTACTG 


16740 


GCGTATTGAG 


AACTGCTACC 


ATCTTGCCAG 


AGTCAGCAGC 


AGCCGCTTCT 


TCCATATAGG 


16800 


CTCCACGCTT 


AGCTACCAAG 


GCAACCGCAT 


CTTCAAAATC 


CAAGGCGCCA 


CTTGCCACCA 


16860 


AGGCAGAGTA 


TTCTCCAAGA 


GACAAACCAG 


CAACCATATC 


AGGCTGATAG 


CCCTTTTCTT 


16920 


GCAATAAACG 


GTAGATAGCA 


ACCGAAGTCG 


CTAGAATGGC 


TGGTTGCGTA 


TAGCGGGTCT 


16980 


GATTGAGTTT 


GTCTTCTTCC 


CTATCGATGA 


GATAACGCAA 


ATCATAACCG 


AGCACCTGGC 


X7040 


TCGCTCGATC 


AATCGTTTCT 


TTAACAATCG 


GATACTGATC 


ATAGAAATCC 


CGTCCCATCC 


17100 


CTAGATACTG 


GGCACCTTGA 


CCAGCAAATA 


AAAAGGCTGT 


TTTAGTCATT 


TCTTACAACT 


17160 


CCTGTCCAGC 


GAGAGGCTTC 


TTCTTGAATT 


TTCTTAGCGG 


CTCCGTAATA 


CAAATCTTTT 


17220 


AGGATTTCTT 


CAGCTGTTTC 


TTCTTTAGAA 


ACAACCCCTG 


CGATTTCACC 


TGCCATAACA 


17280 


GACCCACCAT 


CCACATCACC 


GTGAACAACT 


GCTTTGGCTA 


GAGCACCTGC 


TCCCATTTGT 


17340 


TCAAAGATTT 


CTAAATCAGG 


ATCTTCTTGC 


TTAAAGGCAT 


CTTTTTCAGC 


CAGTTCAAAA 


17400 


TCTCTAGTCA 


ACTGATTTTT 


AATAGCACGA 


ACAGCATGAC 


CAAAGTGCTG 


AGCTGAAATC 


17460 


GTAGTATCAA 


TATCCCTTGC 


TTTTAAAATT 


TTCTCCTTGT 


AGTTTGGATG 


GGCATTCGAC 


17520 


TCTTTTGCAA 


CTACAAACCG 


TCTCCCCACC 


TGTAC AG CCT 


CTGCACCTAG 


CATAAAGCCA 


17580 


GCCGCAGCAC 


CTTCACCATC 


CGCAATTCCT 


CCTGCAGCAA 


TAACAGGAAT 


AGATATAGCT 


17640 


GTGGCTACCT 


GTCGCACCAA 


GGTCATGGTT 


GTTAATTTAC 


CGATATGCCC 


CCCAGCTTCC 


17700 


ATTCCTTCTG 


CAATAACAGC 


GTCTGCACCG 


ATTTTTTCCA 


TGCGTTTAGC 


TAAAGCGACA 


17760 


CTAGGAACAA 


CAGGAATAAC 


GATTATCCCA 


GCTTCATCCA 


AACGTTCCAT 


ATACTTGCTT 


17820 


GGATTTCCTG 


CTCCTGTTGT 


GACAACTTTA 


ACACCTTCTT 


CAATAACGAG 


ATCCACGATG 


17880 


TCTTCCACAA 


AGGGAGATAA 


GAGCATGATG 


TTGACCCCAA 


AGGGTTTATC 


AGTCAATGAT 


17940 


TTGATTTTAT 


CAATATTGGC 


CTTGACAACT 


TCTTTCGGGG 


CATTTCCCCC 


ACCGATAATT 


18000 


CCTAATCCTC 


CAGCCTTGGA 


AACACCCCCT 


GCCAAATCAC 


CATCAGCAAC 


CCAGGCCATC 


18060 


CCTCCTTGGA AAATAGGATA 


ATCAATCTTC 


AATAATTCTG 


TAATACGCCT 


TTTCATAGTG 


18120 


CCTCCAACCT 


TCCTTGCTTA 


CGTAATAGTT 


CGATTTCACC 


ATAATTTGAC 


AGTCAAACTA 


18180 


TTACCTAAAC 


AAGAGGGAGT 


GGCTTTCTCC 


CTACTCCTTC 


TACTAATATT 


CTGCTTATTT 


18240 
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TGCTTGCTCT 


TCAACGTAAG 


CAACCAAGTC 


ACCAACTCTT 


TTCAAGTCAT 


TTTCTCCTTC 


18300 


GATTTCGATA 


TCAAAAGCAT 


CTTCGATTTC 


TGAGATTACT 


TGGAACAAGT 


CCAATGAATC 


18360 


TGCGTCCAAA 


TCATCAAAAG 


TTGATTCAAG 


TGTTACTTCT 


GATGCGTCTT 


TTCCAAGTTC 


18420 


TTCAACGATA ATTTCTTGTA 


CTTTTTCAAA 


TACTGCCATG 


ATAGGACTCC 


TTTAAAATAA 


18480 


ATACTTTTTT 


TATAACAATG 


TGTTCACCAC 


ATGATTACCT 


AAATTGTAAG 


AATGAGCGTG 


18540 


CCCCAGGTCA 


AGCCTCCACC 


CAAGCCTGAT 


AGAAGAACAG 


TCTGGCTACC 


ATCTAAAGGG 


18800 


ATGAGACCTT 


GTTCTACACA 


CTCTGAAAGT 


AAAATCGGGA 


TACTGGCTGC 


ACTGGTATTG 


18660 


CCATATTCCA 


TCATATTGGC 


TCGAAGTTTG 


GCTCGGTCAA 


CACCAATTTT 


TCTAGCCATC 


18720 


TTATCCAAAA 


TACGGTCATT 


CGCTTGATGA 


AGTAGCAGAT 


AATCCAAGTC 


TGTCACCTCT 


18780 


ATAGGAGATT 


CATCAATAGT 


CTGCTTGATA 


GACTTGGCTA 


CATCTCGAAT 


GGCAAAATCA 


18840 


AAGACTGTGC 


GTCCATCCAT 


CTTCAAAAAC 


GAATCTGCAC 


TTTC TTGATC 


TGAAAATGGA 


18900 


GAATGTAAAC 


CTGAATGCCC 


ATAACTTAAA 


CACTCGCTGC 


GACTTCCATC 


GCTATTGAGA 


18960 


CTCTCAGCTA 


AGAAATGCTC 


TTGCTCGCTA 


CCTTCTAACA 


AGACACCACC 


AGCACCATCT 


19020 


CCAAACAACA 


CAGCTGTTuA 


TCGATCCGAC 


CAATCGACTC 


CCTT AG AG AG 


GG TTTC ACTA 


19080 


CCAATCACCA 


AG CCTTTTTG 


AAAGCGACCA 


GAAGCGATAA 


ACTTTTCAGC 


ACTTCAAAGA 


19140 


GCAAATACAA 


ATCCACTGCA 


AGCCGCCGTT 


AAGTCAAAAG 


CAAAGCCTTT 


ATTAGCACCA 


19200 


ATATTAGCTT 


GAACACGAGC 


AGCTGTAGAG 


GGCATCATCG 


AATCTGGAGT 


AATGGTAGCT 


19260 


AGGATCATAA 


AATCCAGTTC 


TTCTCCTGTT 


A 11 UUVA. • • 




TTTCTTAGCA 


19320 


ACCTCTGTAC 


CCAAATCACT 


GGTAGATTCT 


GTTCTTGAAA 


TATGCCTTTG 


TCGTATTCCC 


19380 


GTTCG AC TTG 


AAATCCACTC 


ATCATTGGTA 


TCCATAATCT 


GAGCCAAGTC 


GTGATTTGTA 


19440 


ACCACTTGCT 


CTGGCACATA 


ATGAGCAACC 


TGACTT ATTT 


TTGCAAAAGC 


CATTATTTCA 


19500 


AATCCTCCAA 


AAATTGGTAA 


AGATTAGTCA 


AACCTTTACC 


CATGACAGCA 


ATTTCTTCCT 


19S60 


CGCTCATGCC 


ATCAATAATT 


TTTTCTACCA 


TGGCCTTGTG 


GAAGCGTTTA 


TGCAGTCTAT 


19620 


GAATCAAGCC 


ACCCTTCTTT 


GTCAAATGCA 


GATGCACCAC 


ACGACCATCC 


TGTTCTGACC 


19680 


GAACTCGCTC 


AATGTAGCCC 


GG 








19702 



12) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



« 



GAAAATTTCC 


TCTCTTCTCT 


TCAAAAATTT 


TGAAAAAATG 


GTATGATAGT 


AACAAGTTAT 


60 


TTTTAAGACG 


AAAGAAAGGG 


GAATAATGGA 


GAAAATCAGT 


TTAGAATCTC 


CTAAGACGGG 


120 


CTCGGACCTA 


GTTTTCGAAA 


CACTTCGTGA 


TTTAGGACTT 


CATACCATCT 


TTGGTTATCC 


ieo 


TGGTGGTGCG 


GTTTTGCCTT 


TTTATGATGC 


GATATATAAT 


TTTAAAGGCA 


TTCGCCACAT 


240 


TCTAGGGCGC 


CA7GAGCAAG 


GTTGTTTGCA 


TGAAGCTGAA 


GGTTATGCCA 


AATCAACTGG 


300 


AAAGTTCGCT 


GTTGCCGTCG 


TCACTAGTGG 


ACCAGGAGCA 


ACAAATGCCA 


TTACAGGGAT 


360 


TGCGGATGCC 


ATGAGCGATA 


GCGTTCCCCT 


TTTGGTCTTT 


ACAGGTCAGG 


TGGCGCGAGC 


420 


AGGGATTGGG 


AAGGATGCCT 


TTCACGAGGC 


AGACATCGTG 


GGAATTACCA 


TGCCAATCAC 


480 


TAACTACAAT 


TACCAAGTTC 


GTCAGACAGC 


TGATATTCCG 


CGTATCATTA 


CGGAAGCTGT 


540 


CCATATCGCA 


ACTACAGGCC 


GTCCAGGGCC 


AGTTGTAATT 


GACCTACCAA 


AAGACATATC 


600 


TGCTTTAGAA 


ACAGACTTCA 


TTTATTCACC 


AGAAGTGAAT 


TTACCAAGTT 


ATCAGCCGAC 


660 


TCTTGAGCCG 


AATGATATGC 


AAATCAAGAA 


AATCTTGAAG 


CAATTCTCCA 


AGGCTAAAAA 


720 


GCCAGTCTTC 


TTAGCTGCTG 


GTGGAATTAG 


TTATGCTGAG 


GCTGCTACGG 


AACTAAATGA 


780 


ATTTGCAGAA 


CGCTATCAAA 


TTCCAGTGGT 


AACCAGTCTT 


TTGGGACAAC 


GAACGATTGC 


840 


AACGACTCAC 


CCACTCTTTC 


TTGGAATGGG 


AGGCATGCAC 


GGGTCATTCG 


CAGCAAATAT 


900 


TGCTATGACG 


GAAGCGGACT 


TTATGATTAG 


TATTGGTTCT 


CCTTTCGATC 


ACCGTTTGAC 


960 


GGGGAATCCT 


AAGACTTTCG 


CTAAGAATGC 


TAAGGTTGCC 


CACATTGATA 


TTGACCCAGC 


1020 


TGAGATTGGC 


AAGATTATCA 


GTGCAGACAT 


TCCTGTAGTT 


GGAGATGCTA 


AGAAGGCCTT 


1080 


GCAAATCTTC 


CTAGCAGAAC 


CAACAGTTCA 


CAACAACACT 


GAAAAGTGGA 


TTGAGAAACT 


1140 


CACTAAAGAC 


AAGAATCCTG 


TTCGTTCTTA 


TGATAAGAAA 


GAGCGTGTGG 


TTCAACCGCA 


1200 


AGCAGTTATT 


GAACGAATTG 


GTGAATTGAC 


GAATGGAGAT 


GCCATTGTGG 


TAACAGACCT 


1260 


TGGTCAACAC 


CAAATCTCGA 


CAGCTCAGTA 


TTATCCCTAC 


CAAAATGAAC 


GTCAGTTAGT 


1320 


GACTTCAGGT 


GGTTTGGGAA 


CAATGGGCTT 


TGGAATTCCA 


GCAGCAATCG 


GTGCTAAAAT 


1380 


TGCTAACCCA 


GATAAGGAAG 


TAGTCTTGTT 


TGTTGGGCAT 


GGTGGTTTCC 


AAATGACCAA 


1440 


CCAGGAGTTG 


CCTATTTTGA 


ATATTTACAA 


GGTGCCAATC 


AAGGTGGTTA 


TGCTGAACAA 


1500 


TCATTCACTT 


GGAATGGTTC 


GCCAGTGGCA 


GGAATCCTTC 


TATGAAGGCA 


GAACATCAGA 


1560 


GTCGGTCTTT 


GATACCCTTC 


CTGATTTCCA 


ATTGATGGCG 


CAGGCTTATG 


GTATTAAAAA 


1620 


CTATAAGTTT 


GACAATCCTG 


AGACCTTGGC 


TCAAGACCTT 


GAAGTCATCA 


CTGAGGATGT 


1680 
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TCCTATGCTA 


ATTGAGGTAC 


ATATTTCTCG 


TAAGGAACAG GTGTTACCAA 


TGGTACCGGC 


1740 


TGGTAAGAGT 


AATCATGAGA 


TGTTCGGGGT 


GCACTTCCAT 


GCGTAGAATG 


TTAACAGCAA 


1800 


AACTACAAAA 


TCGTTCAGGA GTCCTCAATC 


GCTTTACAGG 


TGTCCTATCT 


CCTCGTCAGC 


1860 


TTAATATTGA 


AAGCATCTCT 




CAGAAGATCC 


GAATGTATCG 


CGTATCACTA 


1920 


TTATTATTGA 


TGTTGCTTU A 


fATftATCAAG 


TGGAGCAAAT 


CATCAAACAG 


CTCAAT*CG*TC 


1980 


AGATTGATGT 


GATTCGCATT 


CGAwATATTA 


CAGACAAGCC 


TCATTTGGAG 


CGCGAGGTGA 


2040 


TTTTGGTTAA 


GATGTCAGCG 


CCAGCTGAGA 


AGAGAGCTGA 


GATTTTAGCG 


ATTATTCAAC 


2100 


CTTTCCGTGC 


AACAGTAGTA 


GACuTAuv.wL 


PAACCTCGAT 


TACCATTCAG 


ATGACGGGAA 


2160 


ATGCAGAAAA 


GAGCGAAGCC 


CTATTGCGAG 


*TCATTCGCCC 

4 4 * 


ATACGGTATT 




2220 


CTCCAACCGG 


TGCAACTGGA 


TTTACCCGCG 


ATTAAAAATC 


CAACTTAAAT 




2280 


AGCCTAAAAG 


GCAATAAATA 


ATAGAAAAGA 


CAftAJLAAGCT 


ATGACAGTTC 


A A ATfV^AATA 


2340 


TGAAAAAGAT 


GTTAAAGTAG 


CAGCACTTCA 


f/'N'TT A A. A A A. A 


ATCGCCGTTA 




2400 


TTCACAAGGG 


CATGCGCATG 


CTCAAAACTT 


WW v A W#* * A wet 


GGTCGTGACG 


TTATTATCGG 


2460 


TGTACGTCCA 


CGTAAATCTT 


TTGATAAAbL 


AAAAGAAGAT 


GGATTTGATA 




2520 


AGCAGAAGCT 


ACTAAGTTGG 


CTGATGTTAT 


fATCATCTTG 


GCGCCAGACG 




2580 


AGAATTGTAC 


GAAGCAGAAA 


TCGCTCCAAA 




GGAAACGCAG 


1 1 ^*v»-/\ 1 1 1 


2640 


CCATGGTTTC 


AACATCCACT 




CAAAGTTCCT 


GCGGATGTAG 


A 1 w A w » • Wr* 4 


2700 


GTCTGCTCCT 


M It H/^^*^^*fc^ 

AAAGCAtLAb 




ACGTCGT ACT 


TACGAAGAAG 


A ^^^P^^^^^^^P^T^l* 

V\ 4 4 A W* 4 V* 4 


2760 


TCCAGCTCTT 


TATGCAGTAT 


ACCAAGATGC 


AACAGGAAAT 


GCTAAAAACA 




2820 


CTGGTGTAAA 


GGTGTTGGAG 


CGGCTCGTGT 


AGGTCTTCTT 


GAAACAACTT 


ACAAAGAAGA 


2880 


AACTGAAGAA 


GATTIXjTTTG 


CTGAACAAGC 


TGTAltiiv* i 


GGTGG1TTGA 


CTGCCCTTAT 


2940 


CGAAGCAGCT 


TTCGAAGTCT 


TGACAGAAGC 


AGGTTACGCT 


CCAGAATTGG 


CTTACTTTGA 


3 000 


AGTTCTTCAC 


GAAATGAAAT 


TGATCGTTGA 


CTTGATCTAC 


GAAGCTGGAT 


TCAAGAAAAT 


3060 


GCGTCAATCT 


ATTTCAAACA 


CTGCTGAATA 


CGGTGACTAT 


GTATCAGCTC 


CACGTGTAAT 


3120 


CACTGAACAA 


GTTAAAGAAA 


ATATGAAGGC 


TGTCTTGGCA 


GACATCCAAA 


ATGGTAAATT 


3180 


TGCAAATGAC 


TTTGTAAATG 


ACTATAAAGC 


TGGACGTCCA 


AAATTGACTG 


CTTACCGTGA 


3240 


ACAAGCAGCT 


AACCTTGAAA 


TTGAAAAAGT 


TGGTGCAGAA 


TTGCGT AAAG 


CAATGCCATT 


3300 


CGTTGGTAAA 


AACGACGATG 


ATGCATTCAA 


AATCTATAAC 


TAATTAGAAA 


TATATAGCGC 


3360 


TGGAGATGAT 


TTTATGAAAA 


AGATTATGAG 


AAAAATTGCA 


TCGTTATTAT 


TGCTTCTAGT 


3420 
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TGTATAATGT 


AATTACACCG 


TCGGTAATAC 


TGCTAGCAGA 


CCAAAATAAA 


GCAGATTGGT 


3480 


CGTATGATGA 


AAATGCTGTA 


ATTAACATTT 


ATGATGATGC 


TAATTTTGAA 


GATGGTAGGT 


3S40 


TGCATATCAA 


CTTTGAACAA 


TTCTTCAAAT 


TGGCACAAAT 


AGCTAGAGAA 


GAAGGTCTTG 


3600 


AAATTCATTC 


TCCGTTTGAG 


AGAGCTGGTG 


CGACTAAATC 


TGCTCGTTAT 


ATAGCCAAAT 


3660 


GGATTTTGAG 


AAATAAAAAA 


CATTAACAAA 


TATAGTTGGT 


AAATCATTAG 


GACCTAAATC 


3720 


AGCTGTTAGA 


TTCGGAGAAG 


CTTTATCCTA 


TATTGAACGT 


CCTCTTCGCA 


GAATAAATGA 


3780 


GACGATAGAT 


GGCGGTTTAT 


ATCAAATAGA 


GCAAATTATT 


GCATCTGGAT 


TGAAAGAATC 


3840 


GGGTTTAAAT 


GACTGCACTG 


CGAAAACTTT 


AGCTTCAGCT 


ATTCGTGGGA 


TATTAGATGT 


3900 


ACTTATTTAG 


GGGTTGAAAT 


CATATGAATA 


TTACCAATTT 


GTTTTCTATC 


AAGACAGGAT 


3960 


GTGATGAAAC 


TGATAGGCAA 


CTGCAAAAAC 


TATTTTTTCA 


GTTGGATTTA 


CAATTGGGAG 


4020 


AATTCACAGA 


TCAACTAAGA 


AAATTAGATT 


CTAATTTTGT 


TCCTCGTACT 


CAATTTGTAG 


4080 


ACACGTTGGA 


TTTGAATGAT 


GTAGAATATA 


AAGAAATTTT 


AAACTATTTT 


ATCTTCCATC 


4140 


CTAATGATAC 


TGAACAAAGT 


TTGCTAGAAT 


GGTTATATGA 


TTGGATTTCC 


ACAAATCGTT 


4200 


ATGAACTTCC 


TAAAGAGTTT 


TCGATTCGTA 


TGGCTCATAA 


ATACCATGAA 


ACTGTTACTG 


4260 


AAGTTTTCGG 


ACATGAATAA 


CTAAAAAACA 


GTCATTACTG 


ACTGTTTTTT 


ATAGAAAAAG 


4320 


AGGTTTTATA 


TGTTAAGTTC 


AAAAGATATA 


ATCAAGGCTC 


ACAAGGTCTT 


GAACGGTGTG 


4380 


GTTCTGAATA 


CTCCACTGGA 


TTACGATCAT 


TATTTATCCG 


AGAAGTATGG 


TGCTAAGATT 


4440 


TATTTGAAAA 


AAGAAAATGC 


CCAGCGTGTT 


CGCTCCTTTA 


AAATTCGTGG 


TGCCTATTAT 


4500 


GCCATTTCCC 


AGCTCAGCAA 


GGAAGAACGT 


GAACGTGGGG 


TAGTCTGCGC 


TTCTGCGGGA 


4560 


AATCATGCGC 


AGGGAGTAGC 


CTATACTTGT 


AATGAAATGA 


AAATTCCTGC 


TACTATCTTT 


4620 


ATGCCCATTA 


CTACGCCACA 


ACAAAAGATT 


GGTCAGGTTC 


GCTTTTTTGG 


TGGGGATTTT 


4680 


GTAACTATTA 


AACTAGTTGG 


AGATACCTTT 


GATGCCTCAG 


CCAAAGCAGC 


TCAAGAATTT 


4740 


ACAGTCTCTG 


AAAATCGTAC 


CTTTATTGAT 


CCTTTTGATG 


ATGCTCATCT 


TCAAGCAGGT 


4800 


CAAGGAACAG 


TTGCTTATGA 


GATTTTAGAA 


GAAGCTCGAA 


AAGAATCGAT 


TGATTTTGAT 


4860 


GCTGTCTTGG 


TTCCTGTTGG 


TGGTGGCGGT 


CTCATTGCCG 


GGGTTTCTAC 


CTATATCAAG 


4920 


GAAACAAGTC 


CAGAGATTGA 


GGTTATCGGA 


GTAGAGGCGA 


ATGGAGCGCG 


TTCCATGAAA 


4980 


GCTGCCTTTG 


AGGCTGGAGG 


TCCAGTAAAA 


CTCAAGGAAA 


TTGATAAATT 


TGCTGATGGG 


5040 


ATTGCTGTGC 


AAAAGGTAGG 


TCACTTGACC 


TATGAAGCAA 


CTCGTCAACA 


TATTAAAACT 


5100 


TTGGTAGGTG 


TCGATGAGGG 


ATTGATTTCT 


GAAACCTTGA 


TTGACCTTTA 


CTCTAAGCAA 


5160 


GGGATACTCG 


CAGAACCTGC 


TGGAGCCGCT 


AGTATCGCCT 


CTTTAGAGGT 


TTTAGCTGAA 


5220 
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TATATTAAGG GGAAAACCAT TTGTTGTATC ATTTCTGGAG GAAATAATCA TATCAACCGT 5280 

ATGCCAGAAA TGGAAGAGCG TGCCTTGATT TATGATGCTA TCAAACATTA CTTTGTGGTC S340 

AATTTCCCAC AACGTCCAGG AGCTTTGCGT GAGTTTGTAA ATGATATCCT GGGCCCAAAT 5400 

GATGATATCA CACGTTTTGA GTATATCAAA CGAGCTAGCA AGCCAACAGC CCCAGTATTA 5460 

ATTGGGATCG CTTTAGCAGA TAAGCATGAT TATGCAGGTT TGATTCGTAG AATGGAAGCT S520 

TTTGATCCAG CTTATATTAA CTTAAATCGT AATGAAACGC TTTATAATAT CCTTGTCTGA 5580 

GGACTAATAA AAAAATATCA TACCTTCATT TTGATTTCCT ATCTATTGAC AAGCATAGTC 5640 

ACACTGTCTT TAATACTCTT CGAAAATCTC TTCAAACCAC GTTAGCTCTA TCTGCAACCT 5700 

CAAAACAGTG TTTTGAGCAA CTTGCCGCTA GCTTCCTAGT TTGCTCTTTG ATTTTCATTG 5760 

AGTATAAGGT ATGATTTGAT TTCTTTTTCT TGACAAATAT ACT AT ATT AA AAAGATATAT 5820 

AAGTAATTAA CTGAGCTTAT CTGTCTTGTC ATCTCTATTA AGGATGGTTT AGATAATCGG 5880 

CTGTCTGCTT CTAGGCTAGC ACCTCAATAT CCAAAGGAGT GATCAATTTC AACGACATAA 5940 

GGAATACCTA TCTCTCAGAT GATTTATTCA GGAAGAAAGA TAGGAGTTTT TGAGCTAGTG 6000 

AAGGCTTGGA TTTCTAAAGG TTAGAACTAT CATCTTCAGT TCTTAAATCG AAGAAATAAG 6060 

CTATCTTACC GAAATAGAGA AGCATTTTTT AAGAACTTGA AT AATTTCGC ACCTTAAGAG 6120 

GGTAATAATA CAGTATTTTT ATTAGCAAAT ATTTATGGTC TAGAGGCTAG CAAAACCTAT 6180 

ATATTATCCG ATTTAAAAAG GAAGTAAGAA A 6211 
(2) IN FORMAT I ON FOR SEQ 10 NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCCCACTCCC CACGATTCTT CAAAATAACT GAGTATATTT CTATCTTGAT TTTCAGATAT 60 

AAATTCTTCC TTCTGTGGCC TCTTCTTACG CTTGAGAAGA GCTTCTCCGA CATGGCTTCT 120 

TCCTTACTGA GCAAAACCTT GAGCATAGAT AAGTTTGACT GGCAACCGTG CTCTTGTATA 180 

TTTGGCTCCC TTCCCACTAT TGTGGATAGC GAGGCCTCTT CTCATATCAG TCGTATAGCC 240 

TATATAGTAG GATCCATCAC GACACTCCAG AACGTACATA TAAGCCTTAT GATCCATAAT 300 

AAATCTCTTC GATTTCGGGC CTATAAGAGC CATCATCATT GTGGACAATC AAAGGAGGTA 360 
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AGACCTTAAA 


GCCACTTGTT 


GACCCATCCT 


TGATCGCCTC 


AATCAAAAGC 


ATATTGGCTT 


420 


CCTTTTCTCT 


TTTTGGATAA 


ACAAACTGCA 


GGCGCTTAGG 


GGCTAGATTA 


TGTCGTTTTA 


480 


ACGTATCCAA 


AATATCCAGA 


AGTCGATCAG 


GACGATGAAC 


CATGCCCAAA 


CGCCCATTAG 


540 


ACTTGACAAT 


ACTCTGCGCA 


CTACGACAGA 


TTTCTTCCAA 


ATTAGTCGTG 


ATTTCGTGTC 


600 


GAGCCAAGAG 


ATAATGTTCA 


CTCTCGTTCA 


GATTAGAATA 


AGGATTCACC 


TTGAAATAGG 


660 


GTGGATTACA 


CAAAATCATA 


TCCACCTTAC 


TCCCCTGAAT 


GTGAGCACGC 


ATATTTTTCA 


720 


AATCATCGCA 


GATGACCTGC 


ATTTGCTCCT 


CTAATCCATT 


CAAACGGACA 


GAGCGTTCAG 


780 


CCATATCCGC 


CAAACGCTCC 


TGAATCTCAA 


CAGACAATAT 


CTGTGCTTGA 


GTACGAGTGC 


840 


TAGCAAAAAG 


CCCCACTGCT 


CCATTCCCAG 


CACAGAAATC 


CACAATCAAC 


CCCTTCTTAG 


900 


GAAAACCTGG 


AAATCGTGAT 


AAGAGAACAC 


TATCCACCGA 


ATAGCTAAAA 


ACCTCTCTAT 


960 


TTTGAATGAT 


TTTGATATCT 


GTCGAAAAGA 


GCTGGTTAAT 


CCGCTCTCCT 


GATTTTAATA 


1020 


ATTGTTCTTC 


TTCCATCGTC 


CTATTATAGC 


AAATTCATAT 


TAACATTACA 


AAAAATATAA 


1080 


AACTCTAAAC 


TACTTCTTCT 


TTTTTAAATG 


CTGCAGGGCT 


TCTCCAGTCC 


AGATTGGTAG 


1140 


CATTCGTCGA 


AAGGGACCAA 


AGCCGTAGTT 


AAAGCGGTCG 


CTTGAAAAGC 


GTCTCCGTCT 


1200 


AGGAAACTGG 


TACTTTTCTT 


CCTCCAAAGT 


GCGCATAGAA 


ACACTGGCTT 


TCCCTCTAAA 


1260 


TTCATCTAAA 


TCCACTACCT 


GAACTTGAAC 


CTCTTCATCG 


ACTTTCAAGG 


TTTCATGAAT 


1320 


ATTTTCAATA 


AATCCTGTCC 


GAATCTCTGA 


AATCTGAATC 


AGCCCCGTAT 


CACCCGTCTC 


1380 


TAACTCAACA 


AAGGCACCGT 


AGGGCTGAAT 


CCCTGTAATA 


CGCCCCTTTA 


GCTTATCACC 


1440 


GATTTTCATC 


TTACTCCTCG 


ATTTCAATAG 


TTTCAATTAC 


AACATCTTCA 


ACTCGCTTCT 


1500 


CCATAGCTCC 


TGTCTCAACA 


CCACCAATGG 


CATCCAACAC 


AGCGTAAGAT 


GCTTCATCAG 


1560 


CTAACTGACC 


AAAAACCGTG 


TCACCGCGCT 


CTAGGTGAGG 


TGTCCCACCT 


TGATTGGCAT 


1620 


AGATTTCTGC 


AATCGGTTCT 


GGCCAACCAC 


CACGAGTAAT 


TTCTTTCTTA 


GAATAAGGTA 


1680 


GGTGTTGGTT 


TTGCACGATA 


AAGAACTGGC 


TGCCGTTGGT 


ATTTGGACCA 


GCATTTGCCA 


1740 


TGGAAAGAGC 


ACCACGGATA 


TTGTAAAGCT 


CTTCTGAGAA 


TTCATCCTCA 


AAAGATTCCC 


1800 


CGTAGATTGA 


CTCGCCACCC 


ATACCAGTTC 


CAGTTGGCTC 


TCCACCTTGG 


ATCATAAAGT 


1860 


CCTTGATAAT 


ACGGTGGAAA 


ATGACACCAT 


CATAGTAGCC 


ATCTTT7GAA 


AGAGATACAA 


1920 


AGTTAGCCAC 


TGTTTTAGGA 


GCATGTTCAG 


GGAAAAGCTT 


GATACGTAAG 


TCTCCGTGAT 


1980 


TGGTCTTAAT 


AGTCGCAAGA 


GGACCTTCTA 


CTGTTTCAAT 


GTCTACTTGT 


GGAAAATGCA 


2040 


ATTCTTTTTC 


TACCATACCA 


AATACTTCTA 


AGGCAGCAAA 


AATGCCATCT 


TCTTCTAATG 


2100 


TTTTTGTAAT 


ATAATCTGCT 


TTTTCTTTGA 


TTTTATCATG 


AGAAATTCCC 


ATGGCAACGC 


2160 
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TCATTCCACC ATAATCAAAG AGTTCCAAGT CGTTGACACC ATCTCCAAAA ACCATGACCT 
TCTCTGGTTT CAAGCCAAGG TGTTCCACAA CCTTTTCCAC CCCCCTCGCT TTGGAGCCTG 
AAATCGGCAC AATATCACAC GAATGTTGAT GCCAACGAAC CATGCGAACT TTCTCTGAGA 
GACTGTCAGG CAAGTGCAAG TCATCTCCCT TATCTTCAAA ACTCCACATC TGATACATAT 
CTTCTTTTTC ATGGAAATCG GGATCTACAT CTAAGTCGGG ATAAATTGGA TTGATAGCTT 
CACTCATCAT ATCCGTGCCA GTCGACAACT TGGCATCATG ACTCCCAACC AAGCCATACT 
CAATTCCTTC TTGCTTAGCC CAAGAGATAT ACTCCTCAAC ATCTGACTTT TCAATCTGAT 
CCTGATAAAT GACCTGACCT TTTTTATCTT CGATATAAGC CCCATTCAAA CTTACAAAAA 
AGTCAGGCTT GAGATCACGA ATCTCTGGAA CAACACCAAA AATGCCACGT CCAGAGGCGA 
TTCCTGTTAA AATTCCTTTT TCACGCAACT GTTTAAAAAC AGTGGGAATT CTAGTTGCAA 
TAAACCCTGT CTTTGAATTC CGCAATGTAT CATCAATATC AAAAAAGACA ATCTTGATCT 
TCTTTGCCTT CTATCTTAAT TTCGCGTCCA TCTCACTACC TCTTTCAATC TAACTCTTTC 
CATTATATCA TAAAGTAGGC AAATCCCCTA TTTTCAAAAA GTTTATCATT TTTATTTTAA 
TTTCTTGGAT GAGAAAAGAG ACATATTTAT GAAAAAGCTC CATCGTGCTT TTAATGTCTT 
CTCTTGTTTT CAAACTCGTA AAAAGGGACC CACTGATCCT AACTCGCTCT CTCATTTCAA 
AGCTTGTGAA AAAAGACCCG TTGGGGTCTT AATTCGCTTT CTTGTTTTCA AGCTCATGAA 
AAAGAGACCC AACTGGGTCT TTTCTTTAAT CTTCGTTTAC GAAAGGCATC AAAGCCATTA 
CCCGAGCGCG TTTGATAGCT GTTGTTACTT TACGTTGGTT TTTAGCTGAA GTTCCTGTTA 
CACGACGAGG AACGATTTTC CCACCTTCTG AAACGAAACG GCTAAGAAGC TCAGTATCTT 
TGTAATCAAC ATATTCAATT TTCTTTGCTG CGATGTAATC AACTTTTTTA CGGCGTTTGA 
ATCCGCCACG ACGTTGTTGA GCCATGTTTT TTCTCCTTTA TAACTTTAGT TGTCCATTAG 
AATCGTAAAT CATCATCTGA AATATCCAAT GGCTTTGTTC CTCCAAATGG ATTTTCATTA 
CGTGAAAAGT CTGGTACTCA ATTTGTAGGT GCTGAATAGT TTGCAGTTGG TGCAGAGTAA 
GCTCCACCTG TGTGACCCTC ACGCACACTA CGGCTTTCCA ACATTTGGAA ATTCTCAGCC 
. ACGACCTCTG TCACGTAGAC ACGTTGTCCT TGCTGGTTAT CGTAACTACG AGTCTGGATA 
CGACCTGTCA CCCCGATAAG TGAGCCTTTT TTAGCCCAGT TAGCAAGATT TTCAGCCTGT 
TGGCGCCACA TAACGACATT GATAAAATCA GCCTCACGTT CACCATTTTG ACTCTTAAAT 
GTACCGTTTA CTGCAAGAGT AAAAGTCGCA ACTGCTACAT TTGATGGGGT ATAACGCAAC 
TCAGCGTCAC GTGTCATACG CCCTACAAGT ACAACATTGT TAATCATACT TTACCTTCTT 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 
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ACGCCTCAAT TTTGACGATC ATGTGACGAA GAATCTCAGC GTTGATTTTT GAAAGACCGT 3960 

CAAACTCTTT AAGAGCTGCA TCGTCATTTG CTTCAACGTT AACGATGTGG TAAACTCCTT 4020 

CACGGAAATC TTGGATTTCG TATGCAAGAC GACGTTTTTC CCAAGTTTTT GATTCAACAA 4080 

CAGTTCCACC GTTGTCAGTC AAAATAGAGT CAAAACGTGC TACCAAAGCG TTTTTAGCTT 4140 

CTTCTTCAAT CTTTGGACGA ATGATATAAA GAATTTCGTA TTTAGCCATT GATATGTTCC 4200 

TCCTTTTGGT CTAATGACCC CAAGACTTTG CAAGGGGTAA GTGAGGTTCG CTCACAATAA 4260 

ACT ATT AT AC TAGAAAAAAT TTTTTTACGC AAGTAAAAAC ACTAGAATTC CAAAAAACGC 4320 

CACATGCGCG TTTTCCTGTT CTTATGGTTT GATACGGTGC AACATACGTG GGAATGGAAT 4380 

AGCTTCACCG ATATGTTTTC TTCCTGCTGC GAAGGTTACC ATACGTTCGA TACCGATACC 4440 

AAATCCTCCC TGTGGAACTG TACCGTATTT ACGAAGGTCA AGGTAGAATT CATATTCTGT 4500 

ACGATCCATG CCAACTTCAT CCATCTTAGC GACAAGGGCA TCCTAATCTT CCTCACGCAT 4560 

AGACCCACCG ATAATTTCTC CATAGCCTTC TGGAGCAAGC AAGTCTGCAC AAAGCACCCG 4 620 

CTCTGGATTT CCAGGAACTG GTTTCATGTA GAAGGCCTTG ATGGCTGCTG GATA G TTCAT 4680 

GACAAATGTT GGCACACCAA AGTGGTTTGA AATCCAAGTT TCGTGTGGTG ACCCAAACTC 4740 

ATCACCATGC TCAAGATCCT CCTAGTCAGC ATCTTCATCA TTTTCATCCT CTTGCAAGAG 4 800 

GTCAATGGCT TGATCGTAAC TGATACGTTT GAATCGCTCT GCAATGTAGC GTTTCAAGAG 4 860 

TTCTCTATCA CGTTCCAACG TTTCCAAGGC TTGAGGCGCG CGGTCAAGAA CACCTTGTAG 4920 

AAGAGCTTTC ACATAAGCTT CTTGCAACTC AAGCGACTCA TCATGTGTCA AGTATGAGTA 4980 

CTCAGCATCC ATCATCCAGA ACTCAGTCAA CTGACGGCGT GTTTTTGATT TTTCAGCACG 504 0 

GAAAACTGGA CCAAAGTCAA AGACACGACC AAGACCCATA GCCCCTGCTT CTAGGTAAAG S100 

CTGACCTGAT TGGCTCAAGT AGGCTGGCGT TCCGAAGTAG TCAGTTTCAA AGAGTTCTGT 5160 

AGAATCTTCT GCCGCATTTC CTGAAAGAAT TGGGCTGTCA AACTTCATAA AACC G T TC TT 5220 

GTCAAAGAAC TCATAAGTTG CATAGATAAT ACCGTTACGG ATTTGCAACA CAGCTACTTG 5280 

CTTACGAGAG CGTAgCCACA AGTGACGGTT ATCCATCAAA AAGTCTGTTC C C T C TTCT T T 5340 

TGGTGTGATT GGGTAGTCTT GAGATTCACC GATCACTTCG ATGTCTCTGA TGTCCAACTC 5400 

ATAGCCAAAT TTAGAACGTT CGTCCTCTTT GACAATACCT GT C AC AT AAA CAGACGTTTC 54 60 

TTGGCTCAAG CGTTTGATAA CATCAAACTT CTCAAGTCCC ACTTCTTCAC CAAATTTTTC 5520 

GACAAAGTTT GGTTTAAAAG CCACACCTTG AAAGAAGGCT GTTCCATCAC GCAATTGTAA 5580 

GAAAGCGATT TTTCCTTTTC CTGATTTGTT GGCAACCCAA GCGCCAATCG TCACTTCCTG 5640 

ACCAACATAG TCTTTTACGT CAATAATCGT TACACGTTTT GTCATTATTT TTCCTTTTCT 5700 
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TTTTTATTCT 


mm fk ft ft ft 

TTATGGV.AAA 


l_l»s\\.l>. 1 L * /** 


ATTGTTCCCA 


TCCAGCTCAA 


TCATAAAAGC 


5760 


ACCATAGTAA 


ATCGGATGCT 




ACCAGGAGCC 


CCATTGTCTC 


GCCCACCTCC 


5820 


CTCTAAGCCA 


GCCTCATAAC 




TTCTTCCTTA 


TTTTCTGCTA 


AAAAAGCAAA 


5880 


ATGAACAGGA 


TCTTGTCTTC 




CCAAAAATCA 


CCACCAGGAT 


GAGGGCTGTT 


5940 


CGGGGATAGA 


AAACTAATTA 


GAGAACTAGT 


CTTAAAAGCC 


AATTTATACT 


CCAAAGGAGC 


6000 


GAGAAAACTC 


CTATAAAATC 


CTTATGAAAT 


TTGTAAATCC 


TTTACCTTAA 


TCTCAAAATG 


6060 


ATCAATCATT 


CTCACTACCC 


ATAAATGCTT 


TCAAGCGTTC GACTGCTTCT 




6120 


CTAGGTCTGT 


CGCATACCTG 


AGGCGGACAT 


1 1 I Iw I «>• 






6180 


CCAAGGCCAC 


TTCCGCTTCT 


TCTAAGATAA 








6240 


CTTTCATCTC 


CATGG CCTTT 


TTGACATTTG 




CAAGGCCCCT 


TGCGGTTTGA 


a inn 


CCACTTCAAA 


TCCTCGTACC 


TCTGCAAGGA 


t^nf^fl 4.T AC AT 
WmuA A Hurt • 


GGTATTAAGA 


UU 1 1 LI- 1 


6360 


AGGCCTGACG 


CATGCTTTCT 


ACAGTATCTT 










ATTGGGCTAC 


TGCTGACGGA 


TTCGAAGTTG 




AATCTTGGAC 






TAATGTCTGC 


TTCTCCAACG 


GCATAACCAA 




AGTCATGGCA 


i ill! nu 


6540 


ACACACCATT 


GATGACCACT 


CTTTGCTTGC 


GAATCGCTTC 


CGATAGGCTA 




6600 


TGAACTCATG 


ACCATTATAA 


ACCAAGCGGC 


CATAGATATC 


CTCTGCTAGG 


ATflAGAATAT 


6660 


CATTTTCTAC 


AGCCCAGTTT 


CCAATTGCCA 


AGAGTTCCTC 


ACGGGTGTAA 


ATCATACCTG 


6720 


TGGCATTAGA 


TGGCGAATTC 


AGCACCAAAA 


CCTTGGTCTT 


GTCACTCCGA 




6780 


ACTGCTCTAC 


GGTCACCTTA 


AAGTGATTGT 


CTTCCTTAGC 


AGAAACAAAG 


ACGGGAACCC 


6840 


CTTCTCCCAT 


CTTGACCTGA 


TCTCCATAGC 


TAACCCACTA 


TGCCCTTCCG 


ATGATGACTT 


6900 


CATCACCTGG 


ATTGACCACA 


GCCATAAAuA 


AGGTATAGAG 


AGAATATTTG 


GCTCCCGCAG 


6960 


CGACTGTCAC 


TTGATTTGAC 


GCTACAGAAT 


AGCCGTAAAA 


GCGCTCAAAG 


TAGCTATTGA 


7020 


CCGCCGCCTT 


AAGCTCTGGC 


AGACCTGAGG 


TTACTGTATA 


AAAAGAAGCA 


CGCCCATCTC 


7080 


GAATCGATGC 


AATCGCGGCA 


TCTTGGATAT 


TTTTGGGAGT 


AGTGAAATCT 


GGCTCACCCA 


7140 


AGGTTACAGA 


CAAAATATCT 


CTACCCTCAG 


CCTTCAGTGC 


TTTGGCACGG 


GCTCCAGCAG 


7200 


CCAAACTCAC 


ACTTTCTTCC 


ATTTCTAAAA 


CACCGTTGCA 


TAGTTTCATA 


GGCCCTCCTT 


7260 


CTTGACCAAT 


GCTCCTGTTT 


CAAAATCTAC 


TAGATAAAAA 


TCAGATCCTG 


ACTTAACTTC 


7320 


CCAGATTGGC 


TTATCTTGAT 


AACGGCCAAA 


GGTTATCTTG 


TCAATCTCGC 


CAGCTCCCTT 


7380 


TTCCTTAGAA 


ACCGTTTCTG 


CTTTTTCTTG 


TGAAACACCC 


TGATTTAGCT 


GATAAACCTA 


7440 
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AATCTTATGG TCATCTTTAC CAATCAGGAC AGCAAGCCCT TCTTGCTCTT TCTTACCACC 7500 

AACAACGCTG TAATAAGATT CCAAGCCATT GTATAAATCA ACCTGATCAG CCTGCTCTAA 7560 

TCCTGCATAC TGCTGAGCTA ATTTTTCTCC TTCACTTTTA CCTGTTTGAT AGGGTTTCAT 7620 

CCTAAGAGAA ACCATATACA GAAACGAACC ACTGATAACC ACAAACAAAA TCGTCATCCC 7680 

TAGACCATAC TGCCACACTA GATTATTTTT TGCTTTGTTT TGTCTTTTTT TCACTCGTCT 7740 

ATTTTACCAT CTATTAAGCT TT ATT AC AAG TGAATATAAG AATACTCTTC GAAAATCTCT 7800 

TCAAACCACG TCAGCTTTAT CTGCAGACCT CAAAGCTGTG CTTTGAGCAA CCAATTCTAT 7860 

TTCTCCCTTC AAACAAAACC GATTTTGAAA GTGAAACAGT TCTTACTTTT TCAGTCACAA 7920 



ATGATTAGAG TTT G CCGGG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CCGCTCTACC 


GTCAAATAAT 


TACCATTTTG 


TTTAATACCG 


AAATTTTTAT 


CTACTGAAAA 


60 


TTCAGTTGGT 


CTGTTGGTAC 


CATCGTCGTA 


TACACTACCA 


TTCTCACGAA 


TAGTATAATT 


120 


CTAATCACTA 


TCACCTTGTT 


TCCTTAATTT 


AAGGTAATAA 


TTACCATCAA 


TTTGTTTATA 


180 


ACCTGAATCT 


TTTCTAGTTG 


CTTCTCTAAA 


ACTTACTCCA 


GCAGGCATCA 


CATCAGCAAA 


240 


CATGACTACT 


TGTTTGTTCT 


TTTTTTCAAC 


AATAACAGAG 


TCAATATAGG 


TTGCACCACC 


300 


GCTGATTTGT 


AAGTCACGTC 


CACCAACTTC 


ACGAGGCCAT 


TCTAATGGTA 


CTGGCGCAAA 


360 


ATCATCGAAT 


GCCAATGTTA 


ATTTTGGTTT 


AGTCCATGTC 


TTACCATTAT 


CATCACTATA 


420 


ACTTGTAGCA 


ATATTAATTT 


TATTCAAGAA 


ATCATGAGTT 


CCACCGTAAC 


GAGCCTCAAT 


480 


GCTTGAAAAT 


ACCCGACCAT 


TGCTAAAAGT 


ATACAGAACT 


GGAATACGGA 


AATAGTTAGA 


540 


ACCTGTTGTA 


TCATTAGCCG 


TATAAATTAA 


ATCTCCAGTA 


ACAGCGTTTG 


TTGTCATCTT 


600 


TTTAACAGTT 


TCTTCATCCA 


ATGCACTATT 


AAACAATTTG 


ATATTTTCTA 


GTGTTCCGTT 


660 


AAAACCAAAC 


CCCCTTTTTC 


CTGCACGTTT 


CACTCCCCCA 


AGCATATAGT 


AATCAATACC 


720 


TTTAATATCC 


TTGATGTTTA 


GGAAATTATC 


CACTTTCTTT 


TCTACTACTT 


TTGTACCATT 


780 


TGCGTATAAA 


GAATATGTTT 


TTTTGACTGA 


ATCTGCTACT 


ACTGCAACAG 


TGTTAGTCAC 


840 


AGCCTCTTGT 


TTGTACTTAC 


CCCAAACTGA 


AGCAGGTCTG 


GATACTAGGT 


TATTTTTATT 


900 
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-*«■» M ft fL ft ft ^+0B% ft 

GGAAGAAGTA 


TCACGCGCTT 


CCATLULL/WI 




TCTCTAAGGA 

4 W * w A #™W"*ft».p *J i * 




960 


ATAACTATTT 


IVI luACCuvi 


f<T>^/*T & ITT 




AACAGAGCTT 


GTAAGCCTTT 


1020 


CTCACTTGAC 


TGATTGTACT 


TAfll l_ AV- 1 A»» 


Af?TAJVAf?TCA 
Mv» i Anno i. wn 


CCGCTAGTAA 


ATTTATCCTT 


1080 


TAACTCTTTA 


GTAACATTTT 




1 \j i I MnAu 1 A 


A^* AT'T^ A^PT'T'T 
AW/% i Inl 1 1 i 




1140 


AGGAGTTTCT 


TCCGCTGTAG 


AAGATGGATC 


Lll AAWAU I A 






1200 


TACAGTAACT 


TCCGAAGAGT 


HI m akAAA ft MMII 

TATCCGATGT 


AGGTTGTACT 






i o An 


TGCAACACGT 


TGCACCAACT 


TTGGTGTTGA 


TACTTCAuAA 


GTTTCAGTCT 


I VjAl4\_ 1\A» 


i Jiw 


AACTGAGTTA 


GCAACAAATG 


CTGATAATAC 


CACT ACAu T A 


H- 1 AAvjIj 1 I A 


CATATTGTTT 




AATATTTTTT 


TTCATTTTAT 


TTTTCCTCGT 


TTAAAACTTT 


GATAACAAuT 


TTTTTAACAG 


1 A A rt 


TTTCATCATT 


GCAATGAATC 


TTTGGTTGGT 


GAAGATCTTC 


TTCAAAAGTC 


ft It ft ^ m «a» f> |p 

ACC-AACATAT 


1500 


TCCCTGCAAG 


CAATTCAACA 


ATTTGATAGT 


CTTTGCTATC 


i^ip ft ft ft ft ft A 

GTAAAAAGCA 


ATATaTCTTCT 


1560 


CTTCGCTAAA 


AGGTACACGT 


GACTGGGCAC 


GAACTGGGGA 


ft -fn«.Mt ft. ^ftMlj^-* A A 

AGTTACTGCC 


A i rrm\AG 


1620 


TATTTTCAAC 


AACAATATGA 


ATATCTAAAT 


ATTTCTTATG 


AGTTTCAAAA 


ATATCTCCTG 


1680 


G AACTCC ATC 


AGCTAGATAA 


GTCATACAAT 


HBlH" ^ It It Ik ft ft ^ 

TTGCAAAAAC 


ATTTTCCCCb 


TCAATATLAA 


1 IH O 


TTTTTCCATC 


AACTAAATCT 


GTCAAATTTG 


TATTTTCTAA 




ACTTTTGAAA 


i ann 
i. BUU 


AATATTTATT 


GACAGAAGCA 


TATCCTTTAA 


AATCAGATTG 


•fVI*^ It ft. m ft (ft ft 

1 ICAGAAATA 


A l WV1 ATI AX 


looU 


TTTCTCTTTT 


CTATTAGTGA 


CGAACTTCCC 


ft ft /^ffWfli^ ft ft **ft^ 

AACTTGAATC 


CGCTTTAATT 


TtTGTAATAT 




CATCAATCGT 


TCTATATTTA 


GGTGCAGATA 


CTTTATTTCC 


It / ft Jl /-• fl ft » 


«n i ALAA iAl 




AACCTGAAAC 


TACTGATACA 


GAGATTGAAA 


TCAATCAATA 


I bUUUAu I n« 




5 ft A ft 


TTGGAGGAAG 


GAAGTATTTA 


ft M » ft ft ft A 

ATAAATACCA 


I<JAUV»A 1 1 


I uA 1 AUnn i 




' 5i on 


AAGCACCTTG 


TTTATTTGCT 


TTTTTAGAAA 


V AAM 1 


A AX A A AT AC A 


CCACC AAGTA 


2160 


GACCAAGTAC 


AAGTCCCATG 


AAACTATTGA 


ACCATTCGTA 


TGCAGATTTA 


ATATCTGAGT 


2220 


GAGCCATGAC 


AATGGAAACA 


CCAATTGAGA 


ATAAACCTAC 


TGCTAGAGAT 


ACGAATTGTG 


2280 


CAATTTTCGT 


ACGACGATTG 


TCTGACATAT 


TTTTAGAAAT 


GACATCTTGA 


ATATCCAATG 


2340 


TCCATGAAGT 


TGCAACAGAG 


TTCAAACCTG 


TTGAAATAGT 


TGATTGAGAT 


GCTGCATAAA 


2400 


TCGCTGCCAA 


GATCAAACCT 


GTGATACCTA 


CTGGTAACTG 


CTATGCAATA 


AAGTACATAA 


2460 


AGATTTGGTC 


TTGAGGGATA 


TTGCTAGCTG 


CACTATCTGC 


ATTTTGTACT 


TGATAGAATA 


2520 


CCTACAAGCC 


TGTACCAATC 


AAGTAAAAGA 


CTGTTGCAGT 


TCCAAGTGAC 


AAAACACCGT 


2580 


TTGTGAACAA 


CATCTTATTA 


ACTTTCTTAA 


TATTTTCTGT 


TGTAGTAAAA 


CGTTGAACCA 


2640 
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AATCTTCAGA 


TGAAGCATAG 


GAAGACAAGA 


TTGTAAAGCC 


TGAACCCATC 


ACAATTAAAA 


2700 


AGATGGAGTT 


TGAAAGCAAG 


TTAGGATCGA 


AAAGTTTTTC 


ATTTGCAGCA 


AGGAATTTCC 


2760 


CGTTTGCTAA 


TGTTTCTGCT 


ACTGCACCAA 


AGCCACCTTT 


AATATTAGCA 


ATCAGTACAA 


2B20 


ATAAAGCTAA 


AACGACACCA 


CTAATCAGAA 


TCACACCTTC 


AATAAAGTCT 


GTCCATAATA 


2880 


CCCATTTTAG 


ACCACCAGTA 


TAAGAATAAA 


CAATTGCAAC 


TACACCCATC 


AAAATAATCA 


2940 


AAATATTGAT 


GTCAATTCCT 


GTCAATACTG 


ATAAACCACC 


TGATGGGAGG 


TACATAATGA 


3000 


TAGACATACG 


TCCCAATTGA 


TAAATAATAA 


ACAAGAGTGC 


TGAAATAATA 


CGAAGTGCTT 


3060 


TAGAATTAAA 


ACCTTTATCC 


AAGTAATCAT 


ATGCCGTATC 


GATGTCTATC 


CGTGCAAAGA 


3120 


TAGGTAAGAT 


AAAACGAATT 


GTCAGTGGAA 


TAGCTACTAC 


CATCCCTAAT 


TCAGCAAACC 


3180 


ATAAAATCCA 


GCTACCTGCA 


TAAGAGCTAC 


CACCGAGTCC 


CAAGAAGGAA 


ATCGGACTGA 


3240 


GCATTGTGGC 


AAAAATGGAT 


ACCGAAGTAA 


CATACCAAGG 


AACCGAACCA 


TCTCCTTTAA 


3300 


AGAACTCTTT 


TCCTTTCATC 


TCTTTTTTAG 


AGAAATAGAT 


ACCTGCAACC 


AACACCCCAA 


3360 


GTAAATAAAC 


AATCAAGATA 


ATTAAGTCAA 


TTATTGTAAA 


TCCTGTTGTG 


CCCATAACAT 


3420 


ATCTCCATAT 


TGATTTTATT 


TATTATAAAA 


ATTCTTTTCC 


TGCTTGTTGA 


ATAAGTTCTC 


3480 


CTGCTTGTTT 


TCCAACTTCC 


AAGTCACCTT 


CTGCCAATGC 


TTCTAAAGGT 


TGACGAACAG 


3540 


AACCTAAATC 


AAGTTTTTCA 


TTTAGACGCA 


AAACTTCTTT 


TGCTACAGCA 


TACATATTTC 


3600 


CCTTACCTGA 


TATCATCTTA 


TAGATAACTT 


CATTGATAGC 


ATATTGAAGT 


TTTTTAGCTC 


3660 


TATCTAAATC 


TCGTTCTTGA 


ATCAAACTTT 


CCAATTTCAA 


GAACAAATCT 


GGCATAACGC 


3720 


CATAAGTACC 


ACCAATACCA 


CCTTCTCCTC 


CCATCAACCC 


ACCACCAAGA 


TATTGTTCAT 


3780 


CTGGACCATT 


GAATACAATG 


TAATCTTCTC 


CACCTGCAGC 


TACAAACATT 


TGAATATCTT 


3840 


GTACAGGCAT 


AGAAGAATTT 


TTAACTCCAA 


TCACACGAGG 


ATTTTGACGC 


ATTGTTGCAT 


3900 


ACAAACTACC 


AGTCAACGCA 


ACCCCTGCCA 


ATTGTGCAAT 


ATTATAGATA 


ATAAAATCTG 


3960 


TATTTCACGC 


AGCTTCACTC 


ATTGCATTCC 


AATATGCTCC 


GATTGAATAC 


TCTGGCAATT 


4020 


TGAAATAAAT 


AGGTCGGATA 


GCTGCAATAG 


CATCGACTCC 


AACACTTTCT 


GAATGTTTTG 


4080 


CCAATTCGAT 


ACTATCTTTC 


GTGTTATTAC 


ATGCAATATG 


GTTGATAACT 


GTTAATTTAC 


4140 


CTTTACCAAC 


TTCCATAACA 


GCTTCAATAA 


TTTGTTTACC 


ATCTTCTACA 


CTTTGGTAAA 


4200 


TACATTCACC 


TGAAGAACCA 


TTTACATACA 


TACCTTTTAC 


ACCTTTGTCA 


ATGAAATATT 


4260 


GTACCAGAGA 


TTTTACACGA 


TCTTGGCTAA 


TTTCACCATT 


TTCATCATAG 


CAAGCATAAA 


4320 


ATGCAGGGAT 


AACGCCTTTG 


TATTTAGTTA 


AATCTTTCAT 


CAGATTTCTC 


CTTT AT ATTG 


4380 


TTTTTTATTT 


GATGACATTA 


ATAAATCGCT 


CAGCAATTTC 


TTTTGGACGT 


GTAATCGCTC 


4440 
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TAT AAG CTTT 


TTTTAATTGT 


TCTGGATAAT 


4500 


GAATTTTTCt 




LLuuAnlAi * 


AAAATCAGCC 


AATTTTTTCA 


TTAGTTCAAA 


4S60 


ATCAGCCTCA 


TCTGATTGTA 


A l\» i l\\m 4 


TCTGTAACCT 


GATAATGTTG 


TACCAACAAA 


4620 


ATCAACGCCT 


^* ft ffVfVf* ft ft ft ^ft^ 

GATTTAAATG 


^ ft (M ft ^ ft ^ ft f^f* 

CATAuAuALw 




TT ACTT AC AT 

4 4 4 4 «* 4 


CCGCCATCAG 


4680 


^ ft ft ft Mnn 

CAATTGATTC 


GGATATTTTT 


CTTTTATTTT 


TTTCATAAAT 

4 4 A 4 rW* 4 


TCACTGACAA 


CTAAGCCATC 


4740 


ATATCTTGGT 


CTTAAAGTTG 


CATCAAATGC 


An ruAL lull 


WA AWWwW\£, 4 


CTACAAGTTC 


4800 


ATCTACTTCT 


TTCATCGTAG 


CAGTAATATA 


TGGTTCTTGA 


CCTGCATAAT 
ivjun 4 nn 4 


WWW * • • * 


4860 


AATTCCAATT 


ATTGGTAAAT 


CTACTACTTT 


w i unA 4 iww i 


TTAATATCAC 

4 4 r^f\ 4 4 wn.w 


CCACAGAATT 


4920 


TCCGCGAATG 


CCCACTGCTC 


CTGCCTCTAA 


AGCTGCTTTA 


*^of*A*PAAAAft 


CCATCAAGCT 


4980 


AAATTCTTCA 


TTATAAAGGG 


CTTCACCAGG 


TAAAGCTTCA 


* ft /-•». ft »pkk 


TCACTCCACC 

A VM\V> * W\-/\UV- 


5040 


TTGAACTTGG 


CTTATAAATT 


J I j rTC J l 'I u r AGT 




CTCATTTTAT 


4 #» 4 4 4 \* W 4 


5100 


TATCGATAAT 


AGTTTGATTG 


T AAT AAT ATT 


GTCTCTCTGG 


nw ill Lv.MOn 


T AATTACACA 
4 r%#% 4 4 nwn>#n 


5160 


ATAAGCAGTC 


TCTAATTAAA 


AGTATTGGAA 


ft ft f^^f^ & 

ACTGAGGTUA 




rrATACCACA 
Lw\ 4 AUuAVin 


5220 


GATGATCGGT 


CGAAGCTAAT 


ft ft ^ ft ft ft* ft ^IWf< 

AACAATAG 1 i 






TCCTCAAATT 


5280 


TTCTTGTAGT 


CATTAAAACT 


GTTTTAGCGC 


l_ i 1 lAlvi 0\- 


ft/**l "J"I , "I"I"I'^T 


ACACCTTCTA 


5340 


GTACAATATC 


AGTTTGACCT 


GAAATGGATG 


j wmu»*ft ft*m* ft ^ 

CTCCAATGAC 


ft « fv«p ft ft n*^*F 
AAGGCAATTl 


4^>/\ 4 A A/%w & n 


5400 


GTAACCTACT 


CCACAAAATC 


ATATCCTCGT 




TTCAPCAATC 


ACTCCGAGAC 


5460 


GCATAAATCT 


CATCTTCATT 


TCTTGTAAAG 


AAuAnwnu n 


A t "*^VC "PTTTA 

#\W 4 1 ^ V~ 4 A * n 


CCGTAGAGAT 


5520 


ATACACGCTC 


AGCAGTTTCT 


■k ft ft ^ 

ATCATCTCAG 




AAfVPTGAACT 
#v\w 4 4 w nnv» 4 


TCATCAAGAA 


SS80 


CCGTGTAAGT 


TTTTCTCAAC 


ATTTCCTCAT 


iCTCGGATAA 


AACTTTTTCT 


GTTGCCTCTG 


5640 


TATATAATGC 


CAACTTTTCT 


1 1L1 4 Vj#W% 


1 \m.f\ 1 *- » ^ 4 4 \J 


GTATTTGAAA 


ATGAATTCTC 


5700 


TAAAACCTTT 


AAAACCACAT 


TTTTTCGCAA 


ATCGAGTCAA 


TGTTGCTTTG 


GATACATTAA 


5760 


CGTATTCGCA 


CAATGCTTTA 


GATGAATAAT 


CATTCAGAGG 


TTGCTGTTTT 


AAGAAGAATT 


5820 


TAGCAATGTC 


TTTTTCAGCA 


TATGCCATAT 


TTGGTAAGTT 


AGCTTCTATC 


ATTGGAATTA 


5880 


GTTCTTTTTG 


CACTAACATA 


TGAGCTCCTT 


AGTTGAAGTA 


AACGTTTACA 


TTCTTTATTT 


5940 


TAACACTTTT 


TTTTTTTTTC 


AATATTTTTC 


ATAAATTAGA 


AACTAGTTTC 


CAATTTCTTT 


6000 


CGTTTCATAA 


CAGAACAACA 


AACATAAAAA 


TATAATAGTT 


TTTATTCTTT 


TTATCGTAAT 


6060 


TATATGTATT 


GTAAGAACGT 


TTATCACTAA 


TAATATGTTC 


ATATTAAAAT 


ATTTTAGTAA 


6120 


TATTTTATTT 


TGGTTTTATT 


ATTTCTTTTC 


GGAATTTCTA 


T AT AAT ATTT 


TATTTC T AAA 


6180 
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AAAATTGAAA 


AAATATTTCT 


AGTTTCTTTA TTTTATATAG GXAA2/TATT 


TTATTTCTAA 


6240 


ATTAAAAGAG 


AATCCCATAA 


AAACTACAGA 


TTTATGACAT 


AAATCAGGTC 


ACCTATTTTA 


6300 


AAAAAGCAGC 


AAACTATAAA 


CTAAAAAGTT 


CCACACCAAA 


TGTAACCCCA 


TACTTCCCCA 


6360 


TAAGTCAGAT 


TTATAGCGCA 


CCATACCTAA 


AAACATTCCA 


AGTGAAACGT 


ACAGACACCA 


6420 


AGCTAGAATG 


GTTCCTGGAT 


GATGTACTAA 


GGCAAATAAA 


ACACTTGTCA 


AAGCAACTCC 


6480 


AATATCTAAT 


TTTCTAACCA 


ACTTCCATAA 


AATTTCACGA 


TACAGAAATT 


CTTCAACCAT 


6540 


ACTCGCATTG 


ATTAAGAACA 


ATAAAAATGA 


AAACCAAGGA 


ACTTGATGTT 


GAAGGCCAAT 


6600 


TAAATTTGTT 


TGATTCGTGC 


TTCCTTGAGC 


ATGAATCAGG 


CTAAAACATA 


CACTTATAAT 


6660 


CAGTAGACTA 


GCTAGTCCAA 


TACCAAGGCA 


TTT CATCCT A 


GTTTTCATAT 


TGACCTTGAC 


6720 


CACTTGTTTT 


CCTTGACCAT 


ACATCCATAA 


AAAAGAAAAA 


AGAGACGCAC 


CATAGAGAAC 


6780 


CTGTAGTATA 


CTTAACTCAC 


CGATACAAAG 


AAATTTCAAT 


AAGTATAGAG 


ATACCAATAG 


6840 


GACATTTACT 


TGTTCGAATA 


TATAAACTGG 


AATTATTCTT 


TTCATAGTTA 


CCTCCGAAAT 


6900 


AAATCTTCAT 


AATCTAAATC 


TAATATCTGC 


ACAATCCTTT 


CTACCCATGG 


ACTTTGAGGC 


6960 


ATTCCTTGTT 


CCATCTTGTA 


GTGGCCAATC 


TTTTGATATA 


AACCATTCAA 


TTCACTTGCA 


7020 


TAGTGAAACT 


CTCCCGCAAA 


CATTTTTCTG 


GTTAACTCAA 


TCCAGCTGAT 


ATTTCTTTCA 


7080 


GCCAAAATAA 


TGGACAAGTT 


CTCCCAAAAT 


CGTTCAGCCA 


TATTrCTTCT 


CCTTTACTTA 


7140 


GATAAATAAT 


GTGTTTGyGC 


CATGTAAATC 


AATTGTTTCG 


TATCTCTTCC 


CAATAGACCT 


7200 


CTAGCCTCTT 


CCAAATTCAG 


ACTTCCATAA 


ACCCCCTTAT 


TTGAAACCAC 


AAAAGGAACT 


7260 


CCGATGGTTA 


GTTCAGGATT 


TTTTAAAATT 


ATCTCAACGA 


AATCCGTTAA 


TCTTAGATTG 


7320 


TCACGGTTCT 


TAAATCGTAA 


TAAATTGGGA 


GATAAAAACT 


CAAAACAATC 


TGAAGAATAG 


7380 


CT CAT C ATCT 


CAATTAATTT 


GTCCTTTGTC 


ATTTCAGAAA 


CTCAATGACA 


AGATACCTCA 


7440 


ATGCCATAGT 


TTTGGAAGAA 


GTCTAAAAGA 


AGTTCATTTC 


TTTGGCTATT 


TTTACTTACA 


7500 


TAGAGATCAA 


TCATGGGAGA 


CCTCCAACAA 


ATTTGCTTCC 


ATTTGATATT 


CTGAGACGAT 


7560 


TAAGGAATCT 


AACAACTTTG 


AGAAGTT AAT 4 


t CGATTTCTTG 


TCTTCATCAT 


AAGCTTTTAC 


7620 


AGTTACTTGG 


GTTGTAAGTA 


TCCCCTCTTT 


TCCCTCGGCT 


CGATAGTCTT 


GTCAATATAA 


7680 


AACAAAAACA 


AGATTCTGAT 


TATCATCTAC 


AAAGGCATTA 


ACTCCGTTCT 


TTATATCCTG 


7740 


ACTTTCAAGG 


AATTCCATAA 


CGTTTTGAAG 


ATAGGATTCA 


TAAAATAGTG 


GGTAATTATG 


7800 


TTTTTTATGG 


TAATCATCTA 


AAAATGTTAC 


CTCAAACTCA 


CATGGATAAT 


TGGGCATCAA 


7860 


AAATATTTGT 


TCATCCAGCT 


GTTTGATTTC 


TGCATCATGT 


AATTCTGTTT 


CTAATTCATC 


7920 


AC AATCT ACT 


ATTGATTCTT 


TATTTAATGC 


TTTTATCTTT 


TTCCTCTATT 


TCTTTTAATT 


7980 
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TCTTTGCGAT 


TGCGGCAATC 


ACAGGAACGG 


TTACACTATT 


ACCAACTTCT 


TTATAGAOCT 


Ov**\J 


GACTATTAAT 


AGAGACTTTT 


CTAGCAGCTT 


CAAAAGCCTA 






A1 Oft 


GAAAACACTC 


TTTAGGAGTG 


ATTCGTCGTA 


TTCTCAAACG 


GTAAAATTGT 


CCATCTATTA 


pi £n 
O iOU 


AAACACCAGC 


TACTTGGTAA 


ACTTGTTTAT 


CTTCTCCTTC 


ATAGCTAGCC 


ACT A CT ACTC 




CCATTTGACC 


ACTAGTTGTT 


AACGTATTAG 


CTATACCTTT 


TCCAACTCTA 


CCACGACGAT 


o2oD 


ACTGAGAACT 


TGGTCTTTCT 


AAATTGATTG 


AATCCCCAAT 


CTCTGCTTGA 


GCATATCCTT 


8340 


TTTTCCTTGC 


TTCCCCTACT 


TTTAGAAATT 


GGATTGGTTC 


TGGAATTAGT 


ATTTTGGGGA 


8400 


TTTTATCTCC 


TCCTTGCATC 


GTAGTCAGTG 


TTGGAGATAA 


GCCCTCACTT 


CCATAGACAC 


8460 


GACCTGTCTC 


CTTAAAGCTA 


GTCGGTAAAT 


CTCCAACAAC 


GACAATGCCA 


TAACGATCCT 


8S20 


GAGTATTTAA 


AGTAAACATC 


GGCTCTTGAT 


TTTCCTTAAA 


GCGTCTCCCA 


TTT'l'GTCrCT 


8580 


TGTCTAATCT 


ATCTGCTCTC 


ATACAAGGAA 


TCGCAACTTT 


AAATCCTTCT 


CCTTTACCAC 


8640 


GAACTAAGGT 


TGGCGCAAGA 


CCTTCTGAAT 


AATAGACTTT 


ACCGCTCATT 


CCAC'ri'CTTG 


8700 


ATGGATTCAA 


ATTTCCTAGT 


GCTTTCAAAG 


TCTCAGACTT 


AGTTCCTTGA 


CCTTCTCCTC 


8760 


TGAAAGGAAA 


TAAGAGTCTG 


GTACCTTTCT 


TTCTAGAATG 


TCCGATAATA 


AACACCCTCT 


8820 


CTCTCTTTTT 


GGGAACGCCA 


AAATCCTTAC 


TCTTAAGCAC 


CTGCCACTCA 


ACATCAAACC 


8880 


CCAACTCATC 


AAGTGTCCTA 


AGTATTGTGG 


TGAACCTCCC 


TCCCTTATCG 


TGATTGAGTA 


8940 


GCCCTTTAAC 


ATTTTCAAGA 


AAAAGAAAAC 


CTGGTTGGAT 


TTGTTTGGCC 


GCCCGAGCAA 


9000 


TTTCAAAGAA 


CAAAGTTCCT 


CTAGTATCTT 


CAAATCCCAA 


TCGTCTTCCT 


GCGATTGAAA 


9060 


ATGCTTGACA 


AGGGAATCCC 


CCACAGATGA 


CATCGACTTT 


CCCTCTAAGT 


TTTTTAAATT 


9120 


CGTCATCTGA 


AACATCTCGT 


ATGTCATGAA 


ATTCTATTTC 


TCCTTCCGTT 


TGAAAAATGG 


71BO 


ACTTATAAGA 


TTTCCTACCA 


AATTTATCAA 


TCTCACAAAA 


TCCCAAGCAC 


TCATGCCCTT 




GAGCTTCCAT 


TCCCATCCTA 


AAGCCTCCTA 


TCCCAGCAAA 


TAAATCTAAA 


ACCCAAATCA 


9300 


TTCATACCTC 


TCTCAACTAG 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


9360 


TCCTCCTCAT 


GAGGTCAGTT 


TTACTTTCTG 


CTCTTCCACT 


ATCGTTTTTC 


CTCGCTAGAT 


9420 


TTCCTCAAAA 


CGCCAGACTC 


CTCCCTTGGT 


TCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


9480 


CTTTAATGCA 


TCATTAACGA 


CGCTTTTCTT 


CTAGGTGGTT 


CATAAGGAAC 


AGGAAGATTC 


9540 


AGCTTGACTT 


TTCTAATCCT 


AGAATAAAGT 


GCTGAAAACA 


ATTCGGAATA 


GGCATAGAGA 


9600 


CTAGACAATT 


TGAGGAGCTG 


CTTGCGTCCT 


GTTCGAACAC 


ATTTTCCTAC 


C ACG TGAAG A 


9660 


AAAAGATGGC 


GGAAGCGTTT 


GATTGTTAAA 


GTTTGGAAGT 


CACCTCCAGC 


TAGATGTTTG 


9720 
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AGAAAAAGAT AGAGATTGTA GGCGATACAG CTCATCATCA TACGAACTCG TTTTTGATTA 9780 

AGGTTGAACT ATCCGTTTTA TCGCCAAAAA ATCCCTCCTT CATCTCCTTG ATGAAATTCT 9840 

CGGCTTGACC ACGTCCACCA TAAACCTGAA ACTGGTCTTG GCTTGTTCCG GTACCGA 9897 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



4 



CCGTCGAACA 


AGCCAAGACC 


AGTTTCAGCT 


TTATCCTGGA 


CGTGGTCAAC 


CCGAGAATTT 


60 


CATCAAGGAG 


ATGAAGGAGG 


GATTTTTTGG 


CGATAAAACG 


GATAGTTCAA 


CCTTAATCAA 


120 


AAACGAAGTT 


CGTATGATGA 


TGAGCTGTAT 


CGCCTACAAT 


CTCTATCTTT 


TTCTCAAACA 


180 


TCTAGCTGGA 


GGTGACTTCC 


AAACTTTAAC 


AATCAAACGC 


TTCCGCCATC 


TTTTTCTTCA 


240 


CGTGGTAGGA 


AAATGTGTTC 


GAACAGGACG 


CAAGCAGCTC 


CTCAAATTGT 


CTAGTCTCTA 


300 


TGCCTATTCC 


GAATTGTTTT 


CAGCACTTTA 


TTCTAGGATT 


AG AAAAGTC A 


ACCTGAATCT 


360 


TCCTGTTCCT 


TATGAACCAC 


CTAGAAGAAA 


AGCGTCCTTA 


ATGATGCATT 


AAAGAACAGT 


420 


CGAGATGAAA 


AAATCGTGTG 


ACGAACCAAG 


GGAGGAGTCT 


GCCCTTTTGA 


GGAAATCTAG 


480 


CGAGGAAAAA 


CGATACTGGA 


ACAGCAGAAA 


GTAAAACTGA 


CCTCATGAGG 


AGGAAGAAAG 


540 


TGGCTCATGA 


GGTCAGGGGT 


TTTGTAAGTT 


ACATCTAGTT 


GAGAGAGGTA 


TGAATGATTT 


600 


GGGTAAATAC 


AATGAGCTTG 


AAAGAAGTAC 


CAAACTCACC 


AAGCGCCAAT 


TCTTTGAGAA 


660 


TCAGATGCTG 


GATTATACCA 


TCATTGCGCA 


TGAGAGTTTT 


GAAATCATCC 


GTCATTCTGT 


720 


CTACCAGACA 


GATGATCGTG 


AAGTGGAAAA 


TGCTCTGGCT 


TTTGAAGTGA 


AAAATGATGA 


780 


AACAGACAAG 


CTGATTCTCT 


TATTAAGCGA 


GGATATTGGT 


CTAGCTGAAA 


AATTGTGCCT 


840 


CCTTGACGGA 


ACAAAAATGC 


GTGGAAAATG 


TTTAGTATAT 


GATAAAATAA 


ATGAGAGAAT 


900 


GATTCCCTTG 


CAGTGCTAGA 


AATAGGCATT 


TTGAATAGTG 


AATATGTTAT 


AATAAGTATT 


960 


AGTAGCAGGT 


GTTTTAGATT 


GGACAAGAAA 


CTGACCATAA 


AAGACATTGC 


GGAAATGGCT 


1020 


CAGACCTCGA 


AAACAACCGT 


GTCATTTTAC 


CTAAACGGGA 


AATATGAAAA 


AATGTCCCAA 


1080 


CAGACACGTG 


AAAAGATTGA 


AAAAGTTATT 


CATCAAACAA 


ATTACAAACC 


GAGCATTGTT 


1140 


GCGCGTAGCT 


TAAACTCCAA 


ACGAACAAAA 


TTAATCGGTG 


TTTTGATTGG 


TCATATTACC 


1200 


AACAGTTTCT 


CAAACCAAAT 


TCTTAAGGGA 


ATTGAGGATA 


TCGCCAGCCA 


GAATGGCTAC 


1260 
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f*AftCtTAA*TGA 


TAGGAAATAC 


TAATTACAGC 


CAAGAGAGTG 


AGGACCGGTA 


TATTGAAAGC 


1320 


M> IvC 1 iV.lv.1 


TGGGAGTACA 


CGGCTTTATT 


ATTCAGCCGA 


CCTCTAATTT 


ccgaaaatat 


1360 


TfTCCTATCA 


TCCATGAGAA 


AAAGAAGAAA 


ATGGTCTTTT 


TTGATAGTCA 


gctctatgaa 


1440 






AAC C AAT AAC 


TATGATGCCG 


TTT ATG A CAT 


CACCCAGTCC 


1S00 


TuTAl\.unAA 


AArvrTTATGA 


A A * A W A W 


TTGATTACAG 


CGGATACGAC 


TCCTTTGAGT 


1560 


AL TLuOAl avj 




Tf^flTTTTGTG 

A OV A A * * ** * W 


GATGCTTTAA 


CAGATGCTAA 


TATGCGTCAC 


1620 


GCCAGTCTAA 


CCATTuAAbA 


i AAuUt A /\V»w 


AATTTGGAAC 


AAATTAAGGA 


ATTTTTACAA 


1680 


AAAGAAATCG 






GTATTTATCC 


CTAACTGTTG 


GGCCCTACCT 


1740 


CTAGTCTTTA 


V-vJk* i I A I tAA 


Af7 AftTTCAAT 


TATAACTTGC 


CACAAGTTGG 


GTTGATTGGT 


1800 


TTTCACAATA 




TTCCTTTTCT 


TPTCCAAGTG 


TTTCGACGCT 


GGTTCAGCCC 


1860 


TCCTTTGACG 


AAGGACAACA 


VfGwTAvAAAb 


nl A J A A v» 


ACCAGATTGA 


ACGTCGCAAT 


1920 


CAAGAAGAAA 


GGCAAwAAli A 


CTTGGATTGT 


AGTGTGAATT 


GGAAACAGTC 




1980 


AATGAAGCAA 


AATGACTTGC 


AATCTCTGTT 


AAf!AAATAAA 


ATAATCCCAC 


CTAGAACAAG 


2040 


CTAGGTGGGA 


TT ATTTvit- LTT 


A A A uAu 


AAATTATGGG 


AGCAAGCTCC 


TAAATCAACT 


2100 


GTTTTTGATC 


TACTTCTTTA 


1 /\w A A A 


AAAAGTTATA 


GAAGTAGGCC 


aaacttgaaa 


2160 


TGATGGTTAC 


GACTAWiAAT 


& TTT AAA ATT 
nl l\*AAnnl A 




AGGG TTGGTT 


AAAAGTTGTG 


2220 


^ m — ft ft rr* ft m 

GAAAGGATAT 


UAuuAuAAAu 


AAuAUUUIb a V* 


CGTTGAGGAC 


ACGTATCCGT 


TTTGATTGTA 


2280 


TTTTCTCAAG 


TCCTTTATTG 


Aftr^nfARCAA 


GAAAGAGGAG 


TAGGAGTAGT 


AAAACTGTAT 


2340 


GAGAAATAGC 


1\V. iVjAAU A n, 


AnnnPGAAGA 


AAAGGAAAAT 


ACTCATAAAA 


ACATGAATGA 


2400 


TCAGTAGTCT 


nut X AW A 1 


TTGATAAGGC 


ACCTCCTAAT 




TTTAGCTCTT 


2460 




GTCACTCCAC 


AATATGTATC 


ATCACTCCGA 


AAAAGAAAGC 


TCCCAGTATA 


2S20 


GTTTTAAAAA 


TATGTTTTCT 


ATTTAGAAGA 


GAACTGATAA 


AATTTGGATT 


TTCAL l JTj I A' 


« 3BU 


AGGGTATCAA 


TGAGTGGAAT 


TATAAAAAAT 


ATCACTGTTC 


CATAAATCGA 


ACCTCCTTTC 


2640 


AGACCAGGAT 


AACGTAACTG 


TTTCTTTTCT 


TTTTTCATGA 


GTTTCCTCCT 


AATCCTCATC 


2700 


TTGATTTTTC 


TTAGTTTTTG 


CAATGCGACC 


GGACATGAGG 


AACTGTATGC 


TCGCTCCGAA 


2760 


GAAAATACAA 


CCGAGAATAC 


TTGATACACC 


ATTTCTTATA 


GTGAGAAGAG 


AATGAAAATA 


2820 


CTCCTGACCT 


TCATCTATGA 


GTATCCTGAG 


AAGAGGAGTT 


ATAAAAAACA 


TCCATAGACC 


2880 


AAAGAACAAA 


CCTGCTTTCA 


GACCTGGGTA 


GTGTAGTTGC 


TTCCTTTCTT 


TCTCATTCAG 


2940 


CATATCTGGT 


TCAATGACTG 


TCATGCCTGT 


TTT T TTCATT 


TGGTAGGTGA 


CATAGCCAGA 


3000 
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AGCCATGACG 


GCAATCACTA 


AAATCAGAGG 


AGGATAGATT 


AGAGCCACTT 


CTTGAGGGTA 


3060 


TTTATAGGCC 


AGAAGGAGTG 


GAATAAGATT 


TCCGAAAATC 


ATCAGATAAA 


AGAGGATGAT 


3120 


AAAGACTTCG 


TTCCCAATAC 


TATCGGCCTC 


ACGCCGTTTG 


TATTCGTCAA 


GGGGACCAGA 


3180 


AATACCGTAT 


GTGCGTTTGA 


TCAGTTTTTC 


AGTGAAGGTT 


TCTTTTTTCA 


TGAGTTTGCT 


3240 


CCTTTTTTAA 


AAATCTTCCT 


CCCAAAAGAG 


ACTGTTGAGG 


TCAGTTTGGA 


GGCTCCGGGC 


3300 


GAGATTCAGA 


CACAGTTCCA 


AGGTTGGATT 


GTACTTGTCG 


TTTTCAATCA 


TATTGATAGT 


3360 


CTGTCTCGAG 


ACACCGATAT 


CCTTGGCGAG 


TTCCAGCTCG 


GAAATACCCA 


ATTCCTTGCG 


3420 


AAATTCTTTC 


ACACGATTCA 


TCTGTTCTCC 


TTTCTGATTT 


ATGTCGTATA 


TATTTGACTA 


3480 


TATTATAGTC 


TTTTAAACAT 


AAAGTGTCAA 


GTATTTTTGA 


CATATTrTTT 


GAAGAAATAG 


3S40 


TAGTCTCCTT 


GTCCTATTTG 


TCTGACAAGT 


GCAAGCTGGT 


CGGATTTCTG 


GTAAAATACA 


3600 


TAAGATATGA 


CAAAAGAATT 


TCATCATGTA 


ACGCTCTTAC 


TCCACGAAAC 


GATTGATATG 


3660 


CTTGACCTAA 


AGCCTGATGG 


TATCTACGTT 


GATGCGACTT 


TGGGCGGAGC 


AGGACATAGC 


3720 


GAGTATTTAT 


TAAGTAAATT 


AAGTGAAAAA 


GGCCATCTCT 


ATGCCTTTGA 


CCAGGATCAG 


3780 


AATGCCATTG 


ACAATGCGCA 


AAAACGCTTG 


GCACCTTACA 


TTGAGAAGGG 


AATGGTGACC 


3840 


TTTATCAAGG 


ACAACTTCCG 


TCATTTACAG 


GCATGTTTGC 


GCGAAGCTGG 


TGTTCAGGAA 


3900 


ATTGATGGAA 


TTTGTTATGA 


CTTCGGAGTC 


TCTAGTCCTC 


AATTACACCA 


GCGTG AG CGT 


3960 


GGTTTTTCTT 


ATAAAAAGGA 


TCCCCCACTG 


GACATGCGGA 


TGAATC AGG A 


TGCTAGCCTG 


4020 


ACAGCCTATG 


AAGTCGTGAA 


CAATTATGAC 


TATCATGACT 


TGGTTCGTAT 


TTTCTTCAAG 


4080 


TATGGAGAGG 


ACAAATTCTC 


TAAACAGATT 


GCGCGTAAGA 


TTGAGCAACC 


GCCTGAAGTC 


4140 


AAGCCGATTG 


AGACAACGAC 


TGACTTACCA 


GAGATTATCA 


AGTTGGTCAA 


ACCTGCCAAG 


4200 


GAACTCAAGA 


AGAAGGGGCA 


TCCTGCTAAG 


CAGATTTTCC 


AGGCTATTCG 


AATTGAAGTC 


4260 


AATGATGAAC 


TGGGAGCGGC 


AGATGAGTCC 


ATCCAGCAGG 


CTATGGATAT 


GTTGGCTCTG 


4320 


GATGGTAGAA 


TTTCAGTGAT 


TACCTTTCAT 


TCCTTAGAAG 


ACCGCTTGAC 


C AAGC AATVG 


4380 


TTCAAGGAAG 


CTTCAACAGT 


TGAAGTTCCA 


AAAGGCTTGC 


CTTTCATCCC 


AGATGATCTC 


4440 


AAGCCCAAGA 


TGGAATTGGT 


GTCCCGTAAG 


CCAATCTTGC 


CAAGTGCGGA 


AGAGTTAGAA 


4500 


GCCAATAACC 


GCTCGCACTC 


AGCCAAGTTG 


CGCGTGGTCA 


GAAAAATTCA 


CAAGTAAGAG 


4560 


GGAAAAAGAT 


GGCAGAAAAA 


ATGGAAAAAA 


CAGGTCAAAT 


ACTACAGATG 


CAACTTAAAC 


4620 


GGTTTTCGCG 


TGTGGAAAAA 


GCTTTTTACT 


TTTCCATTGC 


TGTAACCACT 


CTTATTGTAG 


4660 


CCATTAGTAT 


TATTTTTATG 


CAGACCAAGC 


TCTTGCAAGT 


GCAGAATGAT 


TTGACAAAAA 


4740 


TCAATGCGCA 


GATAGAGGAA 


AAGAAGACCG 


AATTGGACGA 


TGCCAAGCAA 


GAGCTCAATG 


4800 
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AACTATTACG 


TGCAGAACGT 


TTCAAAGAAA 


TTGCCAATTC 


ACACGATTTG 


CAATTAAACA 


4660 


ATGAAAATAT 


TAGAATAGCG 


GAGTAAGATA 


TGAAGTGGAC AAAAAGAGTA ATCCGTTATG 


4920 


CCACCAAAAA 


TCGGAAATCG 


CCGGCTGAAA 


ACAGACGCAG AGTTGGAAAA AGTCTGAGTT 


4980 


TATTATCTGT 


CTTTGTTTTT 


GCCATTTTTT 


TAGTCAATTT 


TGCGCTCATT 


ATTGCGACAG 


S040 


GCACTCGCTT 


TGGAACAGAT 


TTAGCGAAGG 


AAGCTAAGAA CGTTCATCAA ACCACCCGTA 


5100 


CAGTTCCTGC 


CAAACCTCGG 


ACTATTTATG 


ACCGAAATGG 


AGTCCCGATT 


GCTGAGGATG 


5160 


CAACCTCCTA TAATCTCTAT 


GCCGTCATTG 


ATGAGAACTA 


TAAGTCAGCA 


ACGGGTAAGA 


5220 


i rx ACGT 




CAATTTAACA 


AGGTTGCAGA 


GGTCTTTCAT 


AAGTATCTGG 


5280 


ACATGGAAUA 


x TiTT KTfyrik 
AIV.L lAluin 


AGAGAGCAAC 


TCTCGCAACC 


TAATCTCAAG 


CAAGTTTCCT 


5340 


TTGGAGCAAA 


VMiAAAlViuv 




CCAATATGAT 


GTCTATCAAA 


AAAGAATTGG 


5400 


AAGCTGCAGA 




ATTGATTTTA 


CAACCAGTCC 


CAATCGTAGT 


TACCCAAACG 


5460 


GACAATTTGC 


TTCTAGTTTT 




CTCAGCTCCA 


TGAAAATGAA 


GATGGAACCA 


5520 


AGAGCTTGCT 






GTTCCTTCAA 


CAGTATTCTT 


GCACGGACAG 


5580 


ACbGCAnAT 




AAGGATCGTC 


TGGGTAATAT 


TGTACCCCGA 


ACAGAACAAG 


5640 


TTTCCCAACG 


AACGATGGAC 




TTTATACAAC 


CATTTCCAGC 


CCCCTCCAGT 


5700 


CCTTTATGGA 


AACCCAGATG 


CATGCTTTTC 


AAGAGAAGCT 


AAAAGGAAAG 


TACATGACAG 


5760 


CGACTTTGGT 


CACTGCTAAA 


ACAGGGGAAA 




aapgcaacga 


CCCACCTTTG 


5820 


ATGCAGATAC 


AAAAGAAGGC 


ATT AC AG AGG 


actttctttg 


GCGTGATATC 


CTTTACCAAA 


5880 


GTAACTATGA 


GCCAGGTTCC 


ACTATGAAAG 


TCATGATGTT 


GGCTGCTCCT 


ATTGATAATA 


5940 


ATACCTTTCC 


AGGAGGAGAA 


GTCTTTAATA 


CTAGTGAGTT 


AAAAATTGCA 


GATCCCACGA 


6000 


TTCGAGATTG 


GGACGTTAAT 


GAAGGATTGA 


CTGGTGGCAG 


AACGATGACT 


TTTTCTCAAG 


6060 


GTTTTGCACA 


CTCAAGTAAC 


GTTGGGATGA 


CCCTCCTTGA 


GCAAAAGATG 


GGAGATGCTA 


6120 


CCTGGCTTGA 


TTATCTTAAT 


CGTTTTAAAT 


TTGGAGTTCC 


GACCCGTTTC 


GGTTTGACCG 


6180 


ATGAGTATGC 


TGGTCAGCTT 


CCTGCGGATA 


ATATTGTCAA 


CATTGCGCAA 


AGCTCATTTG 


6240 


GACAAGGGAT 


TTCAGTGACC 


CAGACGCAAA 


TGATTCCTGC 


CTTTACAGCT 


ATTGCTAATG 


6300 


ACGGTGTCAT 


GCTGGAGCCT 


AAATTTATTA 


GTGCCATTTA 


TGATCCAAAT 


GATCAAACTG 


6360 


CTCGGAAATC 


TCAAAAAGAA 


ATTGTGGGAA 


ATCCTGTTTC 


TAAAGATGCA 


GCTACTCTAA 


6420 


CTCGGACTAA 


CATGGTTTTG 


GTAGGGACGG 


ATCCCCTTTA 


TGGAACCATG 


TATAACCACA 


6480 


GCACAGGCAA 


GCCAACTGTA 


ACTGTTCCTG 


GGCAAAATGT 


AGCCCTCAAG 


TCTGGTACGG 


6540 
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CTCAGATTCC 


TGACGAGAAA 


AATGGTGGTT 


ATCTAGTCGG 


GTTAACCGAC 


TATATTTTCT 


6600 


CGGCTGTATC 


CATGAGTCCG 


GCTGAAAATC 


CTGATTTTAT 


CTTGTATGTG 


ACGGTCCAAC 


6660 


AACCTGAACA 


TTATTCAGGT 


ATTCAGTTGG 


GAGAATTTGC 


CAATCCTATC 


TTGGAGCGGG 


6720 


CTTCAGCTAT 


GAAAGACTCT 


CTCAATCTTC 


AAACAACAGC 


TAAGGCTTTA 


GAGCAAGTAA 


6780 


GTCAACAAAG 


TCCTTATCCT 


ATGCCTAGTG 


TCAAGGATAT 


TTCACCTGGT 


GATTTAGCAG 


6840 


AAGAATTGCG 


TCCCAATCTT 


GTACAACCCA 


TCGTTGTGGG 


AACAGGAACG 


AAGATTAAAA 


6900 


ACAGTTCTGC 


TGAAGAAGGG 


AAGAATCTTG 


CCCCGAACCA 


GCAACTCCTT 


ATCTTATCTG 


6960 


ATAAAGCAGA 


GGAGGTTCCA 


GATATGTATG 


GTTGGACAAA 


GGAGACTGCT 


GAGACCCTTG 


7020 


CTAAGTGGCT 


CAATATAGAA 


CTTGAATTTC 


AAGGTTCGCG 


CTCTACTGTG 


CAGAAGCAAG 


7080 


ATCTTCGTGC 


TAACACAGCT 


ATCAAGGACA 


TTAAAAAAAT 


TACATTAACT 


TTAGGAGACT 


7140 


AATATGTTTA 


TTTCCATCAG 


TGCTGGAATT 


GTGACATTTT 


TACTAACTTT 


AGTAGAAATT 


7200 


CCGCCCTTTA 


TCCAATTTTA 


TAGAAAGGCG 


CAAATTACAG 


GCCAGCAGAT 


GCATGAGGAT 


7260 


GTCAAACAGC 


ATCAGCCAAA 


AGCTGGGACT 


CCTACAATGG 


GAGGTTTGGT 


TTTCTTGATT 


7320 


ACTTCTGTTT 


TGGTTGCTTT 


CTTTTTCGCC 


CTATTTAGTA 


CCCAATTCAG 


CAATAATGTG 


7380 


GGAATGATTT 


TGTTCATCTT 


GGTC TTGT AT 


GGCTTGGTCG 


GATTTTTAGA 


TGACTTTCTC 


7440 


AAGGTCTTTC 


GTAAAATCAA 


TGAGGGGCTT 


AATCCTAAGC 


AAAAATTAGC 
GCGATATCCT 


TCTTCAGCTT 


7500 
7S60 


CTAGGTGGAG 


TTATCTTCTA 


TCTTTTCTAT 


GAGCGCCGTG 


CTCTGTCTTT 


GGTTATCCAG 


TTCATTTGGG ATTTTTCTAT 


ATTTTCTTCG 


CTCTTTTCTG 


GCTAGTCGGT 


7620 


TTTTCAAACG 


CACTAAACTT 


GACAGACGGT 


GTTGACGGTT 


TAGCTAGTAT 


TTCCCTTCTC 


7680 


ATTAGTTTGT 


CTGCCTATGG 


AGTTATTGCC 


TATGTGCAAG 


GTCAGATGGA 


TATTCTTCTA 


7740 


GTGATTCTTG 


CCATGATTGG 


TGGTTTGCTC 


GGTTTCTTCA 


TCTTTAACCA 


TAAGCCTGCC 


7800 


AAGGTCTTTA 


TGGGTGATGT 


GGGAAGTTTG 


GCCCTAGCTG 


GGATGCTGGC 


AGCTATCTCT 


7860 


ATGGCTCTCC 


ACCAAGAATG 


GACTCTCTTG 


ATTATCGGAA 


TTGTGTATGT 


TTTTGAAACA 


7920 


ACTTCTGTTA 


TGATGCAAGT 


CAGTTATTTC 


AAACTGACAG 


GTGGTAAACG 


TATTTTCCGT 


7980 


ATGACGCCTG 


TACATCACCA 


TTTTGAGCTT 


GGGGGATTGT 


CTGGTAAACG 


AAATCCTTGG 


6040 


AGCCAGTGGA 


AGGTTGACTT 


CTTCTTTTGG 


GGAGTGGGAC 


TTCTAGCAAG 


TCTCCTGACC 


8100 


CTAGCAATTT 


TATATTTGAT 


GTAAGAATGG 


CACCCTGATG 


TTTCAGGG 




8148 


(2) INFORMATION FOR SEQ ID NO: 12: 









(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9909 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TACTCCACCC TTAATATCCG TTCCTGTAAA TAC TTTACCG CTTTTAAGTT CATAGAATTG 60 

AACTTTTAAA TGCTTGTCTT CAAGCATCTT TTCCATCCAA TTTTTAGGAC TTTGACCAGC 120 

TTTAAATAAA AACCTTGCTG GGGTGATTAG TATAGATTTA TCTGCGATTT TATAAGCTTC 180 

ATCAATAAAA TAGTGATATA TCGGCTCATC TCTGGCTTCT CCTGTTTCCT GATAGGGAGG 240 

ATTTCCTATC ACGACATCAA ATTTCATTTC ACTTTCCTCG CTAGATAGGC GCTCAAAACC 300 

TATCATTCTA TTCTTTTTCC AGTCTTTGAT ATGGGTTTTA GATTCTTCTA CTTCTTGGAC 360 

TTCTAGCTCA TCCGCAAACA AACTCAATTC TTGAGATTGC TTTTCTTTAG CTGAATAAGG 420 

ACTACTTTTT TTCAATCCAT CCATCTGAAA GACATTGTAA GAGATAATAG TCGCAATTTC 460 

TTTCTTTTGC TCTAATGTTG GTTGATTTCC AGTCTTAGCT AGATAATAGT CCTCAAAAGT 540 

TGCCAAAAGA TTCTCACGCG CCAAAAGGAG AGAATCTCCT TGATACTCAT AACCATACGA 600 

AGCATGATAA GCATCTTTTA CAACTTTATA AAATGTCACT TCATCTCAAA CCTCACGACT 660 

AATCCCTTGC AGTTTTCTAT CAACAAAACC AACTCGCTCA GATAATGGAA TTTCCTCACC 720 

AGTTACGGTA TCATATCTCG TTACCATATA AGGTGCTTCA CCACAAGTTA CCTCTAACCA 780 

TCGTAACTCC ACATACTCCT CAAGACTTAA CGAGCCTAAT TTCGATTCTA CATATCCATT 840 

TTGCTTTGCG ACCAACCACG TTGGTGTAAA CACTTCTGCC CTTATTTTTC TCCGATCTTT 900 

TTGTTCATAT TTGGATTTTT CAGATCTGCC CTGAATCAAG TTCGCAAAGT TTCCAGTAAC 960 

CTTACTTGGA TTGATGCGAT CACTTGGAGC AAATCCCTTT CCTAACAATT CATAAGAATG 1020 

CGTAnGCCAA ACAATTGATT TCTTTGTCGT TCGATCTTTT AAAAGAATTT TTAATAAGTC 1080 

AGCCGATTCT TTAGCCAAAC TTTCTTCACT AATATCTATT GTCATCAGCA ACCTCTCTTA 1140 

TATTGTAAGC CCTATTATAT CATATTTTAA AGAATGAAAA TTTACTTGAA AAAAGTAATT 1200 

CAATAAATAT CTCTCCGATG ACCAACTTCT AGAGTAGCAA CGACTAATTC ATCATCTACA 1260 

ATTTGTACGA TAACTCGATA ATTACCAATT CTATACCGCC ATTGACCAAC GCGATTACCA 1320 

ACCAAAGCCT TTCCGTGTCG TCTTGGGTCT TCCAAAACAT TGGTTTGTAA ATAGTTTGTA 1380 

ATTAGCTTCT CCGTATAACG GTCCAATTTT TTCAATTGCT TGATAAAACG TCTT G TTCGA 1440 

ACTAATTTAT ACAAATTATT CATCCTTCAA GCCTAAATCA TGCATCATTT CTTCCCAAGT 1500 

AATGGGTTCA ACTCCTTTTT CCAACTCTTC TAAATACTCT TGATAGGCTA AATCTGCCAC 1560 



OP1P0^1A9 I t 
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ACGACCATCG 


TATTCATCTT 


CTAGGGCTTC 


AAGACTTTTG 


GTGCGAATAA 


GTTCCGAAAG 


1620 


GGAAACTCCT 


TCAAACTTAG 


CCATTGCTTT 


CATAAATGTT 


TTATCAGCTT 


CAGAAACTTT 


1680 


TAATGTAATA 


GTAGTCATCT 


TTTGTGCTCC 


CTTTTTTAAT 


GGTAACACCA 


TTGTATTACT 


1740 


TTTTACGTGT 


TCAGTCAATA 


TAAAAAGAAC 


ACCTTCTCAG 


CGTTCTTTCT 


ATATCTCTGT 


1800 


CAATCGTGTT 


GCGGTATCTG 


GTGAGGTATC 


ATAAACCTTA 


AAGTCTACTC 


CGACTCCCAG 


1860 


ATCAGCTTGA 


GCCACCTGAT 


TGACCATGGT 


CATATGAGCC 


AGTTCCTTGA 


TATTGTTTTC 


1920 


CTTAGATAAA 


TGCCCAAGGT 


AAATCTTCTT 


AGTACGATTT 


CCTACCGTCC 


GAATCATAGC 


I960 


TTCAGCACCG 


TCCTCGTTAG 


AAAGGTGACC 


AAGGTCAGAT 


AGGATTCGTT 


CTTTGACTCG 


2040 


CCAAGCGTAA 


GAACCTGATC 


GCAAAATCTC 


TACATCATGG 


TTGGCCTCGA 


TAAGATAACC 


2100 


ATCCGCATTT 


TCGACAATGC 


CCGCCATACG 


GTCACTGACA 


T AACCTGT AT 


CTGTCAAGAG 


2160 


GACAAAACTC 


TTATCATCCT 


TCATAAAGCG 


ATACAACTCC 


CGTGCGACTC 


CATCATGCCT 


2220 


TACACCAAAA 


CTCTCGATGT 


CGATATCTCC 


AAAGGTTTTG 


GTTTTACCCA 


TTTCAAAAAT 


2280 


ATCCTTTTGC 


GAAGAATCCA 


CCTTGCCAAG 


ATATTTACTA 


TTTTCCATAG 


CTTGCCAGGT 


2340 


CTTTTCATTG 


GCATAAAGAT 


CCATACCATA 


CTTCCCAGCC 


AAAACGCCTA 


CTCCATGGAT 


2400 


ATCATCTGAA 


TGCTCATGGG 


TAATCAACAT 


GGCATCCAGG 


TCTTCTGGCT 


TACGGTTAAT 


2460 


TTCAGCTAGC 


AGACTGGTAA 


TTTTCTTGCC 


AGACAACCCT 


GCATCTACTA 


AAAGCTTCTT 


2520 


TTTTGAGGTT 


TCCACATAAA 


AAGAATTTCC 


ACTGGAACCC GACGCTAAAA 


TACTGTATTT 


2S80 


AAAGCCTATT 


TCACTCATTC 


TAGTCTTCTA 


CTTCATCCTC 


CCATACTTCT 


TCTTTCACTG 


2640 


CATCCTTATC 


ATAAGGGAGT 


ACAATGGTAA 


AGGTTGAACC 


CTTGCCGTAT 


TCACTCTTGG 


2700 


CCCAAATAAA 


GCCCTTATGT 


TGTTTGATAA 


TTTCTTTAGC 


GATAGACAGT 


CCTAGACCTG 


2760 


TACCACCTTG 


TGCACGACTT 


CTAGCACGAT 


CCACACGATA 


GAAACGGTCA 


AAGATACGTG 


2820 


GTAAATCCTG 


CTTAGGAATC 


CCCAAACCGT 


CCTCACAAAT 


GGATAAAATC 


ATCTGGTCTT 


2880 


CAGTTGTCTT 


CATTCTGACA 


GTGATTTTAC 


CCCCATCTGG 


CGAATACTTA 


ATAGCATTAT 


2940 


TTAAAATATT 


GTCGACAACC 


TGCGTCATCT 


TATCTGTATC 


AATTTCCATC 


CACATAGAAT 


3000 


TGATGGGATA 


ATCTCTCACC 


AACTCATATT 


TTTTCTCCTT 


TTCCTGTCCT 


TTCATCTTGT 


3060 


CAAAACGATT 


GAGGATAAAG 


GTAATAAAAG 


CAGTGAAGTT 


AATCAGTTCC 


ACATCTAGGT 


3120 


GACTGGTAGC 


ATTATCAATA 


CGTGAAAGAT 


GGAGGACATC 


CGTCACCATG 


CGCATCATAC 


3180 


GGTTGGTCTC 


ATCAAGAGAA 


ACCTTGATAA 


AGTCTGGTGC 


TACAGTTTCA 


CACAAAGCCC 


3240 


CCTCATCCAA 


GGCTTCAAGA 


TAGGATTTTA 


CGCTAGTCAC 


AGGAGTCCGT 


AACTCATGGC 


3300 


TAACATTGGA 


AACAAAGAGT 


CTTCGTTCGC 


GTTCTTC CTT 


CTCCTGCTCC 


GTCGTATCAT 


3360 
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GCAAAACAGC CACCAAACCT CAAATAAAGC CAGACTCTCG ACGTATCAAG GCAAAGCGAA 3420 

CTCGAAGGTT CAAATATTCG CCATTGATAT CTTGGGAATC TAGCAACAAT TCTCGA C TTT 3480 

GGGTAATCAA ATCACGCAAT TCATAGTTTT CTTCTATCTT GAGCAATTCC AAAATGCTTC 3S40 

TATTCAGAAC ATCTTCCTTA ACCAACCCCA GTTGCTTCTT GGCTGTATCG TTAATCATGA 3600 

TAATCTGACC CCGACGGTTA GTCGCAAGAA CCCCATCTGT CATATAAAAC AGAATACTAT 3660 

TTAGCCTCTT ACTCTCTTCT TCTAG A TTTT CCTGAGTGAG ACGAATAACC TCCGACAAGT 3720 

CATTCAAATT ATTGGTAATA TTGGTGATTT CAGACCCACC TTGCATATCA AGAACCTTGG 3780 

AATAATCTCC TCCAATCAAA TCTTTAACCT TTTGATTGAC TTGCTTCAAC TGAATATTAT 3840 

CACGTCTATT TTCCAGTAAT AAGAGGGTCA CAACAAGGAT GAAACCTAAC AAAATCAGGA 3900 

TAAAGATAAA ATCTCTGGTA AAAATGGTTT CTTTCACTAA ATCAAGCATT ATTTCTCATG 3960 

TAATACCCTA CACCACGGCG CGTCAAGATA TACTCTGGTC GGCTGGGCGT ATCTTCAATC 4020 

TTCTCACGCA GACGTCGTAC AGTCACATCA ACTGTACGGA CATCACCAAA ATAGTCATAA 4080 

CCCCAGACAG TCTCAACCAA GTCTTCGCGC GTGATGACTT GACCTGTATG CGATCCTAAA 4140 

TCATACAAAA GCTCAAATTC ACGATCGGTT AACTCTAGTT CTTCGCCATA TTTTTTAGCC 4 200 

ACG T AGGCGT CTGGAACAAT TTCTAAATCC CCAATTTGGA TAGGTTGAGG TTT A CTATCT 4 260 

GCTTCCTGAC CATCTACTGG CATAGGTTGA GAACGACGCA GAAGACCTTT AACACGCGCC 4 320 

TCCAACTCAC CATTGGAGAA CGGTTTTCTT ACATAGTCAT CTGCCCCAAG TTCCAAACCG 4380 

ATAACCTTAT CAAATTCACT ATCTTTGGCT GAAAGCATAA CAATGGGCAC ACTGCTT G TC 4440 

TTACGAATGG TCTTAGCAAC TTCTAAACCA TCAATTTCTG GAAGCATCAA ATCCAGAATA 4500 

ATAATATCTG GTTGCTCTGC TTGAAATTGC TCTACCGCTT CACGACCATT AAAAGCAGTT 4560 

ACAACTTCCT AACCTTCCTT GGTCATATTA AACTTGATAA TATCCCAGAT TGGTTTCTCA 4620 

TCATCTACAA TTAGTATTTT TTTCATATGT TCACCTTTTT CTCTACTATT ATACCAAAAA 4680 

AATAGTCAGA AGACACAATA GCTACTCTTG GCTACTGTCT AAGTTGGCTT GTGCATAAAC 4740 

CTGCCAGATT TTTTGTTGGG C TTT G GCAAG TGGGTAATTC TTGAATTCTT CTGGTGAAAG 4800 

CCAGCGAACT TCCCTATCTG AAAAATCATG GAAGTCACTC ACCTGACCTG CTACAATCTG 4 860 

TACATCCCAT TTTCCATGAC TAAAAACATG CTGGACTGTA TCAAAACAAA CATCAAGCCA 4920 

ATCAACATCT AGGTCATAGT CCTGCTGGAA ACTCTCTTCT GGACTGGGAC CAAAGTTCAC 4980 

ACTTTCTTCC GCAACCTGAT GAAAGAGGTC AAACTGCTCT TCTTGCGAAA AGTTATCAAC 5040 

TTCTATAAAG GGGAAATGCC AAAAACCTGC CAAGAGCTTT TCGCTTTCAT TTTTTTCAAG 5100 
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TAAAAATTGT 


CCTTGAGAAT 


TTTTCACAAC 


TAAGGCTTTA 


AGATAAATAG 


GAACCGGCTT 


5160 


TTTCTTAGGA 


GATTTAATTG 


GATAACGGTC 


CATGGTTCCA 


TTCTGATATC 


CCGCACTAAA 


5220 


CTCCTTGACT 


GGGCTTTCTT 


CAGGTCTGGG 


ATTTACAGGA 


GACTCAATAT 


CAGACCCTAA 


5280 


GTCCATCAAG 


GCTTGATTAA 


AATCACCCGG 


ACGATCCGGA 


TTAATCAAGA 


TCTCCATCAT 


5340 


TGCCTGAAAA 


ATTTTTCGAT 


TACTTGGAAT 


CCCAATATCG 


TGGTTGACTT 


CAAACAGACG 


5400 


CGCCAAGACC 


CCCATGACAT 


TACCATCTAC 


AGCTGGCTCA 


GGCAAGTTAA 


AAGCAATACT 


5460 


GGAAATGGCT 


CCTGCTGTGT 


AAGGTCCAAT 


CCCTTTCAAG 


CTGCAAATTC 


CTTCATACGT 


5520 


ATTTGCAAAT 


TCCCCACCAA 


ACTCAGTCAT 


AATCTGCTGC 


CCTGCAGCCT 


GCATATTGCG 


5580 


AACTCGAGAA 


TAATAGCCCA 


ACCCCTCCCA 


AGCTTTCACT 


AAACTCTCCT 


CAGGCCCAGT 


5640 


TGCCAGACTT 


TCGACAGTTG 


GAAACCAGTC 


CAAAAATCTT 


TCGTAGTAAC 


GGATAACTGT 


5700 


ATCCACCCTG 


GTCTGCTGAA 


GCATGATTTC 


AGATACCCAG 


ATGTGATAAG 


GATTTTTACT 


5760 


TCTCCTCCAA 


GGCAAATCTC 


TTTTGTTTTC 


ATCATACCAA 


GCCAGAAGTT 


TCTCACGGAA 


5820 


AGAAATGACT 


TTCTCCTCCG 


GCCACATGAC 


GATACCGTAT 


TCTTTCAAAT 


CTAACATATC 


5880 


TCTAGTATAA 


CACAGAAGGT 


TTCACCTGTC 


TTTGTATCTG 


ATTTATAATA 


TTTTCAATAG 


5940 


ATAGTATATA 


ACTTTTCTAT 


CTACTTATAC 


TCAATGAAAA 


TCAAAGAGCA 


AACTACGAAG 


6000 


CTAGCCGCAG 


CTTGCTCAAA 


ACACTGTTTT 


GAGGTTGTGG 


ATAGAACTGA 


CAGAGTCAGT 


6060 


ATCATATAcT 


ACGGCAAGGT 


GAAGCTGACG 


TACTTTGAAG 


AGATTTTCGA 


AGACTATAAA 


6120 


TCTTATTGAT 


GAACTGCTTG 


CACTCTGAGA 


AAAAATGAGC 


TTGGATATTA 


TTTCCAAACT 


6180 


CACTTAAACT 


CAATTTCAAT 


CCACTAGAAC 


AAGCCTAGTA 


CAGTTCCATC 


GCTTTCAACA 


6240 


TCCATGTTGA 


GAGCTGCTGC 


ACGTTTTGGA 


AGACCTGGCA 


TGGTCATAAC 


ATCACCAGTT 


6300 


AAGGCAACGA 


TGAAGCCTGC 


ACCTAATTTT 


CGTACCAATT 


CACGAATGGT 


AATTTCAAAG 


6360 


TTTTCTGGTG 


CTCCAAGCGC 


ATTTGGATTG 


TCTCAGAAAC 


TGTATTGAGT 


TTTAGCCATA 


6420 


CAGATTGCCA 


ATTTGTCCCA 


ACCGTTTTGA 


ACGATTTGAG 


CAATTTGTGT 


TTGAGCTTTC 


6480 


TTCTCAAAGT 


TCACTTTGCT 


ACCACGATAG 


ATTTCAGTGA 


CAATTTTTTC 


AATCTTTTCT 


6540 


TGGACAGAAA 


GCTCATTATC 


ATACAAACGT 


TTATAGTTAG 


CTGG AT TTTC 


AGCAATTGTC 


6600 


TTAACAACTG 


TTTCGGCAAG 


TGCTACTCCA 


CCTTCTGCTC 


CATCAGCCCA 


GACACTAGCC 


6660 


AATTCAACTG 


GTACATCGAT 


TGAGGCACAG 


ACTTCTTTTA 


AGGCTGCAAT 


TTCAGCTTCT 


6720 


GT ATC AG AT A 


CAAATTCGTT 


AATAGCTACA 


ACTGCTGGAA 


TACCGAACTT 


ACGGATATTT 


6780 


TCAACGTGGC 


GTTTCAAGTT 


AGCAAAACCT 


GCACGAACTG 


CCTCTACATT 


TTCTTCACTC 


6840 


AGAGCGTCTT 


TAGCCACACC 


ACC ATT CATC 


TTAAGGGCAC 


GAAGCCTTCC 


GACAATAACA 


6900 



WO 98/18931 



PCT/US97/19588 



229 



ACTG C ATCTC 


GACATGTTGG 


f*AAfiTTTGnT 


U IH AUA 1 HI 


raA/V*lATTT 


1 f^T*?* A t^Lf* A A 
\* A LAuUAViUA 




AGCTCCCCAC 

* ^* ^ V- V» 




TTrARTAACA 
A i l»Mu A fv\\mn 


m*nTAAt*f*An 

UlVil AA A 




GGCTGTTGTC 




VJ 1 WV>l_v» AAAA 


^ AVjAVj k 1 AV-A 


GG LA 1 uAuCu 


MIA J luOUAA 


ATGGAGI.ACG 


GTGTAGAAAG 


*) AO A 




l»u i AAA 1 1\* I 


V_ ruAALLAAu 


TTTGGCTTAA 


TAW. ATwTCTT 


LAAAATCAAA 


' i«o 


arc x Afvtf" & c 

V* V— \_ AA^fO LA V. 


rr^Tf* a & /~r**TV* 

i UAAUU *\» 


wVAATV.AcvT 


AwAGAAAWAG 




ATAGvTgATAA 


72U0 


LL. AA 1 AALGA 


TATTCGCCAA 


ACGACGTTTC 


AAGTwCTCGA 


TGTCCGTTGC 


ft ft ft ft ft ^« ft 

caagcaaaga 


7260 


ATTGCCATCA 


TTTCTGAAGC 


& Ik 0*>^0^9f% ft ft #*• ft 

AACTGTAATA 


k t t i 0*0* ik *n 

TCAAAACCAT 


CCTCACGTGG 


AATACCGTTT 


7320 


A ^ A f"*f A A 


ft ft <^ « r+r* ft * 

CAAGACCAAC 


AGTCACATGG 


CGGAGCGTAC 


GGTCGTTCAA 


GTCCACAACG 


7380 


CGTTTCCAGA 


GGATACGACG 


TTGATCAATT 


CCCAGCTCAT 


TCCCTTGGTG 


CAAGTGGTTG 


7440 


^/*» & ft ft ft 

TCAATCAAGG 


CAGAAAGGGC 


ATTGTTGGCA 


CTTCTAATAG 


CATGCATATC 


TCCAGTAAAG 


7500 


TGGAGGTTGA 


TGTCTTCCAT 


TGGCAGAACT 


TGTGCATACC 


CACCACCAGC 


AGCACCACCC 


7560 


TTGATCCCCA 


TGACTGGACC 


AAGAGACGGT 


TCGCGGATAG 


CAATCATCGT 


T'riMT'l'GCCA 


7620' 


ATCTTCTTCA 


AGGCATCCCC 


AAGACCAATG 


GTAAGCGTCC 


ACTTTCCTTC 


ACCTGCAGGT 


7680 


G * 1 (jGGTTGA 


WT*?** 0^ 0** ft 0*0** ft Ik 

TGGCAGTAAC 


CAAGATCAAT 


TTACCGACTG 


GATTGCTCTC 


AACTGCACGA 


7740 


ft 1 • 1 ' > ft ft ft 

ATTTTATCAA 


AG CTGAGTTT 


AGCCTTGTAC 


TTTCCCTACA 


ACTCCAAATC 


CTCATAAGAA 


7800 


ATACCAAGTT 


TCTCTACAAC 


ATCAACAATT 


GGCTTCAACT 


CAATACTCTG 


TGCGATTTCA 


7860 


ATATCTCTTT" 


•1*^ ft fffWft^ ft ft ft ft 

TCATTCAAAA 


TTCCTCTAAC 


CTCTTATATG 


ATAATTCATT 


AT ATC AC AAA 


7920 


ACAAGATTTT 


ft ft /"^ ft MU if ft 

TAACATCCTA 


fe K K 0*m0^+t*0*0*% m 

AAACTCTCTA 


AACGTTCGTA 


AATATCTCTG 


TTTTTAAGAC 


7980 


TTTTAGAGTC 


CTTTCTTAAA 


TTTTATATGG 


OTTTATAGTT 


IIM ■ ft ft ft MM ft fff% ft 

TUAAACTATA 


ATAAATCTTC 


8040 


y i I I I i AV^.l»»/\ 


AAAAA 1 1 A1W 


ACTTTCATTT 




TTATTTTTGT 


ft f» ft ft *f»A 
WTACAATACT 


8X00 




TTTTAGTTAC 


ft *pfwr*f*frr**r 
A 1 LuvjuLuvj I 


ArrArwAAr 

AV. LAG 1 vjAAu 


L fATCuA A AG 


UiTCCGCTCT 


olou 


ATCACTAACC 


ATTCTACAGG 


TCACTTGCGC 


AAAATTATCA 


CAGAGACTTT 


GCTTTCTGCA 


8220 


GGGTATGAAG 


TTTGTTTAAT 


TACGACAAAA 


CCAGCTCTGA 


AGCCAGAGCC 


TCATCCTAAC 


8280 


CTAACTATTC 


GAGAAATTAC 


CAATACCAAG 


GACCTTCTAA 


TAGAAATGCA 


AGAACGTGTT 


8340 


CAGGATTATC 


ACGTCTTGAT 


CCACTCAATG 


GCTGTTTCTG 


ACTACACTCC 


TGTTTATATG 


8400 


ACAGCGCTTG 


AGGAAGTTCA 


GGCTAGCTCC 


AATCTAAAAG 


AATTTTTAAG 


CAAGCAAAAT 


8460 


CATCAGGCCA 


AGATTTCTTC ■ 


AACTGATGAG 


GTTCAGGTTT 


TGTTCCTTAA 


AAACACACCC 


8520 


AAAATCATAT 


CCCTAGTCAA i 


GGAATGCAAT 


CCTACTATTC 


ATCTGATTGG 


TTTCaAACTC 


8S80 


CTGGTTGATG ' 


TTACCGAAGA 1 


TCATCTGGTT « 


GACATTGCAC 


GAAAaAGTCT 


TATCAAGAAT 


8640 
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CAAGCAGATT TAATCATCGC GAATGACCTG ACTCAAATTT CAGC&.GATCA CCACCGAGCT 8700 

ATATTTGTTG AGAAAAATCA GCTTCAAACA GTCCAGACTA AAGAAGAAAT TGCAGAACTC 8760 

CTCCTTGAAA AAATTCAAGC CTATCATTCT TAGAAAGGAA AACTATGGCA AACATTCTCT 8820 

TGGCTGTAAC GGGTTCAATC GCCTCTTATA AGTCGGCAGA TTTAGTCAGT TCTCTAAAAA 8880 

AACAAGGCCA TCAAGTCACT GTCTTAATGA CTCAGGCTGC TACAGAGTTT ATCCAACCTT 8940 

TGACACTACA GGTACTCTCA CAGAATCCTG TCCACTTGGA TGTCATGAAG GAACCCTATC 9000 

CTGATCACCT CAATCATATC GAACTTGGAA AAAAAGCAGA TTTATTTATC GTGGTACCTG 9060 

CAACTGCTAA CACTATTGCA AAACTAGCTC ACGGATTTGC GGACAACATG GTAACCAGTA 9120 

CAGCTCTAGC CCTACCAAGT CAT ATTCCCA AACTAATAGC TCCTCCTATG AATACAAAAA 9180 

TGTATGACCA TCCAGTAACT CAGAATAATC TGAAAACATT AGAAACTACG GCTATCAGCT 9240 

GATTGCTCCT AAGGAATCCC TACTAGCTTG TGGACACCAC GGACGAGGAG CTTTAGCTGA 9300 

CCTCACAATT ATTTTAGAAA GAATAAAGCA AACTATCGAT CAAAAAACGC TCTAATATTG 9360 

CACCCATTGC TATCTTTTTT GCTACCATGC TCGTGATACA CTTTCTGAGC TCACTTATCT 9420 

TTAACCTTTT TCCATTTCCA ATCAAACCGA CCATTGTTCA TATTCCTCTC ATTATTGCCA 9480 

GCATTATTTA TGGTCCACGA GTTGGGGTTA CACTTGGATT TTTGATGGGA TTACTTAGCT 9540 

TGACGGTTAA CACGATTACG ATTCTACCGA CAAGCTACCT CTTCTCTCCC TTCGTACCAA 9600 

ACGGAAACAT CTACTCAGCT ATCATTGCCA TCGTCCCACG TATTTTGATT GGTTTAACTC 9660 

CTTACTTAGT CTATAAACTG ATGAAAAACA AGACTGCTCT GATTTTAGCT GGAGCCCTTG 9720 

GTTCc TTGAC AAATACTATC TTTCTCCTTG GAGGAATCTT CTTCCTATTT GGAAATGTTT 9780 

ATAATGGAAA TATCCAACTT CTTCTGGCAA CCGTTATCTC AACAAATTCA ATTGCTGAAT 9840 

TGCTCATTTC TGCAATTCTA ACCCTAGCCA TTGTTCCACG ACTACAAACC TTGAAAAAAT 9900 

AAAAACAGG 9909 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 112 6 base pairs 

(B) TYPE: nucleic acid 
<C> ST RAND EDN ESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TAATTTTCAT ATAATAGTAA AATAGAATGT GTGATTCAAT AATCACCTCA AATAGAAAGG 60 

AAATTCTATG TCAAATCTAT CTGTTAATGC AATTCGTTTT CTAGGTATTG ACGCCATTAA 120 
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TAAACCCAAC TCAGCTCATC CAGGTGTGGT TATCGGAGCG GCTCCGATGG CTTACACCCT 180 

CTTTACAAAA CAACTTCATA TCAATCCAGC TCAACCAAAC TGGATTAACC GCGACCGCTT 240 

TATTCTTTCA GCAGGTCATG GTTCAATGCT CCTTTATCCT CTTCTTCACC TTTCTGGTTT 300 

TGAAGATCTC AGCATGGATG AGATTAAGAG TTTCCCTCAA TGGGGTTCAA AAACACCAGG 360 

TCACCCAGAA TTTGGTCATA CGGCAGGGAT TGATGCTACG ACAGGTCCTC TAGGGCAAGG 420 

GATTTCAACT GCTACTGCTT TTGCCCAAGC AGAACGTTTC TTGGCAGCCA AATATAACCG 480 

TGAAGGTTAC AATATCTTTC ACCACTATAC TTACGTTATC TGTGGAGACG GAGACTTGAT 540 

GGAAGCTGTC TCAAGCGAGG CAGCTTCATA CGCAGGCTTG CAAAAACTTG ATAACTTGCT 600 

TGTTCTTTAT GATTCAAATG ATATCAACTT GGATGGTGAG ACAAAGGATT CCTTTACACA 660 

AAGTGTTCGT GACCGTTACA ATGCCTACGG TTGCCATACT GCCTTGGTTG AAAATGCAAC 720 

AGACTTGGAA GCCATCCATG CTGCTATCGA AACAGCAAAA GCTTCAGGCA AGCCATCTTT 780 

CATTCAACTG AAGACGGTTA TTGGATACGG TTCTCCAAAC AAACAAGGAA CTAATGCTCT 840 

ACACCGCCCC CCTCTTGGAG CAGATGAAAC TCCATCAACT CGTCAAGCCC TCGGTTGGGA 900 

CTACGAACCA TTTGAAATTC CAGAACAAGT ATATGCTGAT TTCAAAGAAC ATGTTGCAGA 960 

CCGTGGCGCA TCAGCTTATC AAGCTTGGAC TAAATTAGTT GC AG ATT ATA AAGAAGCTCA 1020 

TCCAGAACTG GCTGCAGAAG TAGAAGCCAT CATCGACGGA CGTGATCCAG TCGAAGTGAC 1080 

TCCAGCAGAC TTCCCAGCTT TACAAAATGC TTTTtCTCAA GCAACT 1126 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCGGCAACAA AAAAGAAAAA ATCAACAGTT AAAAAAAATC TAGTCATCGT GGAGTCGCCT 60 

GCTAAGCCAA GACGATTGAA AAATATCTAG GCAGAAACTA CAAGGTTTTA GCCAGTGTCC 120 

GGCATATCCG TGATTTGAAG AAATCCAGTA TCTCCGTCGA TATTGAAAAT AATTATGAAC 180 

CGCAATATAT TAATATCCGA GGAAAAGGCC CTCTTATCAA TGACTTGAAA AAACAACCTA 240 

AAAAACCTAA TAAAGTTTTT CTCGCGAGTG ACCCGGACCG TGAAGGAGAA CCGATTTCTT 300 

GGCATTTGGC CCATATTCTC AACTTGGATG AAAATGATCC CAACCGTGTG GTCTTCAATG 360 
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AAATCACCAA GGATGCAGTC AAAAATCCTT TTAAACAACC TCGTAAGATC GATATGGACT 420 

TGGTCCATGC CCAACAAGCT CGTCGGATCT TGGATCGCTT GGTAGGCTAT TCGATTTCGC 480 

CTATTTTGTG GAAGAACGTC AAGAAGGGCT TGTCAGCAGC TCGCCTTCAG TCCATTGCCC 540 

TTAAACTCAT CATTGACCGT GAAAATGAAA TCAATGCCTT CCAGCCAGAA GAATACTGGA 600 

CAGTTGATGC TGTCTTTAAA AAGGGAACCA AACAATTTCA TGCTTCCTTC TATGGAGTAG 660 

ATGGTAAAAA GATCAAACTG ACCAGCAATA ACGAAGTCAA GGAAGTCTTG TCTCGTCTGA 720 

CGAGTAAAGA CTTTTCAGTA GATCAGGTGG ATAAGAAAGA GCGCAAGCGC AATGCTCCTT 780 

TACCCTATAC CACTTCATCT ATGCAGATGG ATGCTGCCAA TAAAATCAAT TTCCGTACTC 840 

CAAAAACCAT GATGCTTGCC CAACAGCTCT ATGAAGGAAT TAATATCGGT TCTGGTGTTC 900 

AAGGTTTGAT TACCTATATG CGTACCGATT CGACTCGTAT CAGTCCTGTA CCGCAAAATG 960 

AGGCGGCAAG CTTCATTACG GATCGTTTTG GTAGCAAGTA TTCTAAGCAC GCTAGCAAGG 1020 

TCAAAAACGC ATCAGCTGCT CAGGATGCCC ATGAGGCTAT TCCTCCGTCA AGTGTCTTTA 1080 

ATACACCAGA AAGCATCGCT AAGTATCTCG ACAAGGATCA GCTTAAGCTA TATACCCTTA 1140 

TCTGGAATCG TTTTGTGGCT AGCCACATCA CACCGGCCGT TTTTGATACC ATGGCTCTTA 1200 

AATTGTCTCA AAAAGGGGTT CAATTTCCTG CCAATGGTAC TCAGGTTAAC TTTGATGCTT 1260 

ATCTTGCCAT TTATAATGAT TCTGACAAGA ATAAGATGTT ACCGGACATC GTTGTTGGAG 1320 

ATGTGGTCAA ACACGTCAAT AGCAAACCAC AGCAACATTT CACCCAACCG CCTGCCCGTT 1380 

ATTCTGAAGC AACACTGATT AAAACCTTAC AGGAAAATGG GGTTGGACGT CCATCAACCT 14 40 

ACGCGCCAAC CATTGAAACC ATTCAGAAAC CTTATTATGT TCGCCTGGCA GCCAAACCTT 1S00 

TTGAACCGAC AGAGTTGCGA GAAATTGTCA ATAAGCTCAT CGTTGAATAT TTCCCAGATA 1560 

TCCTAAACGT GACCTTCACA GCTGAAATGG AAGGTAAACT GGATGATGTC GAAGTTGGAA 1620 

AAGAGCAGTG GCGACGGGTC ATTGATGCCT TTTACAAACC ATTCTCTAAA GAAGTTGCCA 1680 

AGGCTGAAGA AGAAATGGAA AAAATCCAGA TTAAGGATGA ACCAGCTGGA TTTGACTGTG 1740 

AAGTGTGTGG CAGTCCAATG GTCATTAAAC TTGCTCGTTT TGGTAAATTC TACGCTTGTA 1800 

GCAATTTCCC AGATTGCCGT CATACCCAAG CAATCGTGAA AGAGATTGGT GTTGAGTGTC 1860 

CAAGCTGTCA TCAGGGACAA ATTATTGAGC GAAAAACCAA GCGTAATCGC CTATTCTATG 1520 

GTTGCAATCG CTATCCAGAA TGTGAATTTA CCTCTTGGGA CAACCCTGTT GGTCGTGACT 1980 

GTCCAAAATG TGGCAACTTC CTCATGGAGA AAAAAGTCCG TGGTGGTGGC AAGCAGGTTG 2040 

TTTGTAGCAA AGGCGACTAC GAGGAAGAAA AGATGGCTCT TTGTCAACTG TAGTGGGTTG 2100 

AAGTCAGCTA AGCTCGAGAA AGGACAAATT TTGTCCTTTC TTTTTTGATA TTCAGAGCGA 2160 
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TAAAAATCCG TTTTTTGAAG TTTTCAAAGT TCCGAAAACC AAAGCCATTC CGCTTGATAA 2220 

GTTTGATGAG ATTATTGGTC CCTTCCAATT TGGCGTTAGA ATAGTGTAGT TCAAGCCCCT 2280 

TGACGATTTT CTCTTTGTCC TTTAGAAAGG TTTTAAAGAC AGTCTGAAAA AGAGGATGAA 2340 

CCTGCTTTAG ATTGTCCTCA ATGACTCCGA AAAATTTCTC CGGTTCCTTA TTCTGAAAGT 2400 

CAAACAGCAA GAGTTGATAG AGCTGATAGT GATGTTTCAA GTCTTGTGAA TAGCTCAAAA 2460 

CC T T G TTTAA AATCTCTTTA TTGCTTAAAT GCATACGAAA AGTAGGGCGA TAAAAATGTT 2520 
(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10993 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTTTCTCGAT AATAACTTCC ACCTTATTAT TTGGGATACC CTCCTCTTCT TCACCACCAC 60 

CTTCATAGTA GTCATCGCGA T AG AG AAAAG CTACGATATC ACCCTCCTGC TCAATACACC 120 

CAGATTCACG AAT ATC AG AC AAGACCGGTC TCTTGTCCTG ACGTTCTTCT ACACCACGAC 180 

AAAGCTGACT CAGAGCGATT ACTGGAACCT TCAATTCCTT GGCTAGTATT TTCAACTGAC 240 

GAGAAATTTC AGAAACTTCT TGTTGACGAT TTTCTCGACC AGTTCCCGTG ATAACTTGCA 300 

AATACTCTAT CAAAATCAAA CCAAGATTTC CAGTTTCTTG AGCCAATTTA CGAGAACGAG 360 

AACGAATCTC TGTAATCCGA ATACCTGGCG TATCATCGAT ATAGATACTG GCGTTAGcTA 420 

GATTACCCTG AGCAATAGTA TATTTTTGCC ACTCCTCATC TGTCAATTGC CCTGTACGGA 480 

TAGAATGTGA CTCCACTAAG CCTTCTGCAG CTAACATACG ATCTACCAAG CTTTCCGCAC 540 

CCATTTCCAG TGAAAAAATA GCAACCGTTT TGTCCAACTT AGTCCCAATC TTCTGAGCGA 600 

TATTCAAGGC AAATGCTGTC TTACCAACTG CTGGACGAGC TGCTAAGATA ATCAACTCCT 660 

CCTCATGAAG TCCTGTTGTC ATATGATCCA AATCACGATA ACCTGTCGCA ATACCTGTAA 720 

TATCGGTCGT TTGTTGCG AG CGACCTTCCA GATTTCCAAA GTTGAGATTC AACACATCTC 780 

GAATGTTCTT AAACCCGCTT CGATTTGCAT TTTCACTGAC ATCAATCAAC CCTTTTTCTG 840 

CCTGAGCAAT AATTTCATCA GCTGGTTGTG ACGCTTCGTA AGCTTGGTTG ACAGACTCTG 900 

TCAACTTGGC AATTAAACGA CCTAGCATTG CTTTTTCTGC AACAATCTTA GCATAATACT 960 

CCGCATTACC AGAAGTTGGC ACAGAATTAA CAATCTCAAC CAAGTAAGAC AAGCCACCAA 1020 
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TATTCTCTAA ATCACCTTGA TTATCAACGA TAGTACGAAC CGTTGTTGCA TCTATGGCAT 1080 
CACCACGATC GGATAAATCG ACCATGGCTT GGAAAATCAA ACGATGGGCA TACTTAAAAA 1140 
AGTCCCGAGA CTCAATGTAT TCTCGCACAA AAACAAGTTT ACTCTCATCA ATAAAGATAG 1200 
CCCCTAAAAC CGATTGCTCA GCTAAGATAT CTTGAGGTTG TACTCCTAAC TCTTCTACTT 1260 
CTGCCATCAG ACTTCCCTTC C TTTTAC AAT CTTGTCAAGA AGGTGTAAAC TTATCCTTCT 1320 
TTCACACGAA GATTGATTAC ACTTGTCATA TCTTGATAGA TTTTCACTGG CACATCAATC 1380 
AAACCAACCC CTCGAATCGG AGCTTGTACT TGAATATGAC GTTTATCAAT CTTAATTCCA 1440 
AATTGCTTTT GCAATTCTTC TGCAATCTTC TTATTGGTAA TAGAACCAAA GGTACGACCA 1500 
TCTGGACCAA CTTTTTCAAC AAATTCTACA ACAGTTTCTT CTGCTTCAAG TTCTGCTTTA 1560 
ATTCCTTTTC CTTCTGCAAT CATCTCACCG TCAGCTTTTT CTTCCGATTT TTGTTTACCA 1620 

CGAAGTTCAC CTACAGCTTG AGCAGTCGCT TCTTTGGCTA GATTCTTTTT GATAAGAAAG 1680 

TTTTGCGCAT ACCCTGTTGG TACTTCCTTA ATTTCGCCTT TTTTACCTTT TCCTTTAACA 1740 

TCTGCTAAAA AGATTACTTT CATTCTTCTT TCTCCTTTTC CTTCATTTCA TTTAATACAA 1800 

TTTCTGTCAG TTTTTCACCT GCTTCTGACA AGGTTACATC TTTAATTTGA GCTGCTCCCA 1860 

AATTAAAGTC GCCTCCACCC CCTAACTCTT CCATAATCCG TTGTACATTC ACTTTACTAC 1920 

GACTTCGAGC TGAGATAGAG ATAAATCCTT GTGTATTCTT CGCAAGAACA AAACTCCCTT 1980 

CAATACCTGA CATGGCTAAC ATGGCATCTC CTGCCTTACT AATAACAACT CTATCATAGC 2040 

ATTTCATGTC CTTAGCCTCT GCTATTAGTA CATCTGAACC TAATTTACGC CCCTGTAAAA 2100 

TAACTTCATT GACCTCACGA TATTCTTCAA AATCTGTCGC ACCCATTTCC TGGATAGCAA 2160 

T ACT AT C ACT TCCGCGCGTT CTGAGATACC TACCAACATC AAATGTCCGA CTAGTTACTC 2220 

GCCACCTGAA ATTTTTAGTA TCCAACATCA TACCAGCCAT CAAGACACTT GCTTCCATAC 2280 

GACTCAAACG ATTTTTCTTA GAATTCTGGA ACTGAATCAA TTCCGTTACC AACTCACTGG 2340 

CACTACTTGC ACCACTTTCG ATATAAGTAA TAACCGCATT ATCTGGAAAA TCCTGATCCC 2400 

TTCTATGGTG CTCAATAACA ATGGTTTCGG TAAATAAATC ATAAAATTCT TTTGATAATG 2460 

TTAAGGCTGT CTTTGAATCC TCTACAAGAA TCAACAAAGA ACGATTGGTC ACCATCCCCA 2520 

TTGCATCCTT AACAGACAAC AACTTCGTAA CTCCTTCTTT TTCTATGAAT CAAACAGCTC 2580 

GTTCAATATC TGGAGACATT TGTTCTTCAT CATAAAGACC ATAGCTATTT TCAATCACAT 2640 

TGCTCGCGAA CAACTGCATA CCTACAGCAG AGCCCAAAGC ATCCATGTCT AAATTTTTGT 2700 

CACCGACTAC AAAAACCTGA TCTACACTCC GAATCTTATC TGAAATAGCT CTCATCATAG 2760 

CGCGCGTACG AGTCCGTGTA CGCTTCATTG AAGCAGCAGA CCCACCACCA AAATAAACTG 2820 
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* 4 A • * \«u A 


TTCCTCGTTT 


TCCTTAACAA 


CCACCTGGTC 


GCCACCACGT 


ACTTCAGCCA 


2880 


* a w>*w\ a a 


GAGCAAAGCA 


ACTTTCCCTA 


TCTCATCATG 


ATTTCCATCG 


CCATAAGAAA 


2940 


/% X ^L^A I /\w 4 


TAAGGTCAAG 


GGCAACTGTC 


TCTCTTTCGA 


CTCTTCTCTG 


AAACCATCAA 


3000 




TTTATCATTC 


ATCAAGCCCT 


CAAGCACCGT 


GTAGTCAGTA 


AATAGATAAA 


3060 


n 1 >>un A k» w>v l 


ACTTACCCGA CGACAAAACA TCATGTGTTT 


TTCTGAAAAC 


TCTGATATAA 


3120 




AAAACTATTG 


ATTTGACTAA 


TATCTGACTC 


AGAAGTTTCA 


TCCTCCAAAT 


3180 


(.AiUAlnnl I 


ATCCACACAG 


ACAATCCCAA 


TCACTGGTCT 


ACTTGTTACC 


AATTCATCTG 


3240 


TTATGGCTTC 


TTCCCTGGAT 


ACATCTACAA 


AATACAAAAC 


ACCGGAAGAA 


GCATCCATAT 


3300 


(inALAbLA I A 


ACGCTTCTCA 


CCAAGCTTGG 


CATAAGTACA 


CGGATTTCCT 


ACTGAAGCCT 


3360 


TGATAATCGT 


TTGAACAGCT 


TCTAAATCAA 


AATCACCATC 


TTCCTTGGTC 


AAAATCAATT 


3420 


CAGCATAGGG 


ATTAAACCAC 


TCAACCTCTC 


CAGAAGATAA 


ATTCAATTTC 

A 4 * » * w 


ATAACACCTA 


3480 


CAGGCATCTG 


TTCCAATAGA 


GCTCTCAAAC 




TTGGTGGTTT 


ACATACTGTA 


3540 


TCTGTTCTAC 


ATCACTCCTT 


CTATAATGCA 


CTCTCAGTTT 


CTTAAATAAA 


AAAACATAGC 


3600 


CTtCTACAAA 


AAGAAACAAA 


ATTAAAACCG 


TCAACAGATT 


ATTATTAACA 


AAAATAATGA 


3660 


AAGTGGATAA 


GACTCCAAAC 


GCAATCAATC 


CTACTACAAT 


AGGAAAAATT 


GGACTTACAT 


3720 


AAAA'l l 1 


CATTCAAAAC 


CTCTTGGCAC 


CCATTATACC 


ATAATACCCC 


TCAAAAAGCG 


3780 


ACTTTTTAAA 


AGTGTAATCA 


GTAATTCTAT CAATTATAAG 


AAAAAGGTAG 


TTTACAATTC 


3840 


AU l AAALL r« 


CCTTTACACA 


TATTGAAATT 


AAGATTCTTT 


AACCTCTAAC 


AAACCAATTT 


3900 




ACGACGATAA 


ATCACATTCG 


TTCTCTGATC 


TTCAACATCC 


ACATAGATAA 


3960 


A JL&&T CATC 


CCCCAATAAA 


TCCATTTGTA 


GAATTGCTTC 


TTCCAAATCC 


ATTGGTTTTA 


4020 




TTTTGAACGA 


ACAACTTTAG 


ACTGGACAAT 


ATTTGAATCT 


TCCACCAAAG 


4080 


CATCTGTAAA 


TAATTGACCA 


GTTGCTACCT 


TATTTTTATT 


TTTACGCTCG 


ATTTTTGTTT 


« i ad 
4 14 U 


TATTTTTACC 


AATCTGACGT 


TCAATTTTAT 


CAGTTACAAG 


GTCAATTGAA 


CCATACATAT 


4200 


CTTGAGATAC 


ATCTTCTGCG 


CGGAGAGTAA 


TAGATCCAAG 


CGGAATCGTT 


ACTTCCACTT 


4260 


TAGCCCTTTT 


TTCACGATAA 


ACTTTTAAGT 


TAATTCGGGC 


ATCCAACTCT 


TGTTCTGGTT 


4320 


GGAAGTACTT 


TTCCATCTTT 


TCGAGTTTAG 


AAACT AC ATA 


ATCACGAATT 


GCTTCTGTTA 


4380 


CTTCTACCTT 


TTCACCACGG 


ATACTATATT 


TAATCATATG 


AGTACCTTCT 


TTCTAAACAT 


4440 


TTTTCTTTTT 


ATGATTTTAT 


TATAACGCTT 


TCATTCTATT 


TTTGCAAATT 


TTTTCCTCAT 


4500 


CTTACAAGGG 


AAAATGTTTT 


TACATCCTTA 


GCACCACCTT 


CTTCCAACAG 


TTTCTTAACA 


4560 
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CGATTTATAG 


TTGCTCCTGT 


AGTATAGATA 


TCATCTATAA 


GTAGGATTTT TTTAGGAATA 


4620 


GTGACTCCAC 


TTTTAATAAA 


GAAAGGAAGT 


TCTGTCCCCA 


AGCGCTCTGA 


ACGATTTTTA 


4680 


GAAGAACTGG 


CTCTCTCTTC 


TCTTTTCTCT 


AATAAATCCA GATACTCAAA GCCTGCTGCC 


% t HV 


TCTACCAAGC 


CCTCAACCTG 


ATTAAATCCT 


CTATTAGCAT 


ATCTATCAGG 




a o An 


ATTACAACAA 


ATTGATACTC 


TTTGTACTTT 


TTCAACTCCT 


CACTTAAAAA 




4 OOU 


ACTTTTCTTA 


ACAGGAACTC 


TCCATCAAAC 


TTATACCGAC 


TGAAAAAATC 


CTTCATAGtl 




TGATTGTAAG 


TAAAAATCGC 


TCTATGACTG 


ACTTCAACTC 


CCTCTTTACA 


CCAAAGTTGA 


a &on 


CAATCTTGAC 


ACTTTGTTGA 


CAACTCTGTT 


TTCATACAAT 


TTGGACAGTT 


CTCTTCCCCA 


cnA ft 

3U4U 


ATTCTTTCAA 


AAGTAGAATC 


ACAGTCTCAA 


CAAAGACAAG 


AGTCATCATT 


CCTCAGAAGT 


£ 1 AA 
3l<JU 


AAGAGACTAC 


TAAAACTTAA 


AACAGTCTTC 


ATAGTCTGCC 


CACATAACAA 


GCACTTCATA 


SI 60 


CACCAGCCTC 


CTTATTCATC 


ATCTGAATTT 


CCTTAATCGC 


CTTCTTGATT 


CAAGCATTTA 


S220 


ACCCATCATG 


GAAGAAAAGC 


AAATCTCCTG 


TCGGTCTATC 


CATGCTTCCT 


CCAACTCGTC 


5280 


CACCAATCTG 


AATCAAACTA 


GACTTGCTAA 


ACAAACGATC 


ATTGGCCTCT 


ACTACGAAAA 


5340 


CATCCACACA 


AGGGAAGGTA 


ACTCCGCGCT 


CCAAGATTCT 


CGTACTGATA 


AGTATTGTCA 


5400 


GTTCTCCATC 


TCGAAAAGCT 


TGTACTTGCT 


CTAATCGATC 


CTCTGTTACA 


GAACATACAA 


5460 


AGCCAATTTT 


CTCATTTGGA 


AATTGCTCCT 


GTAAGATTTC 


TGCTAACTGC 


TCCCCTTTCT 


SS20 


TAATTTCTGA 


AGCAAAAATG 


ACTAACGGAT 


AAGCTGTCTT 


TCTCTGCTTC 


TCAATATAGG 


5S80 


ACTTTAACTT 


TGGTCACAAA 


CGATTCTTGT 


CTAAGTAGCG 


ATTAAAATCC 


GATAACCAAA 


5640 


TTGGTTTTGG 


AATAATCAAC 


GGATTTCCAT 


GAAACCCTCT 


CGGTAAAT7C 


AGTCTTTTTA 


5700 


GTTCTCCTAA 


ACGCACCTTT 


TTATCTAACT 


CATTGGTCGA 


AGTCGCTGTT 


AAAAAGATTC 


5760 


TCAATCCATT 


CTCCTTTACA 


CTATTCTTGA 


CAGCGTGGTA 


AAGCATGGGA 


TTATCAACAT 


902 U 


AAGGAAAAGC 


ATCTACTTCA 


TCCACTATCA 


GCAAATCAAA 


AGCTTGATAA 


AACTTCAATA 




ACTGATGGGT 


TGTTGCAACA 


ACTAGTCGTG 


TTCGAAAATA 


AGGTTCCCAT 


TCTCCATCTA 




GCAAAGCTAT 


CCCGCAAGAA 


AAATCCTGTT 


GCAGGCGCTT 


GTACAGCTCC 


AAACAAACAT 


6000 


CTATGCGAGG 


ACT AG CC AAA 


CACACTGCAC 


CACCCGCATT 


GATCACTTTA 


GCCACTACTT 


6060 


GATAAATCAT 


TTCTGTCTTT 


CCAGCTCCTG 


TTACCGCATG 


AACTAAGGTT 


GGCTTTTGCT 


6120 


TGTCTACTAC 


TTGAAGCAAT 


CCCTCTGACA 


CCTTCTCTTG 


AAAAGGAGTT 


AATTGCCCGC 


6180 


GCCATTTGAC 


AACATCTTGC 


TTTGGAAAAT 


CCTCCTCCCG 


AAAATAGTAT 


AAAGTTTCAT 


6240 


CACTTCTGAC 


TCGCTTCATC 


AGCAAGCACT 


CTCGACAATA 


GTAAGCACCG 


ATGGGCAAAT 


6300 


ACCATTCTTC 


TAGAATACTA 


CTATTACAGC 


GTTGACAGAA 


AAGTTTCCCC 


TTCTCCTTTC 


6360 
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TCATTGCTGG 


AAGTTTCTCC 


GCCAACTGAC 


GTTCTTCTTC 


TGTTAATTCA 


TTCTCAGTAA 


6420 


ATAAACGACC 


CAGATAATCT 


AAATTTACTT 


TCATACTTCT 


TTATTCGTAA 


AAACTAGCAC 


6480 


TTTAGATGAT 


TTTTTAGTAC 


AATTAAATCA 


TGGAATTTAG 


GACAATTAAA 


GAGGACGGTC 


6540 


AAGTCCAAGA 


AGAAATCAAA 


AAATCTCCCT 


TTATCTGCCA 


TGCCAAGCGT 


GTTTATAGCC 


6600 


AAGAAGAGGC 


TCGTGACTTC 


ATTACTGCCA 


TCAAAAAAGA 


ACACTACAAA 


GCGACACATA 


6660 


ACTGCTCTGC 


CTTCATTATT 


GGAGAACGTA 


GTGAAATTAA 


ACGTACAAGT 


CATGATGGTG 


6720 


AGCCTAGTGG 


TACTGCTGGT 


GTTCCCATCC 


TTGGGCTACT 


AGAAAATCAC 


AATCTCACCA 


6780 




GGTCGTGACA 


CGCTACTTTG 


GTGGTATTAA 


ACTAGGCGCT 


GGAGGACTAA 


6840 


TTCGTGCTTA 


CGCCGGCAGT 


GTCGCCTTAG 


CTGTCAAAGA 


AATTGGTATT 


ATTGAAATAA 


6900 


AAGAACAGGC 


TGGCATTGCT 


ATTCAAATGT 


CTTATGCTCA 


GTACCAAGAG 


TACAGTAACT 


6960 


TCCTTAAAGA 


ACATGGTCTC 


ATGGAGCTGG 


ATACAAACTT 


TACAGATCAA 


GTCGATACGA 


7020 


TGATTTATGT 


TGATAAAGAA 


GAAAAAGAAA 


CTATTAAAGC 


TGCACTTGTG 


GAGTTTTTTA 


7080 


ATGGAAAAGT 


CACTTTAACT 


GACCAAGGTT 


TACGAGAGGT 


TGAAGTTCCT 


GTAAACTTAG 


7140 


TGTAAACAAT 


GAATAATACA 


GCGTTTCGTT 


GACATTCTCA 


CAACTACTTT 


AGCCAGCAAA 


7200 


AT AAAAAG AG 


CCCTACCAAA 


ATATACTAGA 


AAATGAAGCA 


ATTCAAACGA 


AACCTGATAT 


7260 




CACACCTATT 


TACTAGAATT 


AGCTGAACGC 


AATCACTTGA 


AAATTAATGA 


7320 


CTTTGATCTA 


TGATATATAG 


AAATGGTATG 


GATAGCGTTA 


TACTAAACAT 


ATCTTATACA 


7380 


AACAGGTATT 


CATATGTCTA 


TTTATAACAA 


CATTACTGAA 


TTAATCGCTC 


AAACACCCAT 


7440 


TGTTAAACTT 


AACAACATCG 


TGCCAGAAGG 


TGCTGCAGAC 


GTCTATATAA 


AGCTTGAAGC 


7500 


ATTTAATCCT 


GGTTCATCTC 


TAAAAGACCG 


TATTGCCCTT 


AGCATGATTG 


AAAAAGCTCA 


7560 


ACAAGATGGT 


ATTCTGAAAC 


CTGGTTCTAC 


TATTCTTCAA 


GCAACAAGTG 


GAAACACCGG 


7620 


TATTGGACTT 


TCATGGGTAG 


GTGCTGCTAA 








7680 


AACTATCAGT 


GTAGAACGAC 


GTAAAATTAT 


CCAAGCTTAT 


CGTCCTGAAC 


TCGTCCTAAC 


7740 


TCCTGGTAGC 


GAGGGAATGA 


AAGGTGCTAT 


TGCTAAGGCT 


CAAGAAATCG 


CTGCTGAACG 


7800 


TGATGGTTTC 


CTTCCTCTTC 


AATTTGACAA 


TCCAGCTAAT 


CCAGAAGTAC 


ACGAAAGAAC 


7860 


AACAGGAGCT 


GAGATACTAG 


CTGCTTTCGG 


TAAAGATGGA 


TTAGATGCCT 




7920 


AGTAGGT ACT 


GCTGGAACGA 


TTTCTGCTGT 


TTCTCATGCA 


CTCAAATCAG 


AAAATTCTAA 


7980 


CATTCAAGTT 


TTTGCAGTAG 


AAGCAGATGA 


ATCTGCTATT 


CTATCTGGTG 


AAAAACCTGG 


8040 


TCCTCACAAA 


ATTCAAGGTA 


TCTCAGCTGG 


ATTTATTCCT 


GATACACTTG 


ATACTAAAGC 


8100 
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CTATGATGGT ATCGTTCGTG TAACATCAGA TCACGCTCTT GCACTCGGAC GTGAAATTGC 8160 
TGGAAAAGAA GGCTTCCTTC TACGGATTTC CTCAGCTGCA GCTATCTACG GAGCCATCGA 8220 
GGTTGCCAAA AAATTAGGTA CAGCTAAAAA AGTCCTTGCC CTAGCACCAG ATAACGGTGA 8280 
ACGTTATCTC TCTACAGCAC TTTATGAATT GTAACCGTCC AATAACGAAG TCTATTGAAA 8340 
AATCTCCAGA CTAGAGAACT CACGGATAGT TCCTAATCTG GAG7CTTTCTT ATTTCCACTT 8400 
TTCTTCTACA ACTTTAGTCC ATGCTAAATA GGCCTCTAAA ACCTCTTTGT TTACGAGAGT 8460 

TTCCACGTTT GGAAGACATT CTAGAAGATA CGATAGATAT TTCTCACTAT TTATAATGGA 8520 

TTCAAATAAG ATATGAACAA ATCGATTAGA ACATGATGGT AAAGCGTAAT CCCTTGTTTC 8580 

TCAGCTTTCC CAGACAAAAA AGTCCAATAC TAAGTCAGCT GACTATCACT CTCTAGCACC 6640 

CTATAAGAAC TTTCATCCCC ATGAAGTAAG GGCTGAGTCA ATAGTCTCTC TCGCAAGAGG 8700 

TTATAAAGGC CCTCCAAATA GTATTGACTC GTCTTGATAT GCCAATTAGA GATTTCCTTA 8760 

CGTGTGATTG GTAAACCCAT CCTAGCCCAA TCTTCTT C TT GGCGATAATT GGGTACCTTC 8820 

AGATTAAACT TCTCATGGAT GGTGTGAGCC ATAATAGAAG CTGAGCCAAA GTTATGCGCT 8880 

AAAGGGCCTT TAGGAATAGG AGCTTTCACA AGCTTATCCA GATGATTATC TTTTACTCCT 8940 

TATGGACAAT GCTATATGGC ATAAATCAAG TACCTTAAAG ATTCCGACTA ATATTGGCTT 9000 

TGCATTTATT CCTCCATACA CACCAGAGAT GAACCCCATT GAACAAGTGT GGAAAGAGAT 9060 

TCGTAAACGT GGATTTAAGA ATAAAGCCTT TCGAACTTTG GAAGATCTCA TACAAGGACT 9120 

CGAGAAGGAG GTGATAAAGT CCATCGTTAA TCGGAGACGG ACTAGAATGC TTTTTGAAAA 9180 

CAGATGAGTA TAAAAAGAAA GTCCTCATTT CAATAGAAAT CACGACTTTC TGATGAATTT 9240 

ATAGTAAAAT GAAATAAGAA CAGGATAGTC AAATCGATTT CTAACAATGT TTTAGAAGCA 9300 

GAGGTGTACT ATTCTAGTTT AAATCCACTA TATTTGGGGA GTGATAGAAA AGCCCTTCAT 9360 

CAGCCAATCT ACTTGTTCAG CTGCGAGAGC TTTGACATCC TTTTCTGTAC TGGACCAAGT 9420 

CAGTTTTCCG TTCTCAAAGC GTTTATATAA TATCCAAAAT CCTTGACCAT CCCAGTAAAG 94 80 

AACTTTAAAG CGGTCTTTAC GTCCACCACA AAAGAGAAAC ACTTCATCGG AGAAAGGATC 9540 

CAATTCAAAG TGGGTTTTAA CTACATAGGC TAATGAGTCT ATTCCCTCCC TCATATCTGT 9600 

CTTGCCACAA ACAAGGTGAA CTTGACCTAA ATCACTTAGT TGAATTATCA TAGTACAATA 9660 

CCTTTCCTCC GATAATTATT TTTTATCTGG TATACTGGAA GTTGGGGAAT TAGGATAGAT 9720 

ACCTTGTTAT GACGCGCTTA CTATGAATTT GAAGTATAGT CTCCTAAATG CACTTAGCCC 9780 

TTATTATAGG GCTTTTTGTT TTAATTATTC TAATCGAGTG AGACTGGGGA AAAAACAATT 9840 

TCAGGAAAAA TCTAAGCCCT ATACAAAAAA GGAAGCAATT TGCTTCCTTT CTATTATTAG 9900 
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TTATTCAAGG CTGCTGCCAT TGTAGCTGCA ACTTCAGCTT CGAAGTCGTT TGCAGCTTTC 9960 

TCGATACCTT CACCAACTTC AAAGCGAGCA AACTCAACTA CCGAAGCGTT AACTGATTCA 10020 

AGGTATGCTT CAACTGTCTT GCTCTCATCC ATGATGTAAA CTTGTGCAAG AAGTGTGTAA 10080 

GCTTGGTCAA CTTTAGTGTT ATCAAGCATG AAGCGATCCA TTTTACCTGG AATAATTTTG 10140 

TCCCAGATTT TTTCTGGTTT GCCTTCTGCA GCCAATTCAG CTTTGATCTC AGCTTCACCT 10200 

TGAGCAATAA CATCATCAGT TAATTGAGCT TTTGATCCAT ACTTCAAGTG TCGAAGAGCT 10260 

GGTTTATTAA CCATTGCACG GCTTTCCTTG TCTTGGTCGA TAACGTGATT CAATTGTGCC 10320 

AACTCATCTT TAACGAATTG CTCATCCAAT TCTTTGTAAG AAAGAACTGT TGGTTTCATC 10380 

GCTGCGATGT GCATTGACAA TTGTTTAGCA AGTGCTTCGT CTCCACCTTC AACAACTGAA 10440 

ATAACACCGA TACGTCCACC CTTATGTTGG TATGCTCCAA AGTGTTGTGC GTCTGTTTTT 10500 

TCAATCAATC CAAAGCGACC GAATGAGATT TTCTCTCCGA TAGTTGCTGT TGCAGATACG 10S60 

TATGCAGCTT CAAGAGTTTC ACCTGAAGGC ATTATCAAAG CAACACCTTC TTCGTTGTTA 10620 

CCACCTTT TC CTTCAGCAAT GACTTTAGCT GTAGTATTTA CCAATTCAAC GAATTGAGCG 10680 

TTTTTTGCAA CGAAGTCAGT TTCAGCGTTT ACTTCAATAA CTCCTGCAAC ATTACCGTTA 10740 

ACATAAACAC CAGTCAAACC TTCTGCAGCA ACACGGTCAG CTTTCTTAGC TGCCTTAGCC 10800 

ATACCTTTTT CACGAAG CAA TTCAATCGCT TTTTCGATGT CACCGTCTGT TTCTACAAGC 10860 

GCTTTTTTAG CGTCCATAAC ACCGGCACCA GATTTTTCAC GCAACTCTTT TACAAGTTTA 10920 

GCTGTAATTT CTGCCATTTT AATTCTCCTA TATTTTTTGA AAATAGGAGA GCCCGGCTAA 10980 

GCCCCGCCTC CGG 10993 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CGACGGGGAG CTTTGGCACC TCGATGTCGG CTCGTCCCAT CCTGGGGCTG TAGTCGGTCC 60 

CAAGGGTTCG GCTGTTCGCC CATTAAAGCG GCACGCGAGC TGGGTTCAGA ACG TCGTG AG 120 

ACAGTTCGGT CCCTATCCGT CGCGGGCCTA GGAAATTTGA GAGGATCTGC TCCTAGTACG 180 

AGAGGACCAG AGTGGACTTA CCGCTGGTGT ACCAGTTGTC TTGCCAAAGG CATCGCTGCG 240 



merwin. no 1 ooo 1 4 1 ) i 
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TAGCTATGTA 


GGGAAGGGAT 


AAACGCTGAA 


AGCATCTAAG 


TGTGAAACCC 


ACCTCAAGAT 


300 


GAGATTTCCC 


ATGATTATAT 


ATCAGTAAGA 


GCCCTGAGAG 


ATGATCAGCT 


AGATAGGTTA 


360 


GAAGTGGAAG 


TGTGGCGACA 


CATGTAGCGG 


ACTAATACTA 


ATAGCTCGAG 


GACTTATCCA 


420 


AAGTAACTGA 


GAATATGAAA 


GCGAACCGTT 


TTCTTAAATT 


GAATAGATAT 


TCAATTTTGA 


480 


GTAGGTATTA 


CTCAGAGTTA 


AGTGACGATA 


GCCTAGGAGA 


TACACCTGTA 


CCCATCCCGA 


S40 


ACACAGAAGT 


TAAGCCCTAG 


AACGCCGGAA 


CTAGTTGGGC 


GTTGCCCCCT 


GTGAGATAGG 


600 


GAAGTCGCTT 


AGCTTTAATC 


CGCCATAGCT 


CAGTTGGTAG 


TAGCGCATGA 


CTGTTAATCA 


660 


TGATGTCCTA 


GGTTCGAGTC 


CTACTGGCCG 


AGTAAT t G AT 


AAAAGGCaAC 


ACAGCTGTGT 


720 


TCCTCTTTTT 


GTATCAATTT 


GTATCACCAA 


GCATTTTCAT 


AAGGAAGTCT 


GTTA'l'TTCTT 


780 


GAGAACTTTC 


TTTTTTTCCA 


TGTGCAATCC 


AACTTTGGCA 


GACACCAAAA 


AGTGCATGAG 


640 


TTAGATAGAT 


CCTACTATAT 


TCTAATTCAG 


TGGTATTTAG 


ATTCAGTTGC 


ATAAATCGCT 


900 


TTTGTAAATC 


TGTACTAAGC 


ATGATATGAA 


CTTTATTTCG 


TAAGAAATTT 


TGGATTTCTT 


960 


TAGTCCCATT 


TTCAGAAAGA 


AGGGCAGCCA 


GAAGTGCTTC 


TGACTCTAGA 


TATTCAAAAA 


1020 


CTTCTAAAAT 


AGCGTCTCTT 


TTGTGATGAG 


CATGTTTTTG 


AAAAATATAT 


TCAAATCTAT 


1080 


GGAATAGCTT 


GCTTTGATAG 


TGCTCAATCA 


TATCATACTT 


ATCCTTATAG 


TG ACT AT AG A 


1140 


AGCTGCAACG 


ACTAATTCCG 


GCTTTTTCTA 


CT AATTTG AC 


AGTAGAAATT 


TTATCAAATG 


1200 


GCTGTTCCAT 


CAGTAATTGT 


ACCATACCAT 


TTTCAATAGT 


TCGCTTTGTT 


TTTAAGCGTT 


1260 


TGTTACTTTC 


TTGCATATTT 


CCTCCTTGTA 


AACAAATTAG 


ACTATATGTC 


TAAAAATAGA 


1320 


TTTTTTATCT 


TGTAATTTAG 


ATTTTTTAAT 


GTATAATCTA 


TTATATCAAA 


ATTTTAGACA 


1380 


ATATGTTTAA 


AAAAGCAGAA 


ACTAAGTTTA 


AAC AAT~n AA 


ACCAATTTAA 


AAAAAACCAA 


1440 


CCTTTATTAT 


TGTCATGATC 


CGGATTTCTC 


TTATTCCAGA 


TCTGTACAAT 


ATCATATTTT 


1500 


TGTCATCAAT 


GTGGGATCCA 


TATGGGCAAT 


TGTCTGACTT 


ACCTGTCGCA 


GTTCTAAATA 


1560 


ATGATAAAGA 


GGCTTCCTAT 


AATCGTAATA 


CTATGCCAAT 


AGGAAAAGAC 


ATGGTGTCCA 


1620 


ATTTAAAAGA 


AAATAAAACC 


TTGGATTTTC 


ATTTTGTAGA 


TGAAGAGGAA 


GGAAAGAAGG 


1680 


GATTGGAAGA 


TGGCGATTAC 


TATATGGTAG 


TGACTTTACC 


AAGTGATTTA 


TCTGAAAAAA 


1740 


CAACTACATT 


ATCCAATATT 


CAATCGACAG 


CAGCTTATCA 


ATCATTGACA 


AGTGAGCAAC 


180C 


AAACTGAGAT 


AAGTGATTCT 


GTATCTCAAA 


ATTCAACTGA 


TAGTATTCAA 


TCGGCTCAGT 


1860 


CAATTGTAGC 


TTTAGTACAA 


GATTTACAGC 


GAAGTTTAGA 


AAACTTACAA 


AATCAATCTT 


1920 


CTAATCTTTC 


GACTTTAAAA 


AATCAATCTA 


ATCAACTATC 


ACCTATTACT 


TCTACTTCTT 


1980 


TGATAGGATT 


GTCAACTGGA 


TTAACAGAGA 


TACAAGGAGA 


TGTT ACT AG C 


AAATTAGTTC 


2040 



A 
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CTCCCAGTCA 


GTCGATTGCA 


TCAGGTG I AA 






GATAAAGTTT 


2100 


CTCAGGGCGC 


AACTCAACTA 


AGTGAAAAAA 


A*IV*0^ A f*l 'I'O' 






2160 


TAGTTTCACC 


CTCAAACACC 


jii_iri_ i ft 0* it & IV 

TTGACACAAA 


AAlVI 'W'l'AG 






2220 


AATTACAATC 


AGGATCTGGG 


CAATTAGCAG 


ACAAATLLAu 


TCAGTTACTT 




2280 


CTCCATTAGA 


GAATAGAGCT 


AATAAATTGG 


CAGATGGATv. 






2340 


GAACAAAGTT 


AACTTCTGGA 


TTGGAAGATT 


TACAGACAGG 


ACTTGCTTCT 




2400 


CACTACCTAA 


TGCTAGTGAT 


CAACTCAAAT 


CAGTATCAAC 


ft ^it ft tn^Atu Ik k 
AGAATCTAAA 


A A IT Af A 




TTTTGTCAAA 


TCCACTCAAT 


CTTTCAAAAA 


CAGACAATGA 


TCAAGTTCCT 


G IAAA luUAA 




TCGCAATAGC 


TCCTTATATG 


ATATCACTTG 


CTCTTTTTTT 


GCAGCAATAT 


*"*A IPillTKT 
LAALAAA 1 A 1 




GATATTTGCG 


AAATTCCCTT 


CAGGACGTCA 


TCCAGAGAGC 


CGTTGGGCTT 


GGTTGAAATC 




TTGAGCTGAA 


ATAAATGGTA 


TTATAGCTGT 


TTTGGCAGGA 


ATTTTGGTAT 


ATGGAGGAGT 


i. /UU 


TCAGCTTATT 


GGTTTAACTG 


CTAATCATGA 


GATGAGAATA 


TTTATTCTCA 


TCATCCTAAC 


2 7 oU 


AAGTTTAGTA 


TTCATGTCTA 


TGGTCACCAC 


TTTAGCAACG 


TGGAATAGCC 


CTATAGGAGC 


2820 


TTTTTTCTCA 


CTTATTTTGC 


TTTTACTACA 


GTTAGCATCA 


ACT GC AGGTA 


CTTATCCACT 


2880 


TGCTTTGACA 


AATGATTTCT 


TTAGATCTAT 


TAATCCCTGG 


TTACCAATGA 


GCTATTCAGT 


294 0 


TTCGGGATTA 


CGACAAACAA 


TCTCTATCAA 


CAAGTCATTT 


TCCTAGCTGT 


CATACTAGTT 




CTATTTACTA 


GTTTAGGTAT 


GCTAGCCTAT 


CAACATAAGA 


AAATGGAAGA 


m M ft It ft It & It 

AGATTAAAAA 


JUOU 


AATCGACCGA 


TTAACTGGTC 


GATTTTTTAT 


CCCTTAGATG 


ACTTTCGTCT 


G 1 G A 1 1 A 1 Au 


J It V 


ATTCCAAATA 


GTAAGAGAGA 


ACTAAAGGAA 


CAGATTGCTC 


Jt X ft ff* It R ft 

CAGTAATAAA 


riCAGTT 


JlOv 


.... 

TGAGCTTGTT 


* 

TAATTTCTAT 


TTTCTTACCA 


TCTTGGTAGG 


C AOAv. LAAL l» 




3300 


GCAATGCTGA 


AGAAAATACA 


TGTATCTTGT 


TGGACATCAT 


ATG 1 AutAAA 




3360 


TTAGAAGTTG 


ATACTGTGAC 


AGGTTGTTCT 


TTAATTTTTT 


GAATTGCCTC 


GGTGAAAGTT 


3420 


TTGGTATCTA 


AACGATAGAA 


GCTAGGAGAT 


TCAAATGATA 


CTTGTGAATT 


TCCAGGGAAA 


3480 


CTAACATTGA 


TATTGAAAGT 


TTTTTTCTCT 


TTAGTATATC 


CTAGATTAAA 


GAAGGAGAAG 


3540 


ACATTATC AG 


TTCTAAAAGT 


CTTTTTTTCA 


CCATTTACAA 


GGATGTCAAC 




3600 


TTATCGTTAG 


AAAAGTGAAG 


GTTTATGAAA 


GAGAGATAAA 


CTTGGCTGTT 


TTCTGGAACT 


3660 


TCAATTTGAT 


ACTGGATTGC 


TGCATCTTCA 


TTTGAAGAAC 


TTGTGACACT 


AATCAAATCA 


3720 


TTAGTATTTT 


CTATITTTTC 


TGTTTTTTCA 


TAAGGTATTG 


GAGAAAAATA 


ATCAAAATTG 


3780 



WO 98/18931 



PCT/US97/19588 



242 



* 



ACGTTAGCAA 


CTTGATTTAA 


AAATGAGGCC TGATTATCCA AGCTAlpTTC 


ATTGAACTTG 


3840 


ACATCATTGT 


AAACAGATTG 


ACTCCCAACT 


GCAATCGGAA 


GACAGTATTG 


ATTTTCATAT 


3900 


AGCGTAAGAT 


TATCTTTTTG 


ATAGATATCT 


TTAAAGCCAT 


ACTTATCAAT 


AGGACTGTCT 


3960 


GAGATATTGT 


ACTGGATACC 


AAATAAACTA 


TCAGCCAAAA 


TACTATTATT 


TGCATATCCG 


4020 


AGATTGAGAT 


TAGTCCCAGA 


GGATTTAAAA 


CCAACTTTAT 


CTAAAGTAGA 


GCTTGATGAA 


4080 


CGATTTCGAA 


CAGATGAAAA 


TTGAGACATT 


CCATTGTAGT 


TGAATTTCAT 


ACTGTCATTT 


4140 


CCTGTCTGAG 


TTTGTAGTTT 


TTCAGTACGA 


GTAAATTGAT 


TTCCAATATA 


TGTTGAGAAA 


4200 


GATTCCATAG 


CTGGGATATC 


TCGACTATAA 


GCACTTCGAG 


AAGCAAATCC 


CCATTCCTTA 


4260 


GCAATTCCGT 


CCATTTGAGA 


TGAAGCATTT 


AAACTCATTT 


CAACCAGTAT 


AAATAAAGAG 


4320 


ATTAGAATGG 


CAAATAGATT 


CACAGATATA 


AACTTTTTGA 


TAACTGCAAG 


GAGTAAAAGA 


4380 


GAATAGACAA 


CCAAAAATTC 


AAGAGTAAGC 


AGAATATTCA 


AATCTGTTAA 


AAAAGAATAA 


4440 


TGCCATTTTA 


GATAGATGGT 


AGCTAAAAAT 


CCTGCTACTA 


CAACAAAAAG 


CGAAACTAAA 


4500 


AAATTCCAGA 


CTTTAAGTTC 


TTTCAGACGC 


TTTAACACTT 


CTGCTCCTGT 


CTAAATTAAC 


4560 


AAGCTAGAGA 


AAATCCAAGC 


ATAGCGATGT 


AAAAACATGT 


TTGGAGTATG 


CATGCCTTCC 


4620 


CAAAATAAGT 


CAAGAGCTTC 


TATGTAAAAG 


CTTGCAATTA 


GAAATGCAAA 


GAATATTACA 


4680 


TATATGAGTT 


TCACGTGAAA 


CTTAATAGAT 


TTCAGCGTAA 


AAAATAAAAT 


GGTCAAAATA 


4740 


AAGGGAAATA 


GTCCAACAAA 


AATCATTGGG 


ATGGCCCCAT 


ACTTTGTTGT 


CTCAAAGGAA 


4800 


CCAATGAATT 


GCTTAGCAAA 


GAGATCAAGA 


TACCACCTAC 


TTTCAGTTTG 


AAACTTTGTA 


4860 


ACTTCAGTCA 


ATTTTTCCCC 


ATCTCTCTGT 


AAATCAAATA 


GAGTGGGAAG 


AGTCATAATC 


4920 


AAACTAGCCA 


TACCAGCTAA 


AAAGGAGATA 


ACTATOAAAT 


CAACAACACA 


TGATTTTCGA 


4980 


CTCTTAAAGT 


CCCACGAAAT 


TTGACAGAGA 


TACCAGAAAA 


TAAGAAACAA 


TACTGTCATA 


5040 


TATCCAAAAT 


AATAATTTTG 


AATAAATAAG 


ATTGACAGAC 


TTGTAAAGTA 


CAATAGGAGT 


5100 


TTCTTTTCAG 


TTATCAGTAG 


ATGTAAACCA 


CTTATAATTA 


AAGGAATCAA 


GATAAAAACA 


5160 


TCTAGCCAGG 


TTTTTATCTC 


TAATTGACTG 


ACAGTGAAAC 


TCATCAGAGC 


ATAGGAAGTA 


5220 


GATAAGGCTA 


CTTTTAAAAT 


CTGAGGGATA 


GATTGAAACA 


ATTTATTCAA 


ACTAAAAAAG 


5280 


CTTGACAGAC 


CAATCAATCC 


AAATTTTAAG 


AGAGTTGTCA 


GATACATAGC 


ATCTGGCATA 


5340 


TTCGTTACAT 


CAAAAAAGTA 


AACCAGAGGC 


GCGAGAAAAC 


TACCCAACTA 


ATAACTAGAT 


5400 


AGGGCATAGA 


AGTTTAGCCC 


TAGACCACTT 


GTAAAGGTGT 


AAAAC AG ATT 


ACTATTTCCA 


5460 


TGTAGGATAT 


TTCGTAAGGC 


TACATCAAAA 


ATAACGTATT 


GATGAAAGCC 


ATCTCCTAAT 


5520 


AGAGGAGAGT 


TGTCGCTATT 


CCAGTAGATA 


CTTTGACATA 


GATATACTCC 


AGACATAATC 


5580 
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ACTACAGGAA 


TGATGAAAGA 


AATAAAATAO 


Oil k J\ lA* 


1 A 1 1 i 


TGATTTCATG 


5640 


TTACCTCGTA 


GAATGATACA 


AAACTCAGTT 




ftW 4\J^W * * 4 4 


C AAfa TTTT AT 

A 4 4 4 ■ » 4 


5700 


TTACTCTTTC 


CAAAGTTCTT 


TAACTTTTGC 


TTGTACTTCT 


GCATTTTCTA 


wnn 1 1 4 l> 


5760 


GTAGGTTTCA TCGATACCGT 


CAATGACGCC 


ATTTTTAGAT 




1 /V iWU 1 1 AW. 


5820 


CAAAGTTTGA 


ATAAATTVvTT 


GGTCATGGCT 


CGCAAACATW 


ATTGATTCTT 


Innnwl 1111- 


5860 


CAATCCATCA 


TTCAAGCTTG 


AGATAGATTC 


CAAGTCCAAG 


TGAri'lVI 


L»/\ twA l^AAU 


37VU 


TACAAGGACA 


TTTGATTTTA 


AGAGCATGAG 


TTTTGAAAGC 


ATGACACGAA 


CTTTTTCTCC 


C/1AA 
OUVU 


CCCTGACAAG 


ACATTTACAG 


GTTTGTTAAC 


TTCATCTCCA 


GAGAAGAGCA 




DUOU 


GAAGCCACGT 


AGGAAAGTAT 


TGTCATCTTC 


'flUTTTACTT 


GCGAATTGAC 


CCAACCAGTw 


C 1 Oft 


AAGAATTGAT 


TCTCCTCCTG 


CAAAATCAGC 


TGAGTTATCT 


TTTGGTAGGT 


ft ft ^ ft MMA ft 

AAGATTGACT 


6180 


AGTTCTAACT 


CCCCACTTGA 


CAGTTCCTTC 


ATAGTCAATA 


TCTCCCATGA 


riMMfe** ft ft ft Mi 

TTGCACGAAT 


6240 


TAATGCAGTC 


GTTTGAATAT 


CATTTTGTCC 


AATAAGTGCT 


GTCTTATCAT 


CTGGACGCAA 


6300 


GATGAAACTA 


ATATTATCCA 


AGATAGTTTC 


ACCATCAATC 


TTTACACTTA 


AATTTTCTAC 


6360 


TGTCAAGAGA 


TCATTACCAA 


TCTCACGTTC 


CCCTTTAAAC 


TTGATAAATG 


GATATTTACG 


6420 


ACTAGATGGC 


ACAATCTCTT 


CTACCTCAAT 


CTTATCAAGC 


ATTCT CTT AC 


GTGATGTTGC 


64 80 


CTGCCTTGAC 


TTAGAAGCAT 


TGGCAGAGAA 


ACGAGCAACA 


AATTCTTGCA 


A"i"i"V»"4 i 1'AAT 


6540 


TTTTTCTTCT 


CCTTTACCAT 


TACGGTCTGC 


TAGCAATTTA 


GCAGCAAGCT 


ft ^ ft ft ^ ft MWft^ 

CAGAAGATTC 


OOUU 


CTTCCAGAAG 


TCGTAGTTTC 


CGACATAGAG 


TTTGATTTTT 


CCAAAGTCAA 


CCTCwCCAT 


ODOU 


GTGAGTACAA 


ACTTTGTTTA 


AGAAGTGACG 


GTCGTGGGAT 


ft ft ft *f%ft ft 

ACTACGATAA 


CTGTGTTATC 


D '4U 


AAAGTCAATC 


AAGAA J .* ; . : 


.TAACCAAG? 


AA 7 C«I*A rTGG 




„ « * AO 1 fM»W 




CTCGTCCAAG 


AGAAGAACAT 


CTGGTTTACC 


ft ft ft ft ft 

AAAAAGTGCT 




UAALLl 1 i 


DOW 


TTTTTCACCG 


TTGGCCAATT 


CGCTCATGTT 


TTGGTAGTGT 


AATTCTTCTG 


GAATCTTTAG 


6900 


GTTTTGAAGT 


AGTTGAGAGG 


CTTCACTCTC 


TCCTTCCCAA 


CCTCCAAGTT 


CGGCAAACTC 


6960 


TCCTTCGAGT 


TCGGCAGCAC 


GAACCCCGTC 


CTCGTCTGAG 


AAATCTTCCT 


TCATGTAGAT 


7020 


AGCATCTTTC 


TCTTTCATGA 


TGCTATAAAG 


TTTTTCATTT 


CCCATGATAA 


CGACATCAAT 


7080 


GGCACGTTCA 


TCTTCGTAGT 


CAAAGTGATT 


TTGACGAAGA 


ACAGAGAGAC 


GTTCATCTGG 


7140 


ACCAAGAGAG 


ATGTGACCAG 


TACTAGGTTC 


GATATCTCCA 


GCTAAAATTT 


TTAAAAAGGT 


7200 


TGATTTTCCG 


GCACCATTAG 


CACCCATTAA 


TCCGTAAGTA 


TTTCCTTCTG 


TAAATTTGAT 


7260 


ATTGACATCA 


TCAAAAAGTT 


TGCGATCACT 


AAAACGTAGT 


GAAACATCAG 


ATACTGTAAG 


7320 
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CAATGTTTTT CTCCTATATG TGTAATATAT TTATTCTACT AGAAAATACA CAAATATTCA 7380 

AATTTTTATT TGTCAATTTT GTCTAAATTA TATTTACAGT ATCCTTTACA CAAATCTCTA 7440 

AAAACCAAGG CTGATTTATT TTGATAAATT ACGGTTATTT CATTAAAAAA ATCCTATAAT 7500 

TGAAAGCACT ATATCCAAGG AGAACAAAAT GACTAAACCC ATTATTTTAA CAGGAGACCG 7560 

TCCAACAGGA AAATTGCATA TTGGACATTA TGTTGGAAGT CTCAAAAATC GAGTATTATT 7620 

ACAGGAAGAG GATAAGTATG ATAT G TTTGT CTTCTTGGCT GACCAACAAG CCTTGACAGA 7680 

TCATGCCAAA GATCCTCAAA CCATTGTAGA GTCTATCGGA AATGTGGCTT TGGATTATCT 7740 

TGCAGTTGGA TTGGATCCAA ATAAGTCAAC TATTTTTATT CAAAGCCAGA TTCCAGAGTT 7800 

GCCTCAGTTG TCTATGTATT ATATGAATCT AGTTTCGTTA GCAC G TTTGC AGCGAAATCC 7860 

AACAGTCAAG ACAGAGATTT CTCAGAAAGG ATTTGGAGAA AGCATTCCGA CAGGATTCTT 7920 

GGTCTATCCA ATCGCTCAAG CAGCTGATAT CACAGCTTTC AAGCCTAATT ATGTTCCTGT 7980 

TGGGACAGAT CAGAAACCAA TGATTGAGCA AACTCCTCAA ATTGTTCGTT CTTTTAACAA 8040 

TCCATATAAC TGTGATGTCT TGGTAGACCC GGAAGGTATT TATCCAGAAA ATGAGAGAGC 8100 

ACGGCGTTTG CCTGGTTTAG ATGGAAATGC TAAAATGTCT AAATCACTAA ATAATGGTAT 8160 

TTATTTAGCT GATGATGCGG ATACTTTGCG TAAAAAACTA ATGACTATGT ATACACATCC 8220 

AC AT CAT ATC CGCGTTGACG ATCCAGGTAA GATTGAGGGA AATATGGTTT TCCATTATCT 8 280 

AGAT G TTTTT GGTCCTCCAG AAGATGCTCA AGAAATTGCT GATATGAAAG AACGTTATCA 8340 

ACGAGGTGGT CTTGGTGATG TGAAGACCAA GCGTTATCTA CTTGAAATAT TAGAACGTGA 8400 

ACTGGGTCCG G 8411 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGCCGTACTC AAGTACAGCC TCCGCTAAGT TTCCTAGTTT GCTCTTTGAT TTTCATTGAG 60 

TATTAGTAAC CAAAATCCGA CCACATAGCC AGCCCCTATG AATATAGCCA TTAAAGCTAG 120 

CATGGAATTT AGGAAATTAA AAACCACCGC AGATACAAAG GTTAGCACAA AAACATTAAA 180 

AGCAATGGTC TCAGAAGCCA AGACTAGAAT ATAGGGTGTC AACCGATCTA AAGTTTTGGA 240 

ATCTAGCAAA AATAAGTGTT TATACATGAT CACCTCCTCT ATGGCTGAAA AGCAAGCCTT 300 



DKicrt«vin. oo< oooi AO i 
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TTCTTTTTTT 


ACCCCAAGAC 


CCTATGTAGA 


ft ft, ft ^"*P*^» m 0+ ft 

AAAGTGAGCA 


AAAACGGGAA 


GCTCGCTACA 


3 go 


ATATTATTGA 


TCACATGCAC 


CGCATAGGAT 


GGATAAATGC 


TCTTGCTATA GCGGGTCAAA 




CCAGCAAAGA 


TGATTCCAAC 


TGTTGCAAAG 


ACGAAGAT AT 


CTAACAGACT 


AGGCAGGCTT 


480 


GAAAAATGAG 


GGAGAGCAAA 


TAAAATAGAA 


GGAAGAAGCA 


AATCAAGACC 


AAATCGCGAA 




TGCTTAAAGA 


AAGCATGTTG 


CAGTAATCCT 


CTATAAATCA 


ATTCTTCCAT 


CAGTGGAACC 


DUU 


AGAAAGAACA 


GGGCTATATA 


AATACCTAGC 


TCTCCAAAGT 


TAGTCCCACT 


ATAACCAATC 


boo 


AATACAGCCC 


AACCTTCCGC 


AGTTGACTGA 


ACATGTTTAG 




GTTAAAAGAG 


720 


ATCTGGAACA 


CTAGCACTAA 


TACTGTCAAA 


ATCGAATACC 


& & h. CLC*f* %. TTT 


111 1 W i & ^*V*/\ 


780 


ATGCGGAAGA 


GATAACCATG 


GCCTGTCTTA 


ACAAGAACCA 






840 


AGTAAACTCA 


AGATATTTTG 


AATCCAGAAT 


AAATTGCCTA 






900 


TACTTTTGGA 


CGATAAGCGT 


CAGCTGAGAA 


AGACTAAATA 


r*f~ h. A & 

WunAAAA 1 ftA 




960 


ACTCCACTTA 


TTTTGAATAG 


AAGTTCATAC 


TTTTTCATAG 


A/v\ I V-V- i 




1020 


TCACCTTGTC 


AGGCTCTACT 


GCTCTAAGAT 


TAAGAAGACA 


ul I lul H 1 1 




1080 


ACCTGACTAC 


T AG AT AAT AG 


ATACATTAAG 


GCATTAAAGA 


» i*pr* & & & a T 

LAA I (j/V/VW* A 




1 140 


AATAAAATCA 


ACCTCGCATC 


CAAACCAAGA 


TAAAGTTTGA 




GATf.AGCAAA 


1200 


ACAATTTGAA 


ACCATAAGGT 


TTTTCCAAAA 


ATAAATTTAA 


/\Ol~V^M 111 


AATATCTACT 


1260 


TCCTTGATTT 


TTACCGCCAC 


CCCTTTATTA 


GCAAGAAGGA 


IUV1W tv>wi 


TTCAAACAAA 


1 J*U 


CCACTCTAAA 


GAACAAGCCA 


CCCAATAGAT 


ACGATAGAGA 


TTTGTAAAAA 


TGTCCCTAAA 


1JDU 


AGAATATCCA 


ACACACTACT 


CAAGAAAATA 


ACAAAAAATA 


ATGTGT ATTT 


CATATTAAAT 




ACCTCCATTC 


ATTTATTTCA 


CTAACAATTT 


AATAGAGCCT 


TCTACTCAAA 


TATCCTCTCA 


1500 


GAAAACGATA 


GAAACCTACT 


TTTTATAATA 


CTTCAAGCCC 


CACATGAGCA 


GAAGCGTCAT 


1560 


AAACAAGCAG 


AGAATACACC 


TATATAAGCG 


ATTAGTTGTT 


GATAGAATTC 


TGTTTCTGAA 


1620 


ATACCTCTAT 


ACAAACAAAT 


GACAAACATA 


AAATCTGCCA 


AGCCGATAAA 


CATAACTTGA 


1680 


TTGGTTCTAG 


GACTAACCAA 


ATCATCATTT 


ACTTATATTT 


AAGACTATCT 


CTTTTATTTT 


1740 


AATCTATCTT ACCACTGAAA 


AGCAAGACAG 


GCCAATAATA 


TTTAAAATGA 


ACAGTAACGG 


1800 


GGTTAAGTCT 


CTAAAAAAAT 


TATCTACTGA 


CACTACAAGA 


AATACTATAC 


ATATTATAGT 


1860 


CCAAACTATC 


TTTTTCTTAT 


CCATAATTAT 


TTACTCCTTT 


CCTAACAAAT 


CCAGCTTATC 


1920 


AATCAAGAGC 


GATTTTTAAC 


ATAATGTAGC 


AGCACCCGTT 


GCAACTTTGA 


CAAGTTTAGT 


1980 


ATATCATTGT 


TTTTTAAAAT 


TTTTCATCCA 


AATCTTGAAT 


TGTCATCGAA 


ACATCTTGAA 


2040 
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TTGTTAAAAA 


ATTTAAAAAG 


TAAGCATTAA 


AAACATACTT 


TCCTCTTTAT 


ATTGTATTGA 


2100 

• 


TACCAACTTG 


TTTGTAGACT 


TTTCATCCTG 


CTATCACATA 


TCATTTTGAC 


AGGCGAAACA 


2160 


ATATTAAAGA 


AACTCCCCTG 


TAAATTAAGC 


TAGCAAATAC 


AGGGGAGAAA 


TTTATTTTTT 


2220 


AGAGAGTACT 


ATCCGTATCC 


TTTTTGGAAG 


ATTTTGAAAA 


TATTTTTCTA 


ATTAAGTCAT 


2280 


CCATATAACG 


ACCAAATATA 


CCAACTACTA 


AACCAATAAT 


AAAACTTTTA 


AAATCCATAA 


2340 


TTACCACCAA 


CATATTGCTG 


CATAGGCTAC 


ACCTCCAAGT 


ATAGCTCCAC 


CTGCAGCACC 


2400 


AGTTACACCT 


ATT CCT AT AG 


CAAATGGTCC 


CAATAGAAAT 


GTCAAACCGT 


TGTTGCACAC 


2460 


CCATCAATTG 


CGCCATATGC 


AACCCCTGCT 


CCACAACTAA 


TTTTTCTTCC 


CCAATCAATA 


2520 


TCTCCACCTT 


CAACGCAAGC 


AAGCATTTCA 


TTATCCATAA 


CTGCAAATTG 


TGACATCATT 


2580 


TTTGTATCCA 


TATAGTGTAT 


CACTTTTCAG 


TTACGCAACA 


AGTTTAATAT 


AAAAATTATC 


2640 


AAAAAAACAT 


AGGCAATAAA 


GAGAAAAATT 


AATTTATCAT 


AGATTAGAAA 


TAATATGACA 


2700 


AAACAATTCA 


ATGATGTTAA 


TTCAATAGTC 


TTTTGTTTTT 


TATCGGAGAT 


ACTTATGGAT 


2760 


AGATAAATAA 


GATAGCTTTG 


AAAACCGAAG 


AGAATAATAA 


ACAATATAGC 


CTTCATAAAA 


2820 


TTTAGCTTTC 


ATTTTTATGA 


TGTAGCGGTA 


TAGGCTAAAT 


ATCCACAAAC 


CACTCCTCCT 


2880 


CCAATTCCTC 


CTATTGCAGC 


GCCCCATGGT 


CCTAGAAGTC 


TCCCATATTT 


CACTCCACCC 


2940 


GCTGCACAAC 


CTAAAGCAGC 


AACTACAGCT 


GCTCCTCCGG 


AATTACCTCC 


ATAAACCTCA 


3000 


CTCAGCATTG 


TTTCATTTAT 


ATTACAATAA 


CTATTCATAC 


AAGTCTCCTT 


TTATTAAAAT 


3060 


CCACCCGTTG 


CCCCTGTTAC 


TCCTGCCCAA 


AGATCCACAC 


CAAATTTAGC 


TCCTATGTAT 


3120 


CCACATGCTC 


CCATAAATGG 


TGCTCCAACA 


CCACTCGCAG 


CACAAATAGC 


TGTCCCTAGC 


3180 


CCCCAGCCAC 


CAAAAGCAGC 


ACCACCACCT 


TCTAAGACAT 


TAGTTTGCCA 


ATTATTCTTG 


3240 


CCTCCTTCAA 


TACTAGATAA 


CATAGTTATA 


TC CATTTCAT 


GAAATTGTTC 


CATAATTTTT 


3300 


GTATCCATGA 


CAAATACTCT 


TTTTTATTTT 


TAATTTTTGT 


CTTCTTGTAA 


CTTTGACAAC 


3360 


TTTAGTATAT 


CATCGTTTTT 


TAAAATTTTT 


CATCCAGATT 


TTGAATAGTC 


ATCGAAACGT 


3420 


CTTGAATTGC 


AAAAATTACA 


TTAGACTTCC 


TGCAAAACTA 


GAATCCTAGT 


TCATGATTCA 


3480 


TAATACCAGC 


ACTCAAATTC 


ATTCGTAATC 


CGAAGCGTTT 


ACGATGACTT 


CCATAGGTTG 


3540 


T TG AAAACAT 


TTTAAACGTT 


TTTACTTTGC 


CAAAGATGT7 


CTCAACCTTG 


CTTCTCTCCT 


3600 


TAGATAGCGC 


ATGGTTACAG 


GCTTTATCTT 


CAACTGTTAG 


CGGTTTGAGT 


TTGCTGGATT 


3660 


TACGTGAAGT 


TTGTGCTTGA 


GGATATATCT 


TCATGAGCCC 


TTGATAACCA 


CTGTCAGCCA 


3720 


AGATTTTACC 


AGCTTGTCCG 


ATATTTCTGC 


GACTCATTTT 


GAACAACTTC 


ATATCATGAC 


3780 


AATAGTTCAC 


AGTGATATCC 


AAAGAAACAA 


TTCTCCCTTG 


ACTTCTGACA 


ATCGCTTCAG 


3840 
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TCTTCATAGC 

A W 4> A wA A A%A* 


GTGAAATTTC 


TTTTTACCAG 


AATCATTCGC 


TAATTCTTTT 


TTTAGGGCCA 


3900 


TTGATT^TITA 




ATCAATCATT 


ACCCTGTCCT 


CAGAACTGAC 


AGGAGTTCTT 


3960 


GAAATCGTAA 


CACCACTTTG 


AACAAGAGTT 


ACTTCAACCC 


ATTCCCTCCG 


ACGGAGTAAG 


4020 




CAACACCAAA 


ATCAGCCGCA 


ATTTCTTCAT 


AAGTGCGCTA 


TTCTCGCACA 


4080 


1 A 1 luMUjAU 


*Pfi£5CCATAAG 

1 UV1WWA * AAV* 


AAGGTCTTCT 


AGGCTTAATT 


TAGGTTTTCG 




4X40 


uLu 1 u ill AA 


ui 1 OA 1 AnuL 


* 1 *t 1TTTPTMT 


ACAGCTAGCA 


TCTCTTCAAA 


AGTCGTGCGC 


4200 




UnnunUuV. 1 a 


AAATCCTCCA 

AAA 4 Wv A wWA 


A A * A » 


GTTTACPTGC 

w AAA A A 


TTCATAATTC 


4260 




AU X AAAA A 


AATAACAACX 


GGATAAATCG 


ATCAGGACAG 


TCAAATCGAT 


4320 


1TCTAALAAT 


GTTTTAGAAG 




CP ATTCT ACT 

W * A A * W A A\* A 


TTCAATCTAC 


TATACTATAC 


4380 


CATATTTTCT 


TTCJGCIAoGGA 


A 1L iAl a A i A 


nnnwu A AAVj 


TATTCCAAAA 

l AX A OV-AAAA 


ACACTTACCC 

nwu> A A AWWW 


4440 




TACTTCATTA 


AGCTCTACTT 


1 t lAi AA 1 AW 


TTPAACTCPC 

A A 


ACATGACCAC 


4500 


m ks*a» m/* 

AAGCATGATG 


ATTAAGCAGA 




AA<r&T&Af:rf! 

AA 1 A I AAwWu 


A A 1 A A X lU 1 A 


W A AVA* A A A V* 


4560 


TCCTGCTGTG 


ATACCTCTAT 


ALAAALAAA 1 


AATACA^ATA 

k AWAV» A A A 


AAACCTCTCA 


AGf-CCATGAA 


4620 


LAI KAvj I I VjA 


J 1 Ow 1 1 L 1 AO 


UAW A AAV»WAA 


ATCATCATCT 


TCAAACTCTC 


TTATCCTCAT 


4680 


1 ItCLi ACTG 


nuA i AAAUAu 


A AftLLIUlrtn i 


ACAAGCCAAG 

AVJ AA V_ AAV 


TTAATAACTA 

A A / A V«» A A 


CTAAAAGAAA 


4740 


■■»■*# »«■» till J 

I lGGAAAACT 


ACGuAAAAAT 


1 i AAAAAL 1 V* 


Af^CACATACA 
nkwAwi 1 AVJ A 


AATAGATAAC 

AA A oUA A AAW 


TACAAACAAG 

A HWWWWi AAW 


4800 




luALW 1 AAl*A 


Af* A ATCTTCGC 


AAGGAAGATG 


A * 4 * A 


TAGCAAGAAA 


4860 


AwTTTTCATV 




TTTPTTTTTA 

A A A V» A * A AAA 


TTGATAGCAA 


AATAGATCAT 


AACTCCAATC 


4920 


A*— A 1 AbuU 1 A 


I W i> A 1 AAAA 


1 AvV* A V*A 4 AV» 


CAAGCACTCT 


CCCTAAGCGG 


ATATAGAAAG 


4980 


ATnfiAPATttA 
a 1 ai» a 1 v*a 


1 1, AVXA A AV_ AVJ 


AACGAAAATA 


ATC AGTATTT 




AAGATTTCCT 


S040 


CCTAAATCTC 

S*V* * AAA * w 4 W 


CGATTTATCT 


TACTTGAGCA 


AGAACATTTA 


CACTGCTAGT 


ATAGCACTTA 


S100 


TTTTGACCTT 


GGATCACTCA 


AATCATAAAT 


CGTCATCAAA 


ACCTCTTCAA 


TTGTAAAAAT 


5160 


TAAAAAAGCA 


AGCATGAAAA 


ACATACTTTC 


CTCTTTATAT 


TGTATTGATA 


CCAACTTGTT 


5220 


TGTAGACTTT 


TCATCCTGCT 


ATCACATATC 


ATTTTGACAG 


GCGAAACAAT 


ATTAAAGAAA 


5280 


CTCCCCTGTA 


AATTAACCTA 


G C AAAT AC AG 


GGGAGAAATT 


TATTTTTTAG 


AGAGTACTAT 


5340 


CCCTATCCTT 


TTTGGAAGAT 


TTTGAAAATA 


TTTTTCTAAT 


TAAGTCATCC 


ATATAAGGAC 


5400 


CAAATATACC 


AACTACTAAA 


CCAATAATAA 


AACTTTTAAA 


ATCCATAATT 


ACCACCAACA 


5460 


TGTTGCTGCA 


TAGGCTACAC 


CTCCAAGTAT 


AGCTCCACCC 


GCAGCACCAG 


TTGCTCCACC 


5520 


TTGCCATGTT 


CCTGTTTTAA 


TGCCTAGTTG 


AAGACCTCTT 


GCTGCTCCTC 


CTCCAACACC 


5580 
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TGCTTTGGCA AAATCTCCCC AATTGCATCC GCCACCTTCA ACCCAAGCAA GCATTTCAGT 5640 

ATCCATAACA GAAAATTCTG ACATCATTTT TGTATCCATG ACAAATACTC CTTTTTTAAA 5700 

AAACTAAAAT AAATCAGAAT AGAATCCTCA TAATTTTACT ATAAGTCTTA CCAACTTAGT 5760 

CCCAATTTAT CACCAACCAT ACCTCCTAAG CATGTTAATC CACCCCCAAT TGCACCAATG 5820 

TCTGCTCCAA CAAATGCACC AGCAAGTCCA GCTACTCCTA AAGTGGCCAA ACCTGCTCCA 5880 

GTTCCACCAG TTATAATTCC CGTAGTGACT CCTGTAATCA GTGCATTTTG ACAATCACTG 5940 

GAGCTATACC CCCCTTCAAC TTTCGCAAGC ATTTCAGTAT CCATAACCTC TAACTGTGAC 6000 

AACATTTTTG TATTCATCAT GAATACCTCC TTTTTATTTT CAATTTGTTA CCAAAGTCTT 6060 

AAATTCAATA AACAAATAGA TTTTTTATAG TAT C TTTTTG ATTTTCTTAA AAAAGTATAT 6120 

ACGTCTACTA TCTTCTTAAA GGTAGCAGTA CCTATTTTTT AGTCTAAGAT TTCAATAATC 6180 

TTGAGTATCT AAAATATCTT AATTTCGTTA TTCTCCTTGC AATAAAAAGT TTTACTATAC 6240 

TATTTATTAA CTTGCAGAAA GCAAAAAATA TTAGTAAATA ATAGTTTATA GTTAAGTTTT 6300 

TTATTCCTAC CAATCCATCA ACTAAGTAAA GCATCAACGA TTACATAAAC GATTGATAAT 6360 

ATAATTAAAA TTTTCCTAAC TATCTTATTC TC AT C ATT CT TAGATAACTT TGATATTTTG 6420 

TAAGTAAGTA AATAAGACAG TAAATTAATA GCGATAATAA TACTATATTT AAGAATCATA 64 80 

ATCTTACAAA GAGGACATAA TTCCTGAACC TACACAAATA AGTGTTGCTG CTCCCCCAGT 6540 

TATCGGACCA GTCGCAGCAG CTAATACTAC TGCTCCAATA CAACCACCCA TTGCAGATCC 6600 

TAAATTGCCT CTTCCTCCAC TAACTATTTC GAGTTCTTCA TTATCCATAA CAGAAAATTG 6660 

TTCCATCATT TTTGTATTCA TGACAAATAC TCCT T TTTTC TTTTTTTATT TTTGTCTTGT 6720 

TGTAACTTTG ATAAGTTTAG TATATCA7CC TT TTTTAAAA TTTTTCATCC AGATCTTGAA 6780 

TTGTCATCGA AACGTCTTGA ATTAG C T TXT TTATTTCAAG CCACCTCTAA ATGTTTAAAA 6840 

AAAATAATTT CTAATCACTT TTTTACCATT CAGGAAGTTT T AATG ACT AT TCAAGATTTC 6900 

ATAAAATATG AACTTACTTT TATGACATAA TAGACCTATC CACTATATGA AACGAATTGC 6960 

CAATGACTTC TTATAAACGT ACATTTGTTC CTCAAATAGA TGCCAGAGAC TGTGCTCTCG 7020 

CTGCCTTAGC CTCGATTGCT AAATTCTATG GTTCAGATTT TTCTCTAGCT CACTTGAGAG 7080 

AACTTGCAAA GACCAATAAA GAAGGGACGA CTGCTCTTGG CATTGTAAAA GCCGCTGATG 7140 

AAATGGGCTT TGAAACAAGA CCTGTTCAAG CAGATAAAAC GCTCTTTGAC ATGAGTGATG 7200 

TCCCCTATCC ATTTATCCTT CACGTTAACA AACAAGGAAA ACTCCAACAT TACTATGTTG 7260 

TCTATCAAAC AAAGAAAGAC TATCTGATTA TTGGTGATCC TGACCCTTCT GTAAAAATCA 7320 

CTAAAATGTC AAAAGAACGC TTTTTCTATG AATGGACTGG AGTAGCTATT TTTCTAGCTA 7380 
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CCAAACCCAG 
CCTCTGATTT 
ACTATTATCA 
AATCAGATGA 
CAACAAGTCA 
ATTGATGTGA 
ACACGTCGTA 
TTGGCTTCTA 
GTCTTACTGG 
ATGTTCATCA 
ACTAATTCTA 
TCGCTCACGA 
GAAAAATCCT 
AAATTAGTTC 
ATTTCTATCG 
GAAAATATTA 
AACGAAGTCT 
TTTTTGATGG 
ACCTTAACAG 
AGTGGTTCTG 
GGGCATATTT 
CATATTAATT 
ACCTTGCGCG 
GAAATCCGTC 
GCTGGTCTAT 
TCTCCTGTTT 
GTTATAGATA 
AGTATAGCCG 
GGTA 



CTATCAACCC 
TCAAACAAAA 
ATATAGGTGG 
AATCAACTTT 
TGAGCTTCTC 
TTTTATCCTA 
CAGGAGAAAT 
CCATTCTTTC 
CACAAAACCC 
TCTTTTCTTT 
TGGTTAGCTC 
GTGAAGAAAA 
TTAAGCTCAG 
TGAATATCCT 
GTCAGCTCAT 
TCAACCTCCA 
ATCTACTCGA 
GCGATATTGA 
ATATTAATCT 
GTAAAACAAC 
CCATCAATCA 
ACCTACCCCA 
GTAATCATAT 
AAGACATTGA 
CAGGAGGACA 
TAATACTAGA 
ATCTTATCTC 
AACGAACCAA 



CATAAAGATA 
ATCTCTCATT 
TTCTTACTAT 
AGGAATCATC 
CAGAGATTAT 
TATTCGCCAT 
CATTTCACGA 
TCTTTTTCTG 
TAATCTCTTC 
TATGAAACCT 
TGCCATTATC 
TCGCTATCAA 
TAAATATTCT 
TATCCTATGG 
TACCTTTAAC 
AACCAAACTC 
ATCTGAATTT 
ATTTGATGAC 
CACGATTAAA 
TTTAGCCAAA 
TCAGGATATT 
ACAAGCCTAT 
GATTAGTCAA 
AAGAATGCCT 
GAAGCAACGA 
TGAAGCTACT 
TCTAACTGAT 
CCGTGTCATT 
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AAAAGAATGG 
GCTTACATTG 
CTCCAAGGAA 
TCAGTTGGTC 
CTCCTAACCC 
ATTTTTGAAC 
TTCACAGATG 
GATGTTTCTA 
CTTCTTTCTC 
TTCGAAAAAA 
GAAGATATCA 
AATATAGACA 
ATTTTACAAA 
TTTGGCGCTC 
ACACTTTTTT 
CAATCTGCCA 
CAACTTCAAG 
CTTTCTTATA 
CAAGGAGATA 
ATGATTGTCA 
AAAAACATTG 
ATCTTTAATG 
GAACATATTC 
ATGGGCTATC 
ATCGCTCTCG 
ACCGCTCTTG 
AAAACCATTC 
GTTCTTGACC 



TCTACTAAGC 
TTCTCTCAAG 
TCTTGGATGA 
TCCTTATCAC 
TTCTGACTCA 
TTCCCATGTC 
CTAACTCTAT 
TTCTGATTCT 
TTATTTCCAT 
TGAACCATGA 
ACGGGATTGA 
GCGAATTTGT 
CGACTTTAAA 
AATTAGTCAT 
CTTACTTTAC 
AGCTCGCTAA 
AAAACCCTGT 
AGTATGGTTT 
ACCTTAGCCT 
ATTTCTTTGA 
ATAAAAAAGT 
GCTCTATTTT 
TAAAAGCTTG 
AAACTCAGCT 
CTCGTGCTCT 
ATCTCTTGAC 
TCTTTGTAGC 
AGGGGAAAAT 



AAGCTTCCTT 
CTTATTGGTC 
ATACATTCCA 
CTATATCCTC 
CAGATTAAGT 
TTTCTTTGCG 
TATAGATGCC 
TGTAGGAGGC 
TCCTATATAC 
TGTCATGCAA 
AACTATAAAG 
AGATTATTTG 
GCAGGGAACA 
GTCAACTAAA 
AACTCCTATC 
TAACCGTTTG 
TCATTCACAT 
TCCATGAGAT 
AGTTGGAGTT 
ACCCTACAAA 
CTTGCGCCGT 
GGAAAACTT\ 
TCAAGTAGCT 
CTCTGATGGA 
TTTAACTAAA 
TGAGAAAAAG 
CCATCGTCTC 
CATTGAAGTT 



7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9064 
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<2> INFORMATION FOR SEO ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7780 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
(O) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CTCCATTTTT 


TTGATTTCAT 


AAATAAACAA 


CCTCTCTGTT 


AATTTTCTAT 


AATTATAACG 


60 


ATATCCAAGT 


TACTTGTCAA 


CTGTTTTTTA 


AATTTTTATC 


TCAAAAATAT 


TTTTTCGTTC 


120 


AAAAAAAGGA 


GCCATCAGTT 


GATTTCAAGC 


TCCCTTTTAT 


ACAGAATTAA 


ACTATTTTAT 


180 


AGTTCGACAA 


TCTTACCTGT 


TTCAAAGTAG 


ACAACCCATT 


CACAGATATT 


TTTACCATAG 


240 


TCACCGATAC 


GCTCCAAGTA 


GGAAATAACT 


TGGAAATAAT 


CACGACCCGT 


AACAATGGCT 


300 


TCTGGATTTT 


TCTTAATCTC 


TTCAGTCCCA 


AGGTCACGGA 


TAGTTTCAAA 


ATAGTGGTTA 


360 


ATTTGCTCAT 


CCATGGAGGC 


CACCCGGTAT 


GCGTCGTCAA 


CAGAACCATT 


AAGATAAAGA 


420 


TCAAGTGCTG 


CTTCCACAAC 


GCTTTTAACT 


TCACGTCCCA 


TTTTTTTAAT 


TTCTTCCTCT 


480 


ACAGCTGGAA 


TGCGCTCTTC 


CCCCTTCATA 


CGGATGGTTG 


CCTGGGCAAT 


GCCTACAGCC 


540 


TGATCCCCCA 


TACGCTCCAC 


ATCTCATACA 


GCCTTAAGGA 


CACTCAAGAC 


TGTACCCAAA 


600 


TCTTCAGAGA 


CTGGTTGTTG 


GAGTGCCATC 


ATTTCAAATG 


ATTTCTTTTC 


CAGTTTCACT 


660 


TCGTATTCAT 


TTACTTCTGC 


ATCATCTTCG 


ATGACCTCTT 


TTGCCAGCTC 


ACGGTCATGC 


720 


CTGACAAAAG 


CACGTACCGT 


ACGATTGATT 


TGTCAGAGCA 


CTTCTTGTCC 


CATAGCGTAG 


780 


AACTGGTTAT 


GTAATTTCTC 


TAAATCTTCT 


TCAAATTGAG 


ATCGTAACAT 


CTTTCATCTC 


840 


CTTATCCAAA 


TTTTCCTGTA 


ATATACTCTT 


CCGTTTCCTT 


GTGTTGGGGA 


TCAAGGAACA 


900 


TCTGCTTGGT 


ATCATTAAAT 


TCAATCAAAT 


CTCCATCTAG 


GAAAAATCCT 


CTCTTATCAG 


960 


AGATACGTGA 


AGCTTGCTCC 


ATGGAACGGG 


TTACCAGAAG 


CATGGTGTAC 


TTGTCTTTTA 


1020 


GACCATACAA 


GGTTTCCTCA 


ATTTTACCAG 


CTGAAATCGG 


ATCCAAAGCC 


CAACTTGGCT 


1080 


CATCCAAGAG 


GATGATTTTA 


GGACTACTTG 


CCAAGACACG 


GGCCACGCAG 


ACACGCTGCT 


1140 


GTTGACCACC 


TGACAATCCA 


ATAGCTGAAT 


CATATAGACG 


ATCCTTCACC 


TCATCCCACA 


1200 


TAGAGGCACC 


TTGCAAGGCT 


TTTTCTACGG 


CTTCATCCAG 


AACCTGCTTA 


TCCTTAATTC 


1260 


CATTGATACG 


AAGCCCGTAG 


ACAACATTCT 


CATAGATAGT 


CATAGGGAAA 


GGATTAGGTT 


1320 


GTTGGAAAAC 


CATTCCGATT 


TCCTTACGTA 


ATTCAACCGT 


ATCTGTACCC 


GGACTGTAGA 


1380 


TGTTGTGACC 


ATTGTACACC 


ACGGATCCAG 


TTCTGGTCAC 


CTCTGGATTG 


AGATCTCCCA 


1440 
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TCCGGTTGAG 


AGACTTGAGG 


AGGGTTGACT 


TCCCTGATCC 






1S00 


TAATTTCCTT 


AGGTTGGAAA 


GATAGGGAAA 


CACTATTCAA 


AL>4_W I 1A. Ill 


1 1 r\ i 14\&/v\l 


1560 


AAACGGACAC 


CTCTGATACC 


TCTAAAATCC 


CATCTGTCAT 


AWww 111 ^V. 1 




1620 


AGTGACCAGA 


TACATAGTCA 


TTGGTGGACT 


GTAGCTTGGC 


ATTTTGGAAA 


AlAul lwA_A&* 


1680 


T CTTGTC ATA 


CTCAATCAAA 


TCACCCAAGT 


AAAAGAAGCC 


TGTATAGTCA 




1740 


CAGCCTGCTG 


CATATTATGC 


GTTACAATGA 


TGATGGTAAA 


GTTTTTCTTG 


AGCTCAAAwA 


lOUU 


TGGTCTCTTC 


TAGTTGCATG 


CTCGCAATCG 


GATCCAAGGC 


TGAGGCTGGC 


TCATCCATTA 


1 oou 


AGAGGATATC 


TGGCTTAACA 


GAGATGGCAC 


GAGCGATACA 


GAGACGTTGT 


TGCTGACCAC 




CTGATAAGGT 


CAAGGCTGAC 


TTGTGGAGAT 


CGTCTTTAAC 


CTCATCCCAG 


AGGGCAGCCT 


1980 


GACGAAGGGA 


GGTTTCTACG 


ATTTCATCTA 


GGACTTGCTT 


ATCCTTAACT 


CCAGCACGTT 


->/*<• ft 


CATGCGCAAA 


GGTAATATTA 


CGGTAAATTG 


ACTTAGCAAA 


TGGATTGGGA 


CGTTGAAAAA 


2100 


CCATTCCAAT 


GTGTTTACGC 


ATTTCATAAA 


CGTTGATTTC 


TGGACGGTTG 


ACATCAATTC 


2160 


CACGATAGAG 


AATCTGCCCA 


GTTACTTTAG 


CAATATCAAT 


AGTATCATTC 


ATGCGATTGA 


2220 


GACTCCCTAA 


GTAGGTAGAT 


TTCCCCGATC 


CCGACGGGCC 


AATC AAACCT 


GTAATTTTAT 


2280 


TTCTTTCAAA 


TTGCATATCA 


ATCCCCTTAA 


TGGATTCATT 


TTTACCATAG 


T AAACATGGA 


"i 1 A f\ 

2 J4U 


CATCCTTAGT 


AGAAAGGGCT 


ACTTi"i-rcrr 


CAGGAAAGGT 


AAGGATATGC 


1TCTCATCCC 


"> A Aft 


ACTTATATGT 


TGACATGCCT 


TCTCCTTTAG 


GCAGCGGTTA 


ATTTCTTGTG 


TAGATAGCTT 


24 OU 


CCGAACTTAC 


GAGCTCCAAA 


GTTAAAAATC 


AGGATAAAGA 


TCAGGAGCAC 


AGCGGCAGAA 




CCTGCTGATA 


CAATGGTTCC 


ATCTGGAATA 


GTGCCTTCAC 


TATTGACTTT 




neon 


ACAGCCAAGG 


TTTCTGCTTG 


ACGGAAGATA 


GAGATGGGGC 


TAGTCACACT 






CAGTTAGACC 


AGTCAAGAGC 


TGCCGCCGAT 


TGCCCTGCTG 


TATAGATCAG 


Auv. ivLnw. 1 


« ' WW 


TCGCCAAAGA 


TACGACCAGA 


TGCCAAGACG 


ACACCCGTTA 


CAATACCTGG 


AAGCGCTTCC 


2760 


GCAATAACAA 


CATGAACCAC 


TGTCTCCCAG 


CGAGAAATCC 


CAAGAGCCAG 


ACCAGCCTCA 


2820 


CGTTGGGTAT 


GGTGAACGTG 


TTTCAAACTA 


TCCTCTACAT 


TACGCGTCAT 


CTGAGGCAAG 


2880 


TTAAAGACTG 


TCAAGGCCAA 


CGCACCTGAA 


ATGATTGAAA 


ATCCATACTC 


AAACTGGACT 


2940 


ACAAAGATCA 


AGTAACCAAA 


GAGACCCACC 


ACCACTGATG 


GTAAAGAGGA 


CAAAATTTCA 


3000 


ATACAAGTCC 


GCACAAAGTT 


GGTAACAGGA 


CCTTTTTTAG 


CATATTCAGC 


CAACTAAATC 


3060 


CCAGCTCCCA 


TAGAAAGAGG 


TACAGAAATA 


ATCAAGCTAA 


TGACCAATAG 


GAAAAAGGAA 


3120 


TTGTAAAGCT 


GAATGCCAAT 


CCCACCACCT 


GCTTGAAAAG 


CAGAAGACCT 


TCCACTCAAG 


3180 
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AAAGACCAAG 


AGATATGGGG 


CAAGCCCCGA 


ACCAAGATAT 


AGAGAATCAA 


GGAAGCCAAG 


3240 


ATTGTCACAA 


TCATGCTAGC 


AATCGTATAG 


AGGACAGCTG 


TTGCAAGTTT 


ATCTAATTTC 


3300 


TTAGCGCGCA 


TAATTTTTCT 


TTCCTCTTTC 


TTTCGTAATC 


AATTTAATCA 


CACTGTTAAA 


3360 


AACTAAGCTC 


ATCAAGAGCA 


GTACCAAGGC 


CAGTGACCAG 


AGAACATTAT 


TATTTACAGT 


3420 


TCCCATGACA 


GTGTTCCCAA 


TTCCCATAGT 


TAATATAGAA 


GTTAAAGTTG 


CAGCTGGTGT 


3460 


GGTCAAGGAA 


GTTGGGATAA 


CAGCTGAGTT 


TCCGACAACC 


ATCTGGATAG 


CTAGACCCTC 


3540 


ACCAAAGGCA 


CCCGCCATCC 


CAAAGACCAC 


TGCAGTCAAA 


ATACCAGAAC 


GGGCCGCCTT 


3600 


CAAGATCACA 


CGCCAGATAG 


TCTGCCAGCG 


AGTGGCTCCC 


ATAGCGAAAC 


TGGCTTCACG 


3660 


ATAATAACGA 


GGAACCGCAC 


GCAAGCTATC 


CGTTGTCATA 


AAGGTTACGG 


TCGGCAAAAT 


3720 


CATGACAAAG 


AGGACGGAAA 


TCCCTGACAA 


AATCCCAAAA 


CCAGTCCCAC 


CAAAGACACT 


3780 


GCGAACAAAG 


GGAACGACGA 


CTTGCAAGCC 


AATAAATCCG 


TACACTACTG 


AAGGAATCCC 


3840 


AACCAGGACT 


TCAATAGCTG 


GTTGCAAAAT 


CTTCGCCCCT 


TTTCGTGATA 


CTTCGGTCAT 


3900 


AAAAACTGCT 


GCACCAATAG 


CAAAGGGTGT 


TGCGATAAGG 


GCTGAGAGAA 


TGGTAACGAT 


3960 


AAAGGAACCC 


AAAATCATAG 


GAAGGGCACC 


AAATTCTTTA 


CTAGAAGGAT 


TCCAAGTTCC 


4020 


TCCCAAAAGA 


AAGTCAAAGA 


TATTCACACC 


ATTGACAAAG 


AAGGTCGACA 


AGCCTTTTTG 


4080 


CGCTACGAAA 


ACCAAAATCA 


TGGCCACAAG 


GATGACTATC 


AAAGAAAGAC 


ACGCAAACGT 


4X40 


CAAACCTTTT 


CCTAATTTCT 


CCAGACGAGA 


ATTCTTTGAT 


GGAAGCAACA 


TTTTCTTAGC 


4200 


TAATTCTTCT 


TGATTCATTA 


TTGTCTCCCT 


TCCAACACTG 


TCACACTTCC 


GGCAGCATCT 


4260 


TTTTCAACCT 


TCATTTCCTT 


AATCGGAATA 


TACTTCAATC 


CTTTGACAAT 


CCCTTCTTGG 


4320 


CTCTCATCCG 


AGAGAACAAA 


ATTGAGAAAT 


TCTGCAGCCA 


ACTCATTGGG 


CTGCCCCAAT 


4380 


GTATACATAT 


GCTCATAAGA 


CCACAAGGGC 


CAATTATTGC 


TACTTATATT 


TTCTGGACTT 


4440 


AAGTCATAGC 


CATTCAACTT 


CATCCTTTTG 


ACCGAATCAT 


CTATATAGGT 


AAGAGATAAA 


4500 


TAAGAGATAG 


CTCCTGCACT 


TTTTGATACG 


ATTGATTTTA 


CCGCTCCATT 


TGAATCCTGC 


4560 


TCCTGACTTT 


CCATGGCAGA 


CTGACCTTCC 


ATAATGACAG 


TATCAAAGGT 


AGCACGAGAG 


4620 


CCAGAGCCGG 


CTCCCCGATT 


GATAACAGAG 


ATGGGTAAGT 


CCTTACCACC 


AACCTCTTTC 


4680 


CAATTGGTTA 


CCTCACCTAT 


GAAGATTTGA 


CGAAGTTGCT 


CTGTCGTTAG 


GTTATCAACA 


4740 


TCAACCTCCT 


TATTGACAAT 


CAGAGCCAAG 


CCAGCTACCG 


CGACCTTGTG 


GTCAACAAGA 


4800 


GCAGAAGCAT 


CAATTCCGTC 


TTTTTCCTCA 


GCAAATACAT 


CTGAGTTTCC 


TATATCAACT 


4860 


GCCCCAGACT 


GAACCTGGGA 


CAAGCCTCTA 


CCAGAACCTC 


CCCCTTGGAC 


ATTGACCGTT 


4920 


TTTCCAACAT 


CGATCCTCCC 


AAATTCATCT 


GCCGCTACTT 


CAACCAAGGG. TTGCAAGGCA 


4980 
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GTTGAGCCAA 


CAGCCGllAT 


GCATTVi 




AGCTACCACA 


GCCTACTAAA 


5040 


CAAGCCGTCA 


f*r+r* ft k k k ft 

GCCAAAAAGC 




AnACfAACfT 


4 A * 4 * X* 4 4 * * 


TTTCACTGTT 


5100 


TTTCTCCTCG 


AAAATAA1 TA 


l W/w* A AL 4 w 1 


A A A * » 4 *■ 


m rVSM A A W A A 


TATGAGTTGA 


5X60 


CGCATGAATT 


CTTACCAAAT 


TTCTGCGCAA 


TTGATTATTT 


i\l n 4 #\/\ Inin 


Wv W A r\ A A A « 


5220 


CTCTTTCCTA 


ACCTCCTTTT 


TTCATATGTG 


GATAAAAT tl 


CTTG TCT ATC 




5280 


TTGTCACCCA 


TTATAGTCAT 


TTCGTGTCTC 


TTTTTCCCCT 


1 I I 1 MA luWA 


nUuu/Wv\ AAA 


5340 


CTCTCCTTAG 


ATGATAATCC 


AAAAGCTAGA 


AAGGTATWTv. 




AW 1 l\.vUAy 


3 WW 


ACTAGTTTAC 


AACTAAAAGG 


AAAAGATTCT 


ATTTTATGAG 


AAATCTAGTT 






AAGAACGCTA 


ATAACTAAAC 


TTCTTGTACT 


i iii>i»i«~ ft » » ft 
CTTIviAAAAT 




1 Vj 111 lu 


3 JAU 


AGCTATCTAT 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAG 


#I» ft X ft ft ft ^ ft 

TAAAACTACA 


3 SOU 


TGTAATGGCA 


ATCAACATAT 


CAAGAATCAT 


CCTACTAAAA 


AAATCCATAC 


iwnti^ ft * «f ii ft *n ft 

TTTCACTATA 




ACATAGAATA 


AGATATTTGA 


CTAGCATTTT 


CATTTGAATC 


TGAGGCCTTT 


■V*^ 1 ft ft * ft «P ft ft 

TGGAAAATAA 


c?aa 


T'T TTTCAAAA 


CATTTCCAGT 


AACCTTTGCA 


AAGCCCAAGC 


CATTGCCTTT 


ft ft/*^ ft ft. ft ft^ni 
AACCAAAACT 


3 / OU 


TGCTACCAAC 


C ATTTGGC AG 


ACTTTCTGCC 


AGCTGAACGG 


i i l CTCC AuC 


V. V»V_i\ 1 Ml_ 1 I Aj 


5620 


ACAAACGCTT 


CTTGGCCAAT 


TTCAACCGAC 


TGTTCGACCT 


GACTCGCTTT 


/W\ UwL 1 Ann 


5880 


CCAAGAGCGA 


AACTGGGCTC 


AAAGCGTTTC 


TTCTTAAAAG 


*V* ft ^^f* ft ^ ft 

TACCCAGATG 


LAu 1 LLA 1 lu 




CGAGCAATCT 


TGAGCTTCCA 


TAAATCTGGC 


AAAAGTTCTG 


GCAACAGATA 


AAGCTGGTCT 


6000 


CCAAAAATCT 


GCAAGATACC 


CGGTAGATTG 


A M/WM/* ft ft ft «** 

ACLl iCAAAT 


GGTTTTGGGC 




6060 


CACAAGGCAA 


CTTGTTCACG 


GCTGAGGTTA 


C'i'L 1 * ACTTG 


CCTTAAATTT 




6120 


TTGTTACCCT 


TAAACTGTAG 


ATGGGCAACA 




C TCCCTT AAA 


CTGATGAGCA 


6180 


TACATCCGAG 


CCGTTTCTGG 


/•* ft *^ ft 
LAduTLAA I A 




A A WUt A A UA A 


ATGCTCTACT 


6240 


GGCAACAAGT 


CAAAATCATA 


CTCTTCCACC 


AACCAATTGA 


CAATCTCTTC 


GTTTTCCTCG 


6300 


GGTGCCCAGG 


TACAGGTCGA 


ATAAACCAGA 


TGACCACCTT 


CAGCTAACAT 


GCTCACTGCA 


6360 


TCCTCCAGAA 


TTTCTCTTTG 


CAAGCTAGCA 


CATTGACTCG 


GATAATCTAA 


GCTCCAATAG 


6420 


TCCATAGCAT 


CAGGTTGCTT 


ACGAAACATT 


CCTTCACCAG 


AGCAAGGGGC 


ATCAAGAACG 


6480 


ATTAAGTCAA 


AATAGCCTTT 


AAAGACCTTG 


ACCAAGCGGT 


CGGCAGATTC 


ATTGGTCACC 


6540 


ACGACATTTG 


TCGCTCCAAA 


ACGCTCCATG 


TTTTCAACCA 


AAATCTTAGC 


CCGTTTGCTT 


6600 


GAAATTTCAT 


TGGAAnCAAG 


TACCCCCTCC 


CCTGCTAGAT 


ACCCTGCCAG 


TTGAGTTGAT 


6660 


TTGCCCCCCC 


CTCCACCAGC 


CAAGTCCAAG 


ACCTTCATAC 


CAGGACTGGG 


TTGGGCTACT 


6720 
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TGACCCACCA 


TTTGAGCAGC 


AGCTTCTTGC 


GAATAAACTA 


AACCTGTAGC 


ATGCTCACGC 


6780 


GATTTCCCTG 


AAACCTTCCC 


ATAGTGCCCC 


CAAGGGGTTT 


GAGTAATGGC 


ATCAGAAAAG 


6840 


GAAAGTTGCT 


CTTCTTTTAA 


GGGATTGACC 


CGAAAGCCCG 


AAACCCCTTC 


CTCCTCAAAA 


6900 


CAGGCAAGAA 


AATCTCTTGC 


CTCATCTCCT 


AGTATCTCTT 


TATATTTTTC 


AACAAATCCT 


6960 


TCTGGAAATT 


GCATTTAAGT 


TCTTTTCCTT 


TCGTAAATAT 


AGGACTGAAT 


TTCCTCCTGC 


7020 


ATCTCAAGAC 


GCACCATCAT 


GACCCGCTGT 


CTGGTTTGAA 


AATCAGGAGC 


TTCACCAAAA 


7080 


AGGGTCACAA 


CCCGATAGCC 


CAGACTTTCC 


CCTAAAATAC 


TAGCTGCGGC 


ATAATCCCAT 


7140 


GGTTGCAGAT 


AACTGAGATA 


GCTCAACAAA 


CGCCCTGACA 


AAATCTTGGC 


AAAACTAATG 


7200 


GCCGCACTTC 


CATAGACACG 


AACACCAAGA 


ACCGCTCGGC 


TCAAATCAGC 


CAGCCCCCAT 


7260 


TCATTGGTTT 


CCAGCATACC 


ACTATTCCCT 


GCAATGAGAA 


AATCTCCAAG 


TGGTTTAG'PT 


7320 


TTAAAAGGAG 


CTAGGGACCT 


ATCATTTAGA 


CAAACTGGAA 


ATTCCCCACC 


ACCGTGGTAA 


7380 


CAATCCCCTT 


TGACCACATC 


ATAAATCACA 


CCAAACTCTC 


CCTCACCATT 


TTCAAAATAA 


7440 


GCCATCATAA 


CAGCAAAATC 


TTCCTGCTGG 


GCTACAAAAT 


TATTGGTACC 


ATCAATGGGA 


7500 


TCAATGACCC 


AAACCTTCCC 


CTCTTCAACC 


GAGGCTCGCA 


GACAACCTTC 


TTCAGCACAA 


7560 


ATCTTATCCT 


CAGGATAACG 


GGACAAAATC 


TCACCAACCA 


AGACTTCCTG 


AACTTCTTTG 


7620 


TCCAGTCTGG 


TCACCAAATC 


TGTTGGAGAG 


GACTTGGTTT 


CAACACCCAA 


GTCTTCCTGC 


7660 


ATATCGTCAA 


GAATCTACTG 


ACCTCCTTTC 


TTAACAACCT 


CTTTAGCAAA 


TTCAAATTTA 


7740 


CTTTCCAAGA 


GAAATCTTTC 


CTTCCCCTTT 


TTCTTTGGGG 






7780 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double — 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTAATGATAT AGGAACACCA GGTGACCTGA TGGGACCTCG TAAGCCTATC AACTACTAGC 60 

TGCTAAAGGC TTTAAAGATG GTATGCTACC ATATATCTCA AACCAATACG AAGAAGAAGC 120 

CAAACAAAAG GGCAAGACAA TCAATCTCTA CGGTAAAACA ACAGGTTTGG TTACAGATCA 180 

CTTGGTTTTG GAAAAGGTAT TTAATAACCA ATATCATACT TGGAGTGAGT TTAAGAAAGC 240 

TATGTATCAA GAACGACAAG ATCAGTTTGA TAGATTGAAC AAAGTTACTT TTAATGATAC 300 

AACACACCCT TGGCAAACAT TTGCCAAGAA AACTACAAGC AGTGTAGATG AATTACAGAA 360 
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ft nvn ft ft 0mm*m ft 

ATTAATGGAC 


GTTGCTCTTC 


GTAAGGATGC 


*^*ft ft*"**^* ft.^ 

AGAACACAA1 




f+f* ftT»A t/"<Pk 


420 


CAATCCACAC 


ATAGATACTG 


AAGTCCACAA 


GCTCAAGAGA 


GCAATCTTTA 


h K/^/^f^t 1 ftiW^*l* 




TGACCAAACA 


AATGATTTTA 


GAAGTTCAAT 


TTTTGAGAAT 


AAAAAATAGT 


GTCTACTATT 




AGGAAATAAA 


GTTTAAAAAG 


GTGATGAAGA 


ACAAACCAAG 


ATTCAAGCAG 


GAA i I Wi AC 


OUU 


TGATAATGAA 


GTAAGTTATG 


ATCTTATTTA 


TCAGCAGGAA 


ACTCTTCCTG 


CAACAGGTTC 




ATCAACTTCT 


GAGCTTACAG 


CTTTAGGCCT 


ATTAGCTGTT 


GGTACTTTAG 


TTCTTTTGGT 


720 


TCATAATATG 


ACGGGAACAG 


TTTTTTGCTC 


CCTCTGAAAA 


GTCATCATTT 


GATGGCTTTT 


780 


TTCTATATAG 


GGTAAAAGAT 


AGGGTAAAAG 


GCTATCATCG 


GACAAAATAA 


AGAAGGCATG 


840 


ATATAATATA 


AAGTAGATTT 


CTATGTCATA 


AAACAAGAAC 


TCTTTCGACA 


TCATTCATTT 


900 


GAAAACTCTC 


TATGTTCAAA 


CAATAGTAAA 


ATAAAATAGG 


GGATCTAAAT 


CCTTGCTATG 


960 


AAAGGAAAAA 


ACTCAATGGC 


TACTATTCAA 


TGGTTTCCTG 


GTCACATGTC 


TAAAGCTCGT 


1020 


CGACAGGTGC 


AGGAGAATTT 


AAAATTTGTT 


GATTTTGTGA 


CGATTTTAGT 


AGATGCACGC 


1080 


TTGCCTCTAT 


CTACTCAAAA 


TCCTATGTTG 


ACCAAGATTG 


TTGCTGATAA 


ACCAAAACTC 


1140 


TTGATTTTAA 


ACAAGGCCGA 


CTTGGCTGAT 


CCAGCAATGA 


CCAAGGAATG 


GCGTCACTAT 


1200 


TTTCAATCAC 


AAGGAATCCA 


GACGCTAGCT 


ATCAACTCCA 


AAGAGCAAGT 


GACTGTAAAA 


1260 


GTTGTAACAG 


ATGCGGCCAA 


GAAGCTCATG 


GCTGATAAGA 


TTGCTCCCCA 


GAAAGAACGT 


1320 


GGGATTCAGA 


TTGAAACCTT 


GCGTACTATG 


ATTATCGGGA 


TTCCAAACGC 


TGGTAAATCA 


1360 


ACTCTGATGA 


ACCGTTTCGC 


TGGTAAAAAG 


ATTGCTGTTG 


TTGGAAACAA 


GCCAGGGGTC 


1440 


ACAAAAGGTC 


AACAATGGCT 


TAAAACCAAT 


AAAGACCTGG 


AAATCTTGGA 


TACACCGGGG 


1500 


ATTCTCTGGC 


CTAAGTTTGA 


GGATGAAACT 


GTTGCACTTA 


AGTTGGCATT 


ft ^*tfMft^* ft 

GACTGGAGCT 


1560 


ATCAAAGACC 


AGTTGCTTCC 


TATGGATGAG 


GTTACCATTT 


TTGGTATCAA 


Mft ft fWfimA ft ft -ft 

TTATTTCAAA 


1620 


CAACATTATC 


CAGAAAAGCT 


GGCTGAACGC 


TTCAAACAAA 


TCAAAATTGA 


AGAAGAAGCG 


1680 


CCTGTGATTA 


TTATGGATAT 


GACCCGCGCC 


CTCGGTTTCC 


GTGATGACTA 


TGACCGTTTT 


1740 


TACAGTCTCT 


TCGTGAAGGA 


AGTCCGTGAT 


GGCAAACTCG 


GTAACTATAC 


CTTAGATACA 


1800 


TTGCAAGACC 


TCGATGGCAA 


CGATTAAAGA 


AATCAAAGAA 


TTCCTTGTGA 


CAGTCAAGGA 


1860 


GTTAGAAAGC 


CCTATTTTTT 


TAGAGCTTGA 


AAAGGATAAT 


CGCTCAGGAG 


TTCAAAAGGA 


1920 


AATCAGCAAG 


CGTAAAAGAG 


CCATTCAAGC 


TGAATTAGAT 


GAAAATTTCC 


GCTTGGAATC 


1980 


CATGCTTTCT 


TATGAAAAAG 


AACTTTATAA 


GCAAGGATTG 


ACCTTAATTG 


CAGCTATTGA 


2040 


TGAGGTTGGT 


CGTGGTCCTC 


TTGCTGGTCC 


TGTAGTCGCT 


GCGGCCGTTA 


TTTTATCTAA 


2100 
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AAATTGTAAG 


ATTAAAGGTC 


TCAACGACAG 


256 
CAAGAAAATT 


1 Annnnun 


nn^n I x> luun 


2160 


GATTTTCCAA 


GCCGTTCAAG 


ACCAAGCCTT 


CTCGATTGGA 


ATTwsiTATCA 




2220 


GGTCATCGAC 


CAAGTCAACA 


TCTATGAACC 


<h * +++m ft & ft ft 

AACCAAACTA 


GCCATCCAAG 




3 2 AO 


CCACCTCAGC 


CCTCAACCAG 


AGCACCTTTT 


GATTGATGCC 


ATGAAACTCG 


ACTTGCCCAT 


5140 


TTCACAAACC 


TCCATTATCA 


AAGGAGATGC 


CAACTCCCTC 




LAuLA 1 w iAi 


2400 


AGTACCCAAC 


GTAACACGTG 


ATGAATTGCT 


GAAAGAATAC 


GATCAGCAGT 


TCCCTGGCTA 


9 AAA 


TGATTTCGCT 


ACTAATGCAG 


GATATGGCAC 


AGCTAAACAT 


CTGGAAGGCC 


TCACAAAACT 




AGGACTTACC 


CCAATTCACC 


GAACCAGCTT 


TGAACCCUTr 


AAATCACTGG 


TTTTAGGTAA 




AAAAGAAAGT 


TAATTGAAAG 


GAAATAACAT 


GGAGGAACAG 


TCGGAAATAG 


TCCGTTCTAA 


Z54Q 


GAAACAATTC 


GCCTTTGCAT 


CCAGCACTAT 


ACTATCCCAA 


GTTGGTCGAG 


GAATCATTGT 


Z7QQ 


CGGCCTCATC 


CTTGGAATTA 


TCGTCGGATC 


CTTTCGTTTC 


TTAATTGAAA 


AGGGCTTCCA 


2760 


CCTGATACAA 


GGACTTTATC 


AACATCAAGG 


GTACTTAGTG 


CGCAATCTTT 


TTGTACTGGT 


2820 


TTTGTTTTAT 


ATACTCATCT 


GTTGGCTCAG 


TGCCAAACTA 


ACACGGTCAG 


AAAAAGATAT 


2680 


TAAAGGCTCA 


GGAATTCCTC 


AACTCGAAGC 


CGAACTGAAA 


GGCCTCATGT 


CCCTCAACTG 


2940 


GTGGGGCATT 


CTTTGGAAAA 


AATATGTGCT 


AGGTATTCTT 


GCTATTGCCA 


GTGGACTCAT 


3000 


GCTCGGTCGA 


GAGGGACCCA 


GCATTCAACT 


TGGAGCACTT 


CCTGGTAAAG 


GAATTGCCAA 


3060 


GTGGCTCAAA 


TCCAGTCCAG 


TAGAGGAACG 


TTCCTTGATT 


GCCACTGGAG 


CTGCAGCAGG 


3120 


TTTAGCCGCA 


GCCTTTAATG 


CTCCTATTGC 


ACCACTTCTC 


TTTGTTGTAG 


AAGAAGTCTA 


3180 


TCACCATTTT 


TCGCGCTTTT 


TCTGGGTCTC 


AACTCTAGCA 


CCCAGCATCG 


TAGCAAACTT 


3240 


TGTGTCTCTA 


CTCATGTTCG 


GTTTGACACC 


AGTATTGGAT 


ATGCCACATA 


ACATTCCTCC 


J3Q0 


CATGACCCTA 


GATCAGTATT 


GGATATATCT 


CGTCATGGGA 


ATTTTCCTTG 


GATTTTCAGG 


3360 


TTTTCTCTAT 


GAGAAAGCTG 


TATTAAACGT 


TGGAAGAGTT 


TATGACTTGA 


iwv*^* j" ft ft, ft ft 

TTGGTCAAAA 


J42U 


AATCCATTTC 


GATAGGGCTT 


ATTATCCCAT 


CTTGGCTTTT 


ATCCTTATCA 


TACCAGTCGG 


J4SU 


AATCTTCTTA 


CCTCAAATCA 


TTGGTGGCGG 


AAATCAGCTT 


GTCCTTTCrr 


TAACTGAACA 


3S40 


AAATTTTAGT 


TTCCAAGTTT 


TATTAGCTTA 


CTTTTTAATC 


CGCTTTATTT 
ATTTT AGCTC 


GGAGTATGAT 


3600 
3660 


TAGCTATGGA 


AGTGGACTGC 


CAGGAGGAAT 


TTTCCTCCCC 




GCTTGGTGCC 


TTAGTTGGTG 


TTATCTGTGT 


CAATCTTGCA 


CTTGTCAGTC 


AAGAGCAATT 


3720 


CCCTATATTT 


GTCATTCTAG 


GAATGAGTGG 


CTATTTTGGA 


GCCATATCAA 


AACCTCCCTT 


3780 


AACCGCTATG 


ATCCTCGTAA 


CTGAGATGGT 


AGGAGATATT 


CGCAACCTTA 


TCCCACTTGG 


3840 


TCTTCTCACT 


CTTGTTTCTT 


AT ATT ATC AT 


GGATTTGCTC 


AAAGGTACGC 


CAGTCTATGA 


3900 
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AGCCATGCTG GAAAAAATCC TTCCAGAAGA AGTATCTAGC GAAGGAGAAG TTACACTTAT 
CGAAATACCA CTTTCTGATA AAATTGCTGG GAAACAACTT CATGAACTCA ACTTACCACA 
CAACGTCCTC ATCACAACTC AAGTCCATAA TGCCAACAGC CAAACACTTA ACGGCTCAAC 
CAGAATGTAT CTGGGTGATA TGATTCACCT GGTTATTCCA AAAAGTGAAA TTGGAAAAGT 
CAAAGATTTG TT C TT G TAGT ATGAGTATTT ACATAATTTA TGTTATGTAA ATGATCAGTT 
TGATTTATTT AGAAAACCGA TTCTCAGGAA TGAGATCGGT TATTTTTTAC TGATCAGGAA 
TTTTACATAT AAATAATTGA ACTTTATTAA AAATAAGACT ATAATTAAGT TAGAAATGAT 
AAAGTATAAA GCTAGAAAGG ACTTTACTGT ATCAAATCTG TACACTAAGA TTAAAATCAT 
GAAAAAGAAA ACAATAGCAA TT AT AT AG AG AAATGAAATA GAAATAGGAT AAAACAATCA 
GCACAATCAA ATCAATTTCT ACCAATCTTT TAGAAGTCCA GATGTACTAT TCTAGTTTCA 
ATCTATTATA CAATGTGTTT TCTATCTCAT AGCTCCTTAT ATAGCTCTTC AGTTATGTAG 
TATTAACAGA AGTTTAGTGG GTGAGATTTT TATTATTTTC CTTATTCTGT TTT G TT T GTA 
CGTCTAAGTC TTTTTATCAC TTTGAAAAAC TCCTATAACA TCTTTCCGAA AAACTATAAT 
TTTCTTGAAA AATATACAAG TCTATGCTAT ACTACTAGTA TACTTACTTA TGGAGAAAAT 
ACATGAAACG TCAGATTTTA CTGGAACCAA TCGACAAACT AAAACAACTC ATGCCCTCCT 
AAGTTCTGGA ATACTACCAA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTACGACATC ATGATTAACA GTCATGCGCT ACTACCAACT GACCTATGGC GGATAAAATA 
GTCCGTACGG GATTCGAACC CGTGTTACCG CCGTGAAAAG GCGGT G TCTT AACCCCTTGA 
CCAACGGACC TTCTATCTGT AGCAGATATA ACCATTATAT CAATTTCTTG CTAATTGTCA 
ATCACTTTTG AGATTTTTTC TCTAAAATAT CTTTTAATTT TCTAATTTTT AATCTTGAAA 
TAGGACAACG ATGGTCTTCA TAGAAAACAA TTTCTAAGTT TTTTCCATCA ATTTCTCTGA 
TATTACCTAT ATTTACCAAA AATGACTTGT GAGGAGAATA AAATCGCTGA GTATGTTTGT 
CCTTTTCCTG AATATCTGTC ATGGTACCAT AAAA C TCTTT TGCAAAATTC TTACCAATAA 



3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4*740 

4600 

4820 
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TGCGCAATTT 


ATGAGATACC 


CCTGTTGTTT 


258 
CAATATACAA 


AATATCATGG 


TAAGGAATTT 


480 


TTAAATCATT 


TCCCTTGTAA 


TTGTAGTCGA 


AATAATCTAC 


AACATCTTCA 


TTTTCAAGTA 


540 


ACATACTCTT 


CCTGTAGAAG 


ATATTTTGCT 


CAATTCTCTT 


CTTAAACATC 


TCATCATTGA 


600 


TATCCTTATC 


AACAAAATCT 


AGGGCTGATA 


CCTGCTATTT 


ATAGGTTAGA 


GTCGCAAACT 


660 


CTGATCGACT 


AGTGATAAAG 


ACGATAATAG 


CGTAAGGATT 


GTAATGACGA 


ATGAGCTGAG 


720 


CCACTTCAAA 


TCCCTTTTTC 


TCAATTCCAT 


CAATATCGAT 


ATCTAGGAAA 


TAAAGCTGAT 


780 


TTACTTCATC 


ATTTTCAATG 


TATTCTTCAA 


ATTCACGGAC 


TTTTCCCGTT 


GTCTTGTATG 


840 


ATATTCGAAT 


ATTCGATTCT 


TTCGAAATTT 


CATCCAATAT 


TCTCTCTAGT 


CTCACTTGAT 


900 


GTTCAATAAC 


ATCTTCTAAA 


ATTAAAACTT 


TCATTCAAAT 


TCCCTCTTAA 


ATCTAATGAT 


960 




GTACTGCCTT 


CCATCTCTGT 


TTCTAAAATA 


ATATTGTTGT 


ACTTATCTAG 


1020 


TACTTCTTTC 


ACATTATTTA 


ATCCGACTCC 


GCGATTTCTT 


CCCTTAGTGG 


AGAATCCTAA 


1080 


GGCAAATAGA 


TCTCCTGAAG 


GAGTCATCGT 


CATTTTACAT 


GAATTCTCAA 


TCACAATAAC 


1140 


A \9 A A » A * 


TCCATCTTJU^ 


TAACTGCTAC 


TTCCATCTGC 


TTTTTATAGC 


TATCAGCCGA 


1200 


TCCTTCGACA 


GCATTATTCA 


ATAAAACGCT 


CATGATACGA 


ACCAAATCCA 


ATAGTTCAAT 


1260 


X wU/V^JL 1 IU 


GT AATCCT AT 


CTTTTACTTC 


CAGTGTAAAC 


TCTACACCAT 


TATTTCGAGC 


1320 


ATAGACAATT 


GACTGAGCAA 


CCAAACTTCG 


TAAAGCTCAG 


TCTTCTATGT 


TGTTCAAATC 


1380 


AAACTAACTG 


TACTTATCTG 


AACGCAATTT 


ATGATTTGCT 


TTGACTAAAA 


CTTCATTGTA 


1440 


AATTCTGTCA 


ATTTCCTGTA 


AATTACCACT 


GTCAATTGCC 


ATCTGCATGC 


TGACAAGCAT 


1500 


TCCAGCATAA 


TCATGTCCAA 


AACCACGGAT 


TTCATTATAC 


AGACCAACAA 


TTTCATCTCT 


1560 


GTAATTCTGT 


AAATGTTTCT 


GTTCAAATTT 


CTTCTGCTTC 


AAAGCAATCT 


CTTTCTCCAT 


1620 


TTGAACTTTA 


TGAGAATTCA 


TTGCAAAGAA 


GGTCAAAAGG 


AGAGAGATAA 


AGACAATAGA 


1680 


TGACAAAATA 


CTTCCAAAAC 


TATTCAAATC 


TTTAATCGTA 


CTTACCATAT 


CTCAAACGAA 


1740 


AGATACAATA 


TGTAGCAATA 


GTAAAGCAAA 


AAATACTTTT 


TTCAAGAAAG 


GATAAAGGTA 


1800 


GTCCTTGTCA 


AAATAGGCTA 


GTTCCAAATG 


uAAA 1 Wj 1 An 


ATGATTTTTA 


ATGTAACAAA 


1860 


ATAGCTTAAC 


ACCCTCACAA 


CGAAAAAGAA 


TGGGAAATGA 


TATTGTAAAA 


CAAAATTATC 


1920 


TCCTGTTATA 


GAGGAGAAAA 


TTACGGACAG 


AAAGTTATGA 


GTG CTCTCAT 


ATAAAAGAGA 


1980 


TAGTAGTAAA 


CTTAGGAATA 


GTCCTCTATC 


CCTCTCATAC 


TGTTTCATCC 


ATCGAAAATA 


2040 


GGAATATAAG 


CCCAAAGGAA 


ATAAAAATCT 


TTCAATCCCT 


ATTTTATCTA 


AATATAGAAG 


2100 


ATAAAAGGAA 


AATTCAAGTA 


CTATTTCAGT 


TACTAATGTA 


TAAGCACCAA 


AAACGTATAA 


2160 


TTCTTTTCTA 


TTTATTCGAC 


CTTTACAAAT 


TAAACGGTAA 


CTGTGACTAA 


TAATTAAAAA 


2220 
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ATGAACAATA 


ACTGTCCCAA 


ATCCAAGTAA 


ATCCArTACT 


CTTTCTCCTT 


AT1TCATTAW 


9? An 


TTTTTTCGTA 


GGAAAAGAAA 


ATCAAGGATG 


ATTCTTGAAA 




UXACCTTiA 


« J4U 


ATCTTTTGTA 


AGTCTTTTTC 


CTTCAAAGCT 


ACAAACTGTT 


CCAATTTAAC 


TGTGTTTTTC 


2400 


ATAATAAAAT 


CTCCTAAAAT 


GTTTTTTCTT 


GTAAGCTAAC 


i i AL AAAAAC 


CATTATACAA 


2460 


AATGGAATTT 


CGTTTTAGAT 


AAAATTCTCT 


CAACTGTCAT 


TTTTTTCTCC 


CAAAGTGTAC 


2520 


TTTTTTAAGA 


AAAAAGCCGG 


GAAAATTCCC 


AGCTTTGCTA 


TTATATTGAT 


CCCAGCAGGA 


2580 


TTCGAACCTC 


CCACCCTTCG 


CTTAGAAGGC 


GAATGCTCTA 


TCCAGCTGAG 


CTATGAGACC 


2640 


TAATACAATT 


ATTCTACCAA 


AAATTCAATT 


AAAAGTCAAT 


TTTCTATTTA 


TGGTAGGGGA 


2700 


ATCCCTGCTG 


AATCGTAAAA 


GCGCGATAGA 


TTTGTTCAAC 


AAGAACTAGT 


CTCATTAACT 


2760 


GATGGGGTAA 


GGTTAGGCGA 


CCAAAACTGA 


CAGAAACATT 


GGCTCTATTT 


TTTACAGATG 


2620 


ATGATAATCC 


TAAACTTCCC 


CCAATAATAA 


AAGTAAGAGT 


AGAAAATCCT 


TTTATAGAAG 


2880 


TTTCTTCTAA 


CTGCTTACTA 


AATTCTTCTG 


AGAAGAAAGT 


TTTCCCTTCA 


ATGGCTAACA 


2940 


CAATAACGAA 


ATCACGCTCA 


GCAATTTTTG 


ATAAAATTCT 


CTGACCTTCT 


ATTTCTAAAA 


3000 


TCTTTTGATT 


TTCTCATTCA 


CTGGCCTTAT 


CTGGTGTTTT 


TTCATCTGAT 


AACTC AATC A 


3060 


TTTCAAACTT 


AGCAAATCTA 


GAAATTCGTT 


TTGAATACTC 


TGCGATACCA 


TCTTTTAAAT 


3120 


ACTTTTCTTT 


CAGTTTCCCA 


ACTGTTACAA 


CTTTAATTTT 


CATGACTCTA 


TTCTAACATA 


3180 


TTCTCTATTT 


TTTCACATCT 


TATTCACAAA 


ATAAAAAATA 


GATTTCAATT 


AAGAAAATCA 


3240 


CAATTTCAAA 


AGAGTTATCC 


AC AG TTTGTG 


TAAAACTTTT 


GTG'lTrAAGT 


TATAATTAAG 


3300 


CTAGTCAGTT 


TATACTTTCA 


GTAATTCAAA 


CATATGGAGG 


CAAATATGAA 


ACATCTAAAA 


3 360 


ACATTTTACA 


AAAAATGGTT 


TCAATTATTA 


GTCGTTATCG 


TCATTAGCTT 


TTTTAGTGGA 


1 A *i A 


GCCTTGGGTA 


GTTTTTCAAT 


AACTCAACTA 


ACTCAAAAAA 


GTAGTGTAAA 


A« ft <"ft»l> 11 

CAALlVlAAC 


1 A fl rt 

348U 


AACAATAGTA 


CTATTACACA 


AACTGC CT AT 


AAGAACGAAA 


ATTCAACAAC 


ACAGCCTGTT 


3540 


AACAAAGTAA 


AAGATCCTGT 


TGTTTCTCTT 


ATTACTTATT 


CGGCAAACAG 


ACAAAATAGC 


3600 


GTATTTGGCA 


ATGATGATAC 


TGACACAGAT 


TCTCAGCGAA 


TCTCTAGTGA 


AGGATCTGGA 


3660 


GTTATTTATA 


AAAAGAATGA 


TAAAGAAGCT 


TACATCGTCA 


CCAACAATCA 


CGTTATTAAT 


3120 


GGCGCCAgCA 


AAGTAGATAT 


TCGATTGTCA 


GATGGGACTA 


AACTACCTGG 


AGAAATTGTC 


3*780 


GGAGCTGACA 


CTTTCTCTCA 


TATTGCTGTC 


GTC AAAATCT 


CTTCAGAAAA 


AGTGACAACA 


3840 


CTAGCTGAGT 


TTGGTGATTC 


TAGTAAGTTA 


ACTGTAGGAG 


AAACTGCTAT 


TGCCATCGGT 


3900 


AGCCCGTTAG 


GTTCTGAATA 


TGCAAATACT 


GTCACTCAAG 


GTATCGTATC 


CAGTCTCAAT 


3960 
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AG AAATGT AT CCTTAAAATC GGAAGATGGA CAAGCTATTT CTACAAAAGC CATCCAAACT 4 020 

GATACTGCTA TTAACCCAGG TAACTCTGGC GGCCCACTGA TCAATATTCA AGGGCAGGTT 4080 

ATCGGAATTA CCTCAAGTAA AATTGCTACA AATGGAGGAA CATCTGTAGA AGGTCTTGGT 4140 

TTCGCAATTC CTGCAAATGA TGCTATCAAT ATTATTGAAC AGTTAGAAAA AAACGGAAAA 4200 

GTGACGCCTC CAGCTTTGGG AATCCAGATG GTTAATTTAT CTAATCTCAG TACAAGCGAC 4260 

ATCAGAAGAC TCAATATTCC AAGTAATGTT ACATCTGGTG TAATTGTTCC TTCGGTACAA 4320 

AGTAATATCC CTGCCAATGG TCACCTTGAA AAATACGATG TAATTACAAA AGTAGATGAC 4380 

AAAGAGATTG CTTCATCAAC AGACTTACAA AGTGCTCTTT ACAACCATTC TATCGGAGAC 4440 

ACCATTAAGA TAACCTACTA TCGTAACGGG AAAGAAGAAA CTACCTCTAT CAAACTTAAC 4 500 

AAGAGTTCAG GTGATTTAGA ATCTTAATTG ACATCTATGT AAACAAAGCT TTACATAAGA 4560 

CAAAAGATCT GTTAGTCTAG AATCATGGAA AAATTTGAAA TGATTTCTAT CACAGATATA 4620 

CAAAAAAATC CCTATCAACC CCGAAAAGAA TTTGATAGAG AAAAACTAGA TGAACTAGCA 4680 

CAGTCTATCA AAGAAAATGG GGTCATTCAA CCCATTATTC TTCCTCAATC TC C T G TTATT 4740 

GCTTATGAAA TCcTTGCAGG AGAGAGACGC TATCGGGCTT CACTTTTAGC TGGTCTACGG 4 800 

TCTATCCCAG CTGTTGTTAA ACAGATTTCA GACCAAGAGA TGATGGTCCA GTCCATTATT 4 860 

GAAAATTTAC AGAGAGAAAA TTTAAACCCA ATAGAAGAAC CACGCGCCTA TGAATCTCTC 4 920 

GTACAGAAAG GATTCACCCA TGCTGAAATT GCAGATAACA TGGCCAAGTC TCGTCCATAT 4 980 

ATCAGCAACT CCATTCGTTT ACTTTCCTTC CCAGAACAGA TTCTTTCAGA AGTAGAAAAT 5040 

GGCAAACTAT CACAACCCCA TCCGCCTTCC CTAGTTCCCT TAAATAACGA ACAACAAGAC 5100 

TATTTCTTTC AACGGATTAT AGAAGAAGAT ATTTCTCTAA GGAAATTAGA AGCTCTTCTG 5160 

AGAGAGAAAA AACAAAAGAA ACAGCAAAAA ACTAATCATT TCATACAAAA TGAAGAAAAA 5220 

CAGTTAAGAA AACTACTCGG ATTAGATGTA GAAATTAAAC TATCTAAAAA AGACAGTGGA 5280 

AAAATCATTA TTTCTTTTTC AAATCAAGAA GAATATAGTA GAATTATCAA CAGCCTGAAA 5340 

TAAGGCTGTT CTTTTATTTT TTTATCTCAC AAGGTTATCC ACTATGTTTT TCGATAAAAA 5400 

GCTTAATAAA TCAATAATTT CTTCTTTTAT CCCCAACCTG TGGATAAAGT TTGGTAACAT 5460 

TGTGGATTAT TTTTCACAGC TTGTGCAAAA TTCTTGCTAT CTATGGTAAA ATATCTCTAG 5520 

TATTAAACTT TTAAATAGTA AAGGACGAGA AAGGATTGAA ACAAAAACAA TTTTGGAATC 5580 

CTATATTAGA ATTTGCACAA CAAAGACTGA CTCGATCCAT GTATCATTTC TATGCTATTC 5640 

AAGCTCAACT CATCAAGCTA GAGGAAAATG TTGCCACTAT ATTTCTACCT CGCTCTGAAA 5700 

TCGAAATGGT CTGGGAAAAA CAACTAAAAG ATATTATTGT AGTAGCTGGT TTTGAAATTT 5760 
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ft ftV^ Ik ++0+ jmw^ ft 

ATGACGCTGA 


* ft #vt ft ft 

AATAACTCCC 


CACTATATTT 


TCACCAAACC 


1 ALV 






ft ft ^JMtft^* ft ft ^% ft 

AAGTTGAAGA 


a ^*^*bmi ft ft, ft ft Ml 

AGCTACAAAT 


TTAACTCTTT 


ATAACTATAC 




GTATCTATTC 


SOOW 


CTTATTCAGA 


TACGGGATTA 


AAAGAAAAGT 


ATACCTTTGA 


TAACTTTATT 






GAAATGTTTG 


GGCTGTATCA 


GCCGCTTTAG 


CTGTCTCTGA 


AGATliGGCT 


CTGACCTATA 


conn 


ACCCTCTTTT 


TATCTATGGA 


GGACCAGGCC 


jiMWL^rf^gi ft ft ft ^ 

TTGGTAAGAC 


TCACTTATT^A 


AACGCTATTvi 


ouou 


GAAATGAAAT 


TCTAAAAAAT 


ATTCCTAATG 


CGCGTGTTAA 


ft ft. «n ft *mj~* j"* 

ATATATCCCT 


MAJU ft m ft ^MM 

GCCGAAAGCT 


6120 


TTATTAATGA 


CTTTCTTGAT 


CACCTAAGAC 


TTGGGGAAAT 


GGAAAAGTTT 


AAAAAGACCT 


6180 


ATCGTAGTCT 


tgatcttttg 


TTAATCGATG 


ATATCCACTC 


ACTCAGCGGA 


AAAAAAGTCG 


6240 


CAACTCAGCA 


AGAATTTTTC 


AATACCTTTA 


ACGCCCTTCA 


TGACAAGCAA 


ft ft ft ft ^ft ft MM 

AAAC AG ATTG 


6300 


TCCTAACGAG 


TGATCGTAGT 


CCAAAACATC 


TAGAAGGGCT 


CGAGGAGAGG 


CTTGTCACGC 


6360 


GTTTTAGTTG 


GGGATTGACA 


CAAACTATCA 


CCCCCCCTGA 


CTTTGAAACA 


CGTATTGCCA 


6420 


TTTTACAAAG 


TAAGACGGAA 


CATTTAGGCT 


ACAATTTCCA 


AAGTGATACT 


CTAGAATACC 


6480 


TAGCTGGGCA 


ATTTGATTCA 


AATGTTCGAG 


ATCTTGAGGG 


AGCCATCAAC 


GACATCACTT 


6540 


TAATTGCCAG 


AGTAAAAAAA 


ATCAAGGATA 


TCACTATTGA 


TATTGCTGCA 


CAAGCCATTA 


6600 


GAGCCCGCAA 


ACAAGATGTT 


AG CC AAATGC 


TCGTCATCCC 


AATTGAT AAA 


ATCCAAACTC 


6660 


AAGTTGCTAA 


CTTTTATGGT 


GTTAGTATCA 


AAGAAATGAA 


GGGAAGTAGA 


CGCCTTCAAA 


6720 


ATATTGTTTT 


GGCCCGTCAA 


CTACCCATGT 


ATTTATCTAG 


AGAACTAACA 


GATAATAGTC 


6780 


TTCCAAAAAT 


TGCGAAGGAA 


TTTGGGGGAA 


AAGATCATAC 


C AC AGTC ATT 


CATGCCCATG 


6840 


CCAAAATAAA 


ATCTTTGATT 


GATCAACACG 


ATAATTTACG 


TTTAGAAATT 


0+ ft ft 09\&+ ft m -Mft^ft ft 

GAATCAATCA 


6900 


AAAAGAAAAT 


CAAATAATTT 


GTGGATAACT 


TTTAol I I i l 


TATC1 n i l l 


ft MA/-* ft J"» ft. MMftftft 

ATCCACA • i i 


6960 


TTTAAACAAG 


CTAAAAAACT 


TGATATGACT 


TGTTTAAAGG 


CTGTTTTCCA 


CAuATTTCAC 


7020 


AGACTCTATT 


ATTACTATTA 


TCTTTCTAAT 


ACTAAAAATA 


AATAAAGGAG 


AATCCATGAT 


70B0 


TCATTTTTCA 


ATTAATAAAA 


ATTTATTTCT 


ACAAGCATTA 


AATACTACTA 


AGAGAGCTAT 


7140 


TAGTTCTAAA 


AATGCCATTC 


CTATTTTATC 


AACAGTAAAA 


ATTGACCTGA 


CCAATGAAGG 


7200 


TATTACTTTA 


ATTGGTTCAA 


ATGGTCAAAT 


TTCAATTGAA 


AATTTTATTT 


CTCAAAAAAA 


7260 


TGAAGATGCT 


GGTTTGTTAA 


TTACTTCTTT 


AGGTTCGATC 


CTTCTTGAAG 


CTTCTTTCTT 


7320 


TATCAATGTA 


GTATCTAGTT 


T AC CTGATGT 


AACTCTTGAT 


TTTAAACAAA 


TTGAACAAAA 


7380 


TCAAATTGTT 


TTAACCAGTG 


GCAAATCAGA 


AATTACCCTA 


AAAGGAAAAG 


ATAGCGAACA 


7440 


ATATCCACGA 


ATCCAAGAAA 


TTTCAGCAAG 


CACTCCTTTA 


ATACTTGAAA 


CAAAATTACT 


7500 
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CAAGAAAATT 


ATTAATGAAA 


CAGCCTTTGC 


TGCAACTACA 


CAACAGAGTC 


GTCCGATTTT 


,7560 


AACAGCTGTC 


CACTTCGTAT 


TGAGTCAACA 


CAAAGAGTTA 


AAAACAGTTG 


C AAC AG ACT C 


7620 


TCATCCCCTA 


AGCCAGAAAA 


AATTGACTCT 


TGAAAAAAAT 


AGTGATGATT 


TTGATGTCGT 


7 680 


AATTCCTACC 


CGTTCTCTAC 


GCGAATTTTC 


AGCGGTATTT 


ACAGATGATA 


TCGAAACTGT 


7740 


AGAGATTTTC 


TTTGCCAATA 


ACCAAATCCT 


CTTTACAACC 


gaaWtatta 


GCTTCTATAC 


7800 


TCGTCTCCTA 


GAAGGAAACT 


ATCCTGATAC 


AGATCGCTTG 


ATTCCAACAG 


ACTTTAACAC 


7860 


TACT ATT ACT 


TTTAATGTGG 


TAAACTTACG 


CCAGTCAATG 


GAGCGTGCCC 


GTCTTTTATC 


7920 


AAGTGCGACT 


CAAAATGGTA 


CTGTGAAACT 


TGAAATTAAG 


GATGGGCTTG 


TTAGCGCCCA 


7980 


TCTTCACTCT 


CCAGAAGTTG 


GTAAAGTAAA 


CGAAGAAATC 


GATACTCATC 


AGGTTACTGG 


6040 


TGAAGATTTG 


ACCATTAGTT 


TCAACCCAAC 


TTACTTGATT 


GATTCTCTTA 


AAGCTTTAAA 


8100 


TACCGAAAAC 


GTGACTATTA 


GCTTTATCTC 


AGCTGTTCGT 


CCATTTACTC 


TTGTGCCAGC 


6160 


AGATACTGAC 


GAAGACTTCA 


TGCAGCTCAT 


TACACCACTT 


CGTACAAATT 


AAGTGAAAGA 


8220 


GGTTGAGCCT 


GGCTCCCCTC 


TTTTATGATA 


TAATCGAAAA 


AGAAAAGGAG 


AGTAGTATGT 


8280 


ATCAAGTTGG 


AAATTTTGTT 


GAGATGAAAA 


AATCACACGC 


TTGTACAATC 


AAGTCGACTG 


8340 


GTAAAAAGGC 


TAATCGTTGG 


GAAATTACAC 


GTCTAGCACC 


AGATATCAAA 


ATAAAATGTA 


8400 


GTAATTGTCA 


GCATGTTGTC 


ATGATGGCGC 


GATATGATTT 


TCAGCGAAAA 


ATGAATAAAA 


8460 


TTATTGACTG 


AGAACCCTTA 


GTTAGAGGGT 


TAGCACTTTA 


TCCCTTTTTG 


TGTTATAATA 


8520 


TTAGGGATTG 


AAATGAAAAC 


GGAGAATGAG 


AAATATGGCT 


TTGACAGCAG 


GTATCGTTGG 


8580 


TTTGCCAAAC 


GTTGGTAAAT 


CAACACTATT 


TAATGCAATT 


ACAAAAGCAG 


GAGCAGAGGC 


8640 


AGCAAACTAC 


CCATTTGCGA 


CGATTGATCC 


AAATGTTGGA 


ATCGTGGAAG 


TTCCAGATGA 


8700 


ACGCCTACAA 


AAACTAACTG 


AAATGATAAC 


TCCTAAAAAG 


ACAGTTCCCA 


CAACATTTGA 


8760 


ATTTACAGAT 


ATTGCAGGCA 


TTCTAAAACG 


AGCTTCAAAA GGACAGGGCC 


TAGGGAATAA 


8820 


ATTCTTCGCC 


AATATTCCTG 


AAGTAGATGC 


GATTGTTCAC 


GTAGTTCGTG 


CTTTTGATGA 


8880 


TGAAAATGTA 


ATGCGCGAGC 


AAGGACGTGA 


ACACGCCTTT 


GTAGATCCAC 


TTGCAGATAT 


8940 


TGATACCATT 


AATCTGGAAT 


TGATTCTTGC 


TGACTTAGAA 


TCACTGAACA 


AACGATATGC 


9000 


GCGTGTAGAA 


AAGATGGCAC 


GTACGCAAAA 


AGATAAAGAA 


TCAGTAGCAG 


AATTCAATGT 


9060 


TCTTCAAAAG 


ATTAAACCAG 


TCCTAGAAGA 


CGGGAAATCA 


GCTCGTACCA 


TTGAATTTAC 


9120 


AGATGAGGAA 


CAAAAGGTTG 


TCAAACGTCT 


TTTCCTTTTG 


ACGACTAAAC 


CAGTTCTTTA 


9180 


TGTAGCTAAT 


CTGGACGAGG 


ATGTGGTTTC 


AGAACCTGAC 


TCTATCGACT 


ATGTCAAACA 


9240 


AATTCGTGAA 


TTTGCACCGA 


CAGAAAATGC 


TGAAGTACTC 


GTTATTTCTG 


CCCGTGCTGA 


9300 
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GGAAGAAATT 


TCTGAATTGA 


ATGATGAAGA 


TAAAAAAGAG 


TTTCTTGAAG 


CCATTGGTPT 


7 J DU 


GACAGAATCA 


GGTGTAGATA 


AGTTGACGCG 


TGCAGCTTAC 


CACTTGCTTG CATTGGGAAC 


9420 


TTACTTCACA 


CCTGGTGAAA 


AAGAAGTTCG 


CGCTTGCACT 


1 IVAAHLU i \t 


GTATGAAGGC 




TCCTCAAGCA 


GCTGGTATTA 


TCCACTCACA 


CTTTGAAAAA 




CTGCAGTAAC 


9540 


CATGTCATAT 


GAAGATCTAG 


TGAAATACGG 


ATCTGAAAAG 






9600 


CTTGCGTGAA 


GAAGGAAAAG 


AATATATCGT 


TCAAGATGGC 


C ft ^ ft & ^^/** 

GATATCATvU 


An 1 l\,UVn« 1 A 


9660 


TAATGTCTAA 


AAATTAATAA 


ATGGTGTCAA 


TTAGGTTGGA 


ft n n ft ft ft ipii^^ 
AAAAAATTWu 


AACCCTTTTG 


9720 


GCTTTTGAAA 


GGAAAAATAA 


ATGACCAAAT 


TACTTCTAGG 




L.wAl>Ovl»Ai A 


9780 


AATATTTTGA 


AACAAAACAC 


AATGTTGGTT 


TTATGTTGAT 


ft gn^* % ft MM & 

TGATCAACTA 


f^f* ft K O » ft It ^ 


9840 


AGAATGTCAC 


TTTTACACAC 


GATAAGATAT 


TTCAAGCTGA 


CCTAGCATCC 


TTTTTCCTAA 


9900 


ATGGAGAAAA 


AATTTATCTG 


GTTAAACCAA 


CGACCTTTAT 


GAATGAAAGT 


GGAAAAGCAG 


9960 


TTCATGCTTT 


ATTAACTTAC 


TATGGTTTGG 


ATATTGACGA 


TTTACTTATC 


ATTTACviATG 


10020 


ATCTTGACAT 


GGAACTTGGG 


AAAATTCGTT 


TAAGAGCAAA 


AGGCTCAGCA 


GGTGGTCATA 


10080 


ATGGTATCAA 


GTCTATTATT 


CAACATATAC 


GAACTCAGGT 


CTTT AACCCT 


O » lAAOA I 1 Li 


10140 


GAATTGGAAG 


ACCTAAAAAT 


GGTATCTCAG 


TTGTTCATCA 


TGTTTTGAGT 


ft ft>' "**ttn« i/™ ft /"* ft 


10200 


GGGATGATTA 


TATCGCTATT 


TTACAGTCTG 


TTGACAAAGT 


TGACGATTCT 


ft ft ft 

GTAAAHAUl 


10260 


ATTTACAAGA 


GAAAAATTTT 


GAGAAAACAA 


TGCAGAGGTA 


*t* ft ft & fl» & & 

TAACUOnTAA 




10320 


TATTAGATTT 


ATTCTCAGAA 


AATGATCAGA 


TTAAAAAATG 


W. ATLnAAn 1 




10380 


AGAAAAGACA 


ACTAATACTT 


GGTTTATCAA 


CATCTACTAA 


GGCTCTTOOA 




10440 


GTTTAGAAAA 


AGAACATAGC 


ATTCTGTTAT 


TGACGTCAAC 


TTATGGAGAA 


GCAGAACCAC 


10500 


TTGTTAGTGA 


TCTTATTTCT 


ATCTTGGGTG 


AGGAACTCGT 


CTATCCATTT 


TTGGTAGATC 


10560 


ATGCTCCTAT 


GGTGGAGTTT 


TTGATGTCTT 


CACAGGAAAA 


AATTATTTCA 


CGGGTTGAAG 


10620 


CCTTGCGTTT 


TTTGACTGAT 


TCATCTAAGA 


AAGCGATTTT 


AGTTTGTAAT 


ATCGCAGCAA 


10680 


GTCGATTGAT 


TTTACCGTCT 


CCCAATGCAT 


TCAAAGATAG 


TATTGTAAAA 


ATCTCAGTTC 


10740 


GTGAAGAATA 


TGATCAACAC 


GCGTTTATCC 


ATCAGTTAAA 


GGAAAATGGC 


TATCGAAAAG 


10800 


TTACTCAAGT 


ACAAACTCAG 


GGCGAATTTA 


CTCTTCGAGG 


AGATATTTTA 


GATATTTTTG 


10860 


AAATATCCCA 


GTTAGAACCT 


TGTCGAATTG 


AGTTTTTTGG 


TGATGAAATT 


GATGGTATCA 


10920 


GGTCATTTGA 


AGTAGAAACA 


CAATTATCGA 


AAGAAAATAA 


GACAGAACTC 


ACT AT CTTT C 


10980 


CAGCTAGTGA 


TATGCTTTTG 


AGAGAAAAGG 


ATTATCAACG 


AGGACAGTCA 


G CTTT AG AAA 


11040 
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AACAAATTTC 


AAAAACTTTA 


TCACCTATTT 


TGAAATCATA 


CCTAGAAGAA 


ATTCTTTCAA 


11100 


GTTTTCACCA 


AAAACAAAGT 


CATGCAGACT 


CTCGGAAGTT 


TTTATCTTTG 


TCCTATGATA 


11160 


AGACATGGAC 


TGTCTTTGAT 


TATATTCAAA 


AAGATACTCC 


AATATTCTTT 


GATGATTATC 


11220 


AAAAATTGAT 


GAATCAGTAT 


GAACTCTTTG 


AAAGAGACTT 


AGCGCAGTAC 


TTTACAGAAC 


11280 


AATTACAGAA 


TAGTAAAGCA 


TTTTCTGATA 


TGCACTATTT 


TTCTGATATT 


GAACAAATCT 


11340 


ATAAAAAACA 


AACTCCAGTG 


ACCTTTTTCT 


CTAATCTTCA 


AAAGGCTTTA 


GGAAATCTCA 


11400 


AATTTGACAA 


AATTTATCAA 


TTCAATCAAT 


ATCCTATGCA 


GG AATTTTT C 


AATCACTTTT 


11460 


CTTTTCTAAA 


AGAAGAAATT 


CAACGATATA 


AAAAAATCGA 


TTACACCATT 


ATTCTGCAGT 


11S20 


CTAGCAATTC 


AATGGGAAGT 


AAAACATTGG 


AGGATATGTT 


AGAGGAATAT 


CAGATTAAAT 


11580 


TCGATTCTAG 


AGATAAGACA 


AATATCTGTA 


AAGAATCTGT 


AAACTTAATA 


GAGGGTAATC 


11640 


TCAGACATGG 


TTTTCATTTT 


GTAGATGAAA 


AGATTTTATT 


GATAACTCAA 


CATGAGATTT 


11700 


TTCAAAAGAA 


ATTAAAGCGT 


CGTTTTCGAA 


CACAACATCT 


TTCAAATGCA 


GAGAGATTAA 


11760 


AAGATTACAA 


TGAACTTGAA 


AAAGGGGACT 


ATGTTGTCCA 


TCATATCCAT 


GGGATTCGTC 


11820 


AATATCTAGG 


AATTGAAACC 


ATTGAAATCA 


AGGGAATTCA 


TCGCGATTAT 


CTCACTGTCC 


11880 


AATACCAAAA 


TGGTGATCAA 


ATTTCTATCC 


CCGTGGAACA 


GATTCATCTA 


CTGTCCAAAT 


11940 


ATATTTCAAG 


TGATCGTAAA 


CCTCCAAAAC 


TCAATAAATT 


AAATGACGGT 


CATTTTAAAA 


12000 


AGGCCAAGCA 


AAACGTTAAG 


AACCAGGTAG 


AGGATATAGC 


TGATGATTTA 


ATCAAACTCT 


12060 


ACTCTGAACG 


TAGTCAGTTG 


AAGGGTTTTG 


CTTTCTCAGC 


TGATGATGAT 


GATCAAGATG 


12120 


CCTTTGATGA 


TGCTTTCCCT 


TATGTTGAAA 


CGGATGATCA 


ACTTCGTAGT 


ATTGAGGAAA 


12180 


TCAAGAGCGA 


TATGCAGGGT 


TCTCACCCAA 


TGGATCGACT 


TTTACTTGGG 


GATGTTGGTT 


12240 


TTGGAAAGAC 


TGAAGTTCCT 


ATGCGTGCAG 


CCTTTAAAGC 


AGTCAATGAT 


CACAAACAGG 


12300 


TTGTCATTCT 


AGTTCCCACG 


ACGGTTTTAG 


CGCAACAGCA 


CTATACGAAT 


TTTAAGGAAC 


12360 


GATTCCAAAA 


TTTTGCAGTT 


AATATTGATG 


TGTTGAGTCG 


CTTTAGAAGT 


AAAAAAGAGC 


12420 


AGACTGCAAC 


ACTTGAAAAA 


TTGAAAAACG 


CTCAAGTCGA 


TATTTTGATT 


GGAACACATC 


12480 


GTGTTTTGTC 


AAAAGATGTT 


GTGTTTGCTG 


ATTTGGGCTT 


GATGATTATT 


GATGAGGAAC 


12S40 


AGCGATTTGG 


TGTCAACCAT 


AAGGAAACTT 


TGAAAGAACT 


GAAGAAACAA 


GTCGATGTCC 


12600 


TAACCTTGAC 


CGCTACGCCA 


ATCCCTCGTA CCCTCCATAT 


CTCTATGCTG 


GCAATCAGAG 


12660 


ATTTATCTGT 


TATTGAAACT 


CCGCCGACTA 


ATCGCTATCC 


TGTTCAGACC 


TATGTTTTGG 


12720 


AAAAGAATGA 


TAGTCTCATT 


CGTGATGCTG 


TCTTGCGTGA 


AATGCAGCGT 


GGAGGTCAAG 


12780 


TTTATTATCT 


TTACAACAAA 


GTTGACACAA 


TTGTTCAGAA 


GGTTTCAGAA 


TTACAGGAGT 


12840 
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TGATTCCGGA 


GGCTTCGATT 


GGATATGTTC 


ATGGTCGAAT 


GAGTGAAGTC 


CAGTTGGAAA 


12900 


ATACTCTATT 


AGACTTTATT 


GAGGGACAAT 


ACGATATCTT 


G GTGAC G ACT 


ACTATTATTC 


12960 


AGACAGGGGT 


GGACATTCCA 


AATGCTAATA 


CTTTATTTAT 


TGAAAATGCG 


GACCATATGG 


13020 


GCTTGTCAAC 


CTTATATCAG 


TTAAGAGGAA 


GAGTCGGTCG 


TAGTAATCGT 


ATTGCTTATG 


13080 


CTTATCTCAT 


GTATCGTCCA 


GAAAAATCAA 


TCAGTGAAGT 


CTCTGAAAAG 


AGATTAGAAG 


13140 


CGATTAAAGG 


ATTTACAGAA 


TTGGGCTCTG 


GCTTTAAGAT 


TGCAATGCGA 


GATCTTTCGA 


13200 


TTCGTGGAGC 


AGGAAATCTT 


TTAGGAAAAT 


CCCAGTCTGG 


TTTCATTGAT 


TCTCTTGGTT 


13260 


TTGAATTCTA 


TTCGCACTTA 


TTAGAGGAAG 


CTATTGCTAA 


ACGAAACGGT 


AATGCTAACG 


13320 


CTAACACAAG 


AACCAAAGGG 


AATGCTGAGT 


TGATTTTGCA 


AATTGATGCC 


TA'l'CTTCCTG 


13380 


ATACTTATAT 


TTCTGATCAA 


CGACATAAGA 


TTGAAATTTA 


CAAGAAAATT 


CGTCAAATTG 


13440 


ACAACCCTGT 


CAATTATGAA 


GAGTTACAAG 


AGGAGTTGAT 


AGACCCTTTT 


GGAGAATACC 


13S00 


CACATGTAGT 


AGCCTATCTG 


TTAGAGATTG 


CTTTGGTCAA 


ATCATACTTG 


GACAAGGTCT 


13560 


TTGTTCAACG 


TGTGGAAAGA 


AAACATAATA 


AAATTACAAT 


TCAATTTGAA 


AAAGTCACTC 


13620 


AACCACTCTT- 


TTTAGCTCAA 


GATTATTTTA 


AACCTTTATC 


CGTAACGAAC 


TTAAAAGCAG 


13660 


GCATCGCTGA 


GAATAAGGCA 


TTAATGGAGC 


TTGTATTTGA 


TGTCCAAAAT 


AAGAAAGATT 


13740 


ATGAAATTTT 


AGAAGGTTTG 


CTGATTTTTG 


GAGAAACTTT 


ATTAGAGATA 


AAAGAGTCTA 


13800 


AGGAAGAAAA 


TTCCATTTGA 


TATTTTTCTT 


CTATAAAATA 


GATAAAAATG 


GTACAATAAT 


13860 


AAATTGAGGT 


AATAAGGATG 


AGATTAGATA 


AATATTTAAA 


ACTATCGCCA 


ATTATCAAGC 


13920 


GTCGTACAGT 


CGCAAAGGAA 


GTAGCACATA 


AAGGTAGAAT 


CAAGGTTAAT 


CGAATCTTGG 


13980 


CCAAAAGTTC 


AACGGACTTG 


AAAGTTAATG 


ACCAAGTTGA 


AATTCCCTTT 


GGCAATAAGT 


14040 


TGCTGCTTGT 


AAAAGTACTA 


GAGATGAAAG 


ATAGTACAAA 


AAAAGAAGAT 


GCAGCAGGAA 


14100 


TGTATGAAAT 


TATCAGTGAA 


ACACGGCTAG 


AAGAAAATGT 


CTAAAAATAT 


TGTACAATTG 


14160 


AATAATTCTT 


TTATTCAAAA 


TGAATACCAA 


CGTCGTCGCT 


ACCTGATGAA 


AGAACGACAA 


14220 


AAACGGAATC 


GTTTTATGGG 


AGGGGTATTG 


ATTTTGATTA 


TGCTATTATT 


TATCTTGCCA 


14280 


ACTTTTAATT 


TACCGCAGAG 


TTATCAGCAA 


TTACTCCAAA 


GACGTCAGCA 


ATTAGCAGAC 


14340 


TTGC AAACTC 


AGTATCAAAC 


TTTGAGTGAT 


CAAAAGCATA 


ACGAGACAGC 


ATTTGCTACC 


14400 


AAGTTGAAAG 


ATGAAGATTA 


TGCTGCTAAA 


TATACACGAG 


CGAAGTACTA 


TTATTCTAAG 


14460 


TCGAGGGAAA 


AAGTTTATAC 


CATTCCTGAC 


TTGCTTCAAA 


GCTGATAAAA 


TGGAAAATTT 


14520 


ATTAGACCTA 


ATAGAGCAAT 


TTTTGAGTTT 


GTCAGATGAA 


AAGCTCGAAG 


AATTGCCTGA 


14580 
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TAAAAATCAA TTATTGCGTT TACAAGAAGA AAAGGAAAGG AAGAATGCGT AAATTCTTAA 
TTATTTTGTT GCTACCAAGT TTTTTGACCA TTTCAAAAGT CCTTACCACA GAAAAAGAAG 
TCGTCTATAC TTCGAAAGAA ATTTATTACC TTTCACAATC TGACTTTGGT ATTTATTTTA 
GAGAAAAATT AAGTTCTCCC ATGGTTT ATG GAGAGGTTCC TGTTTATGCC AATGAAGATT 
TAGTAGTGGA ATCTGGGAAA TTGACTCCCA AAACAAGTTT TCAAATAACC GACTGGCGCT 
TAAATAAACA AGCAATTCCA GTATTTAAGC TATCAAATCA TCAATTTATA GCTGCGGACA 
AACGATTTTT ATATGATCAA TCAGAGGTAA CTCCAACAAT AAAAAAAGTA TGGTTAGAAT 
CTGA C'T TTAA ACTGTACAAT AGTCCTTATG ATTTAAAAGA AGTGAAATCA TCCTTATCAG 
CTTATTCGCA ACTATCAATC GACAAGACCA TGTTTGTAGA AGGAAGAGAA TTTCTACATA 
TTGATCAGCC TGGATGGGTA GCTAAAGAAT CAACTTCTGA AGAAGATAAT CGGATGAGTA 
AAGTTCAAGA AATGTTATCT GAAAAATATC AGAAAGATTC TTTCTCTATT TATCTTAAGC 
AACTGACTAC TGGAAAACAA GCTGGT AT C A ATCAAGATGA AAAGATGTAT GCACCCAGCG 
TTTTGAAACT CTCTTATCTC TATTATACGC AAGAAAAAAT AAATGAGGCT CTTTATCAGT 
TAGATACGAC TGTAAAATAC GTATCTGCAG TCAATGATTT TCCAGCTTCT TATAAACCAG 
ACGGAAGTGG TAGTCTTCCT AAAAAAGAAG ATAATAAAGA ATATTCTTTA AAGGATTTAA 
TTACGAAAGT ATCAAAAGAA TCTGATAATG TAGCTCATAA TCTATTCGGA T ATT AC ATTT 
CAAACCAATC TGATGCCACA TTCAAATCCA ACATCTCTGC CATTATGGCA GATGATTGGG 
ATCCAAAAGA AAAATTGATT TCTTCTAAGA TGGCCGGGAA GTTTATCGAA GCT ATTT ATA 
ATCAAAATCG ATTTGTGCTA GAGTCTTTCA CTAAAACAGA TTTTGATAGT CAGCCAATTC 
CCAAACGTGT TTCTCTTAAA GTACCTCATA AAATTGGAGA TGCGCATGAA TTTAAGCATG 
ATACGGGTGT TGTCTATGCA GATTCTCCAT TTATTCTTTC TATTTTCACT AACAATTCTG 
ATTATGATAC GATTTCTAAG ATAGCCAAGC AT G TTTATGA GGTTCTAAAA TGAGGGAACC 
AGATTTTTTA AATCATTTTC TCAAGAAGGG ATATTTCAAA AAGCATGCTA AGCCGGTTCT 
ACCTCTTTCT GGTGGATTAG ATTCCATGTT TCTATTTAAG GTATTGTCTA CTTATCAAAA 
AGAGTTAGAG ATTGAATTGA TTCTAGCTCA TGTGAATCAT AAGCAGAGAA TTCAATCAGA 
TTGGGAAGAA AAGGAATTAA GGAAGTTGGC TGCTGAAGCA GAGCTTCCTA TTTATATCAG 
CAATTTTTCA GGAGAATTTT CAGAAGCGCC TGCACGAAAT TTTCGTTATC ATTTTTTTCA 
AGAGGTCATG AAAAAGACAG GTGCG AC AGC TTTAGTCACT GCCCACCATG CTGATGATCA 
GGTGGAAACG ATTTTTATGC GCTTGATTCG AGGAACTCGC TTGCGCTATC TATCAGGAAT 
TAAGGAGAAG CAACTACTCG GAGAGATAGA AATCATTCGT CCCTTCTTGC ATTTTCAGAA 
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AAAAGACTTT 


CCATCAATTT 


TTCACTTTGA 


AGATACATCA 


AATCAGGAGA 


ATCATTATTT 


1 C A A A 

1D440 


TCCAAATCCT 


ATTCGAAATT 


CTTACTTACC 


AGAATTGGAA 


AAAdAAAATC 


CTCGATTTAG 


I03vv 


GGATGCAATC 


TTAGGCATTG 


GCAATGAAAT 


TTTAGATTAT 


GAl i iGGCAA 


TAGCTGAATT 




ATCTAACAAT 


ATTAATGTGG 


AAGATTTACA 


GCAGTTATTT 


TCTTACTCTG 


AGTCTACACA 


16620 


AAGAGTTTTA 


CTTCAAACTT 


ATCTGAATCG 


TTTTCCAGAT 


TTGAATCTTA 


CAAAAGCTCA 


1 6680 


GTTTCCTGAA 


GTTCAGCAGA 


TTTTAAAATC 


TAAAAGCCAG 


TATCGTCATC 


CGATTAAAAA 


16740 


TCGCTATGAA 


TTGATAAAAG 


AGTACCAACA 


GTTTCAGATT 


TGTAAAATCA 


GTCCGCAGgC 


16800 


TGATGAAAAG 


GAAGATGAAC 


TTGTGTTACA 


CTATCAAAAT 


CAGGTAGCTT 


ATCAAGGATA 


16860 


TTTATTTTCT 


TTTGGACTTC 


CATTAGAAGG 


TGAATTAATT 


CAACAAATAC 


CTGTTTCACG 


16920 


TGAAACATCC 


ATACACATTC 


GTCATCGAAA 


AACAGGAGAT 


GTTTTGATTA 


AAAATGGGCA 


16980 


TAGAAAAAAA 


CTCAGACGTT 


TATTTATTGA 


TTTGAAAATC 


CCTATGGAAA 


ACAGAAACTC 


17040 


TGCTCTTATT 


ATTGAGCAAT 


TTGGTGAAAT 


TGTCTCAATT 


TTGGGAATTG 


CCACCAATAA 


17100 


TTTGAGTAAA 


AAAACGAAAA 


ATGATATAAT 


GAACACTGTA 


CTTTATATAG 


AAAAAATAGA 


17160 


TAGGTAAAAA 


ATGTTAGAAA 


ACGATATTAA 


AAAAGT CCT C 


GTTTCACACG 


ATGAAATTAC 


17220 


AGAAGCAGCT 


AAAAAACTAC 


GTGCTCAATT 


AACTAAAGAC 


TATGCAGGAA 


AAAATCCAAT 


17280 


CTTAGTTGGG 


ATTTTAAAAG 


GATCTATTCC 


TTTTATGGCT 


GAATTGGTCA 


AACATATTGA 


17340 


TACACATATT 


GAAATGGACT 


TCATGATGCT 


TTCTAGCTAC 


CATGGTGGAA 


CAGCAAGTAG 


17400 


TGGTGTTATC 


AATATTAAAC 


AAGATGTGAC 


TCAAGATATC 


AAAGGAAGAC 


ATGTTCTATT 


17460 


TGTAGAAGAT 


ATCATTGATA 


CAGGTCAAAC 


TTTGAAGAAT 


TTGCCAGATA 


TGTTTAAAGA 


17S20 


AAGAGAAGCA 


GCITCTGTTA 


AAATTGCAAC 


CTTGTTCGAT 


AAACCAGAAG 


GACGTGTTGT 


17S80 


AGAAATTGAG 


GCAGACTATA 


CTTGCTTTAC 


TATCCCAAAT 


GAGTTTGTAG 


TAGGTTATGG 


17640 


TTTAGACTAC 


AAAGAAAATT 


ATCGTAATCT 


TCCTTATATT 


GGAGTATTGA 


AAGAGGAACT 


17700 


GTATTCAAAT 


TAGAAAGAAT 


AATCTTTAAT 


GAAAAAACAA 


AATAATGGTT 


TAATTAAAAA 


17760 


TCCTTTTCTA 


TGGTTATTAT 


TTATCTTTTT 


CCTTGTGACA 


GGATTCCAGT 


ATTTCTATTC 


17820 


TGGGAATAAC 


TCACGAGGAA CTCAGCAAAT CAACTATACT GACTTGGTAC 


AAGAAATTAC 


17880 


CGATGGTAAT 


GTAAAAGAAT 


TAACTTACCA 


ACCAAATGGT 


AGTGTTATCG 


AAGTTTCTGG 


17940 


TGTCTATAAA 


AATCCTAAAA 


CAAGTAAAGA 


AGAAACAGGT 


ATTCAGTTTT 


TCACGCCATC 


18000 


TGTTACTAAG 


GTAGAGAAAT 


TTACCAGCAC 


TATTCTTCCT 


GCAGATACTA 


CCGTATCAGA 


18060 


ATTGCAAAAA 


CTTGCTACTG 


ACCATAAAGC 


AGAAGTAACT 


GTTAAGCATG 


AAAGTTCAAG 


18120 
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TCCTATATGG 


ATTAATCTAC 


TCCTATCCAT 


TGTGCCATTT 


GGAAWCTAT 


TCTTCTTCCT 


18180 


ATTCTCTATC 


ATGGGAAATA 


TGGGAGGAGG 


CAATGCCCGT 


AATCCAATGA 


GTTTTGGACG 


18240 


TAGTAAGGCT 


AAAGCAGCAA 


ATAAAGAAGA 


TATTAAAGTA 


AGATTTTCAG 


ATGTTGCTGG 


18300 


AGCTGAGGAA 


GAAAAACAAG 


AACTAGTTGA 


AGTTGTTGAG 


TTCTTAAAAG 


ATCCAAAACG 


18360 


ATTCACAAAA 


CTTGGAGCCC 


GTATTCCAGC 


AGGTGTTCTT 


TTGGAGGGAC 


CTCCGGGGAC 


18420 


AGGTAAAACT 


TTGCTTGCTA 


AGGCAGTCGC 


TGGAGAAGCA 


GGTGTTCCAT 


TCTTTAGTAT 


18480 


CTCAGGTTCT 


GACTTTGTAG 


AAATGTTTGT 


CGGAGTTGGA 


GCTAGTCGTC 


TTCGCTCTCT 


18S40 


TTTTGAGGAT 


GCCAAAAAAG 


CAGCACCAGC 


TATCATCTTT 


ATCGATGAAA 


TTGATGCTGT 


18600 


TGGACGTCAA 


CGTGGAGTCG 


GTCTCGGCGG 


AGGTAATGAC 


GAACGTGAAC 


AAACCTTGAA 


18660 


CCAACTTTTG 


ATTGAGATCG 


ATGGTTTTGA 


GGGAAATGAA 


GGGATTATCG 


TCATCGCTCC 


18720 


GACAAACCCT 


TCAGATGTAC 


TTGACCCTGC 


CCTTTTGCGT 


CCAGGACCTT 


TTGATAGAAA 


18780 


AGTATTGCTT 


CGTCGTCCTG 


ATGTTAAAGG 


TCCTGAAGCA 


ATCTTGAAAG 


TTCACGCTAA 


18840 


CAATAACCCT 


TTAGC AG AAG 


ATGTTGATTT 


GAAATTAGTG 


GCTCAACAAA 


CTCCAGGCTT 


18900 


TGTTGGTGCT 


GATTTAGAGA 


ATGTCTTGAA 


TGAAGCAGCT 


TTAGTTGCTG 


CTCCTCGCAA 


18960 


TAAATCGATA 


ATTGATCCTT 


CACATATTGA 


TGAAGCAGAA 


GAT AG ACTT A 


TTCCTGGACC 


19020 


TTCTAAGAAA 


GATAACACAG 


TTTCACAAAA 


AGAACGACAA 


TTGCTTGCTT 


ACCATGAGGC 


19080 


AGGACATACC 


ATTGTTGGTC 


TACTCTTGTC 


GAATCCTCCC 


GTTGTCCATA 


AGGTTACAAT 


19140 


TGTACCACGC 


GGCCGTGCAG 


GCCGATACAT 


GATTGCACTT 


CCTAAAGAGG 


ATCAAATGCT 


19200 


TCTATCTAAA 


GAAGATATGA 


AAGAGCAATT 


GGCTGCCTTA 


ATGGCTCGAC 


GTGTACCTGA 


19260 


ACAAATTATC 


TTTAATCTCC 


AAACCACAGG 


AGCTTCAAAC 


CACTTTCAAC 


AAGCGACACA 


19320 


AATGGCACCT 


GCAATGCTTA 


CAGAGTACGG 


TATGACTGAA 


AAACTTGCCC 


CAGTACAATA 


193BO 


TGAAGGAAAC 


CATGCTATGC 


TTGGTGCACA 


GACTCCTCAA 


AAATCAATTT 


CAGAACAAAC 


19440 


AGCTTATCAA 


ATTGATGAAG 


AGGTTCCTTC 


ATTATTAAAT 


GAGCCACGAA 


ATAAAGCTGC 


19500 


TGAAATTATT 


CAGTCAAATC 


GTGAAACTCA 


CAACTTAATT 


GCAGAAGCAT 


TATTGAAATA 


19S60 


CGAAACATTG 


GATAGTACAC 


AAATTAAAGC 


TCTTTACGAA 


ACAGGAAAGA 


TCCCTGAAGC 


19620 


AGTAGAAGAG 


GAATCTCATG 


CACTATCCTA 


TGATGAAGTA 


AAGTCAAAAA 


TGAATGACGA 


19680 


AAAATAACCC 


TGAGAGAGGC 


TGGAGCCTCT 


CTTTTTTGTC 


CAGTTTACGA 


GCTAAAGGGA 


19740 


ACAGAATGGA 


GAAAATGGAA 


CAAATGTGTT 


TTCTAATCTG 


TTAGACTGTA 


TCTAGAAAGG 


19800 


GGAAAATTAT 


GATTAAAGAA 


TTGTATGAAG 


AACTCCAAGG 


GACTGTGTAT 


AAGTGTAGAA 


19860 


ATGAATATTA 


CCTTCATTTA 


TGGGAATTGT 


CGGATTGGGA 


GCAAGAAGGC 


ATGCTCTGCT 


19920 
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TACATGAATT GATTACTAGA GAAGAAGGAC TGGTAGACGA TATTCCACCT TTAAGGAAAT 19980 

ATTTCAACAC CAAGTTTCGA AATCGAATTT TAGACTATAT CCCTAAACAG GAAAGTCAGA 20040 

AGCGTAGATA CGATAAAGAA CCCTATGAAG AACTGGGTGA GATCACTCAT CGTATAAGTG 20100 

AGGGGGGTCT CTGGCTAGAT GATTATTATC TCTTTCATGA AACACTAAGA GATTATAGAA 20160 

ACAAACAAAG TAAAGAGAAA CAAGAAGAAC TAGAACGCCT CTTAAGCAAT CAACCATTTC' 20220 

GAGGGCGTCA AAGACTATTA AGAGACTTAC GCATTCTCTT TAAGGAGTTT ACTATCCGTA 20280 

CCCACTACTA AGTCATGCAA AAAAAATGAA AAAAATTAGA AAAAGTAGTT GACAAAGTTT 20340 

GAAAAGGCTG TATAATAGTA AGAGTTGAAA ATAACAACTC AGGTCCGTTG GTCAAGGGGT 20400 

TAAGACACCG CCTTTTCACG GCGGTAACAC GGGTTCGAAT CCCGTACCGA CTATGGTATG 204 60 

TTGCGTCAGG ACCACTTGAT GAAAAAAAGT TTAAAAAAAC TTAAAAATCT TCAAAAAAGT 20520 

CTTGACAACC GAAACCAGTT CTGATATACT AATATAGTTG TCCCTTGAGA GAAGCAAGTG 20580 

ACAAAGACCT TTGAAAACTG AACAAGACGA ACCAATGTGC AGGGCGCTAC AACGTAAGTT 20640 

GTAGTACTGA ACAATCAAAA AAACAATAAA TCTCTCACTG ACAGAAATCA GTAAGAACTC 20700 

AAACTTTTTA ATCAGAGTTT GATCCTGGCT CAGGACGAAC GCTCGCGGCG TCCCTAATAC 20760 

ATGCAAGTAG AACGCTGAAG CAGCAGCTTG CTTCTCTCGA TGAGTTGCGA ACGGCTGACT 20820 

AACGCGTAGG TAACCTGCCT GGTAGCGGGG CATAACTATT GGAAACGATA GCTAATACCG 20680 

CATAAGACTA GATGTTGCAT G AC ATTTG CT TAAAACCTGC ACTTCCATCA CTACCACATG 20940 

GACCTGCGTT GTATTAGCTA GTTGGTGGGG TAACCGCTCA CCAAGCCGAC GATACATAGC 21000 

CCACCTGAGA GGGTGATCGG CCACACTGCG ACTGAGACAC GGCCCAGACT CCTACGGGAG 21060 

GCAGCAGTAG GGAATCTTCG GCAATGGACG GAAGTCTGAC CGAGCAACGC CGCGTGAGTG 21120 

AAGAACCTTT TCGGATCGTA AAGCTCTCTT GTAACACAAG AACCACTGTG AGAGTGGAAA 21180 

GTTCACACTG TGACGGTATC TTACCAGAAA GCGACGGCTA ACTACGTGCC ACCAGCCCCG 21240 

GTAATACGTA GGTCCCGAGC GTTCTCCGGA TTTATTGGGC GTAAAGCGAG CCCAGGCGGT 21300 

TAGATAAGTC TGAAGTTAAA GGCTGTGGCT TAACCATA 21338 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6273 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

T C TTTTTAAA GACCCGTGTC TGCATAGACT TTCGGACGCA ACGCTCTATT AGATAATGAA 60 

CTGCCTATAC ACAAGATTTC TAACCTTAGT CGACATGAGC TGAAACCTCT TATTTGTTAA 120 

GTAGTTCACA AAATATTATA CACCTATTTT ATGAATAGTC AACTGTCTTT ACAGTAAAAT 180 

TTTAGAAAAT CATGAAAATT TTCTCTTTCT TTCCATTTTA AGTGACATTC AGTCATTCTC 240 

ACATCAAAAA AGCCCAGACG AAATTGTCTG AGCATTCTTT TATCTAGTCG TTTAAGGAAG 300 

TTGAGTTCAC TATGTTTAAA GTCTCTGTCC CATCATTTCT TCAACAAACC TTGTTCTTGG 360 

AGAAACTCCT TGGCTACTTC CTTTGCTGAC TTGCCTTCAA CACCGACTTC GTAGTTGAGC 420 

TGGCTCATCT GGCTTTCTCT AATCTTACCA GCCAATGTAT TAACAACTCT TTCCAACTCT 4 80 

GGGTCTTTCT TGAGAAGAGC TTCTTTCATG ACTGGAGCCC CTTGATAAGG TGGGAAGAGT 540 

TCCTT G TCAT CTTCCAAGAC CTGTAAATCA TAACGCTCCA ATTCCGCATC AGTCGAATAG 600 

GCATCCGTGA TTTGAATATC CCCTGACTCA ATAGCCTGAT ACCCAAGGCC TGGCTCAATG 660 

GTCGCTACAT TGAGATTGAG ACCATACATT GATTGCAAGC CCTTATTTCC ATCTTCACGG 720 

TCCTTAAACT CGAGTGTAAA ACCTGCCTTC AACTGCCCTT CCA C TTTTTT CAACTCTGAA 780 

ATCGTCTTCA AGCCATATTC TTGAGCAATC TTTTTCGGAA CAGCTACAGC ATAG G TGTTT B4 0 

TGATAAGACA TGGGTTTGAG ATAGGCTACA TGATCCTGCT TAGCAATGCC ATCACGCGCC 900 

ACCTGATAAA CCTGTTCTGC TTCATGACTC ACCTTGGGTG ATGCTTGAAG CAAACTTTCA 960 

GTCACCGTAC CAGTAAATTC AGGATAGATG TCAATATCGC CTTTTTTCAG AGCTTCATAA 1020 

AGGAAGCTTG TCTTCCCAAA ATTCG G TTTA ACAGTCCCAG TCATGCTGCT ATTTTCTTCA 1080 

ATCAGCAACT TATACATATT GGCCAAAATT TCTGCTTCTG CACCTA TT TT CCCAGCAATA 1140 

ACCAA G TTTT CCTTCTCTTT TTGAACCAAA AGAGCTGGAC TATAAGACAG ACCCAGTAAT 1200 

AAAGCCACCA AGGCAAAACC TGAGAAAATC CTCCCTAATT TTGCTTTTTC CATCACTTTT 1260 

AGTAGGAAGT TAAAGGCAAT CGCTAGCACT CCAGAACAAA GTCCCCCAAT CAAAATCAAA 1320 

CTGGCATTAT TACGGTCAAT TCCCAAAAGA ATAAAGGAAC CTAGTCCCCC TGCACCAATC 1380 

AACGCCGCCA AGGTTGCCGT ACCGATAATC AAAACAGCTG CCGTCCGAAT CCCAGACATG 1440 

ATAACAGGCA TGGCGAGTGG AATTTCAAAT TTCTTGAGAC CTTCCCATCT GGTCATCCCA 1500 

AAGGCAATCC CAGCCTCTTG CAGGTTCGGA TCAATTCCCT TCACCCCAGT GATAGTATTT 1560 

TGCAAAATAG GGAAAATCGC ATAAATCACT AGAGCTGTCA AACCCGGCAA GGTCCCAATT 1620 

CCCATCAAAG GGATAAAGAC CCCCAACAAG GCCAGAGACG GGATGGTCTG GAAAATACCT 1680 

GCAATCTGCA AGACCCAGTC GGCCAGCTTC TCATGATACC GAAGAAAAAC AGCCAAGGGA 1740 
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ATCGCAAGCA 


AAATAGCTAG 


TAACAAGGTC 


AAAAGCGACA 


ACTGCAAATG 


TTGAGATAGA 


1800 




AATCACTAAA 


ACGATCCTGA AAAGTTGCAA TTAAATTAGT 


CATGAACACT 


1860 


ACCTCCAAAC 


AAGTCTGCTA 


CAAAGTCTGT 


TGCAGGCGCT 


TTTAAAATTC 


TCTCGGGATT 


1920 




CCAATTTCTC 


CATCCTGCAA GACAGCAATA 


CGGTCCGCCA 


ACTTCAACGC 


1980 




TCATGCGTTA 


CAAAAATCGT 


TCTCATCCCA 


AACTCTTTAT 


GCAATTCTTT 


2040 


Tfyrr&ra Af*r* 


Tttf AACTGTT 


TTCTCGAAAT 


AGCATCCAAG 


GCCGAAAAGC 


CTTCATCCAT 


2100 


OnugAAnn 1 


1 1\JUUV« I WAV 


CAATCATAGC 


TCGGACAATA 


CCGACCCGTT 


GCTGTTCTCC 


2160 


ACCAGATAAT 


TWAU 1 Aw 1 A 


AGCGATGCCC 


ATACTCGGCT 


ACTGGTAAAC 


CAACCTTAGC 


2220 


CAAAAGCTCT 


TCTGTTTTCT 


TCGTAATTTC 


TTCCTTGCTC 


CACCCCTTCA 


TTTCAGGAAT 


2280 


GAGAG CAATA 


11 HA- LuLAn 


CTCTTAGATT 


TGGAAAAAGA 


GCAATAGCCT 


GTAAAACATA 


2340 


ACCAGTAGAA 


AOALuAAul 1 


CACGCTCATC 


ATACTCTTTG 


ATCCGCTTCC 


CATCCATATA 


2400 


AATATTTCCA 


TCAGTTGGTT 


CCAAAAGACG 


CTTAATCATC 


TTGAGCATGG 


TCUTCTTACC 


2460 


TGACCCAGAa 


wWV.(.V» 1 AL 1 A 


AAACCATAAA 


TTCCCCATCC 


TCAATCTCTA 


ACTTGACATC 


2520 


TCTCAAGACA 


TCCTTTTCTG 


TGTAGCGCAG 


TGCTACATTT 


TTGTATTCAA 


TCATTCTTTG 


2580 


TC CTCAATTT 


AAAAL t 


TCGATTGGTC 


AAGTCTTCTA 


CCTTAGGCAT 


AACTTCCTTA 


2640 


TTATCCCAAT 




TTTCCCCTTC 


TCTAAACGGA 


AGATATCGTA 


CTGGGCATAA 


2700 


ClAAUjCWAT 


LAn wAu A 


CTGACCATAG 


CTAACCACAT 


AGTTTCCTTG 


TCCTAAGAGT 


2760 


TuUAnAnLnA 


AO l\.AAAAw 1 


(*:Af*ACTA' T *AT 


TCAGCCACAT 


AGTTTTTATA 


ACCAGCACTT 


2820 




T 1TC ATC ATT 


ATGCTGAATC 


AAATCCTCTC 


CCACATAATC 


ACTCCACTGC 


2880 


IV. l Av»s_ • — v. 






AAGAAACCC Z 


G.W.CCACCTT 


TTTAT7TTCT 


2940 


i 1 1 A 1 


CCAAATCCTT 


GATTTCAAAA 


TCTCCAAAAA 


TTTGATCTAG 


TTGGTCATTT 


3000 


TCAGGTGTTC 


GATAGTAGTC 


AATGACATCC 


CAATGCTCAA 


CAATACAACC 


ATTCTCATCC 


3060 


TCACCGAAAG 


TATCCGTCGT 


CACCCATTGA 


GCTTCTCCAC 


CATTCAGATA 


TTGATGAACA 


3120 


TGAACAAAGA 


CCAGATTGCC 


ATCCTCAATG 


GTGCGGACAA 


TCTTAATCTG 


ACGCTCTGGA 


3180 


TGACGCTCAA 


AGAAATCTGC 


AAAGAAGGCT 


GCAAATCCTT 


CTTTCCCGTC 


AGGAACACCT 


3240 


GTCGAATGTT 


GGATATAGGT 


ATCCCCTACA 


GACTGGGCTT 


GAGCCTCAGC 


AACTCGTCCG 


3300 


TCTTGAATGG 


CATGGATGTA 


TAGCTTGTGA 


GCATTTTTCA 


CTTGTTGTGA 


CATATTCTAA 


3360 


ACCTCATTTC 


CCTTCTCTTT 


CAGATTCGCC 


AAAATTCTTT 


CTTGAAAACC 


TTCAAATTGG 


3420 


TGAATTTCTT 


CCTCTGAAAA 


TCCTTTGTAA 


AAGATAGTAT 


CCAATTTCTG 


ACTGACACGA 


3480 
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TGCCCCACTT 


CTTTCTGGGA 


CTTGCCTAAC 


TCCGTTAAAA 


CTAAATACTT 


CTTACGCTTG 


3540 


TCTTTTCCAC 


ACGGACTAAC 


AATTACAAGC 


TTTTGTTCCT 


CTAGCTTTTT 


TATCATAGTC 


3600 


CTCAGCCTAT 


TATTCGCAAG 


TCCAGTCGCA 


AGCCCGATAT 


CTGTCGCAGT 


TGCGCAGCCA 


3660 


CTTTCACTAT 


TCCATAAAAC 


CGCTAAAATC 


TTGCCCTGTT 


CACCCCTATA 


AAGAGCCTCA 


3720 


GCATCTTGAC 


TCAGTAACTT 


TTGAAAAATC 


CCCCCATTCA 


ACAAACGAAT 


ATGATGGGCT 


3780 


ACCAAATGAC 


CATCTTTCAT 


AACACCTCCA 


ATTTATTTCG 


ATATCGAAAT 


GAATAAAACA 


3840 


ATTGTAACAC 


TCATCGTTCT 


AACTGTCAAC 


TATTTCGATT 


TAGAAATAAT 


TTTTGATAAT 


3900 


TATCCACACC 


ACCATACTCC 


GGCTCAACTA 


ACTTTTAACG 


AGAGTTTCTA 


AACTCCTTCG 


3960 


TCCTCCAGTC 


TACAAAAGCC 


TTCCATTCGT 


ACTATCCTAT 


ATTTTATGAG 


GGGACACATT 


4020 


TTTCCTATCA 


GACCATTTAT 


TTTAAAGATA 


GAAGTAAATC 


ATAATTGCTT 


CCATCTGTTC 


4080 


TTTTATAGTA 


TATTGAAGTT 


AGACTAGAGC 


ACTCTATCTT 


CTAAAACATT 


GATACAAAGC 


4140 


CATTTGAATT 


TCCCAATCAA 


TTTGTTCGTA 


TTTATAGCAT 


TTCGAAACTG 


GAATAGGACA 


4200 


CCATGACTGC 


TAAAAGATTT 


CTATAAATTC 


ATTTAATTTC 


CTCAATCAAT 


TTGTTCATAT 


4260 


CTTATTTCAT 


TCCGCTATAA 


TTTCACCTTA 


CCCTATCTTT 


TTCCTAGCAC 


CCTTCAAACA 


4320 


GCCTATCCCC 


TACCCTTTGA 


CGATTCCTCA 


CTTCGCTCCA 


CTTCCATTAC 


AGAAGTTTCT 


4380 


TCACTACTAT 


GGGCTCGGCT 


GACTTCTCAT 


GATTCCTTGT 


TACT ACT ATT 


TGAACGCTCA 


4440 


CGAGATAGAT 


CTTACAAAAA 


ATGCTTTGAT 


CCACAATCGA 


ATCAAAGCAT 


TTTAAAGAGT 


4500 


TCCTCATACA 


TAAGCGCAGA 


AGTCGCAGTT 


CCTCTGTACT 


TGGCTTCTTC 


TCTTTTGACA 


4560 


AAGCGAGCCA 


AGTTGAGCAA 


CTCAGGTCCT 


CGATGTTTGC 


CATTTACCAC 


CAATTCACGA 


4620 


TTCACCAGGC 


CTG AG AG ACG 


AACTGCCTGC 


AATTGCTCAT 


TTGTAGTAGG 


CAGTTTTTTA 


4680 


GTAGTCTCTA 


GGAGAGCAGC 


AACTAAATCT 


TCACTCAAAT 


CATCTCGAGC 


ATGATTGTAA 


4740 


AGATCTTTTA 


TAAGGCTTTC 


TAGGTTTGGT 


TCTACCATCC 


CTACCACCTC 


CCTTATGGTT 


4800 


TAATAATGTT 


TAATCAAATC 


AACCCTTGAA 


CCATCCAATT 


TCTTCACCAA 


GGCTTGTAAG 


4860 


AAAGCTTGCG 


CTTCTAGGAA 


CTCATCCATT 


GCATAGAGGG 


TTTGCTCAGA 


ATGGATATAA 


4920 


CGAGCGCAGA 


CACCGATAGT 


TGTTGATGGG 


ACACCACCAT 


TTTTCAGATG 


AGCTGCACCT 


4960 


GCATCTGTTC 


CGCCTTTACC 


ACAGTAGTAT 


TGCTACTTGA 


TACCAGCTTC 


TTCAGCCGTT 


5040 


GTCAAAAGGA 


AATCCTTCAT 


CCCTGGGAGA 


AGCAACTGAC 


CTCGATCATA 


GAAACCAATC 


5LO0 


AAGGTTCCAT 


CTCCAATCTT 


GCCTTGACCA 


CCGTAGACAT 


CACCTGCTGC 


TGAGCAATCA 


5160 


ACTGCGAGGA 


AGACTTCTGG 


GTCAAACTTG 


GTTGTAGAGG 


TATGAGCGCC 


ACGCAGACCA 


5220 


AC TTCTTCTT 


GGACGTTAGA 


ACCCAGATAG 


AGTTCATTGC 


CGAGTTTTTG 


ACCCGATAAA 


5280 
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GCTTCAGCTA GCTCCCTTAC CATGAGGACA CCCTAGCGGT TATCCCAAGC TTTTGAGATG 5340 

AT A TT TT T TT CATTGGCTGT CAAAATTGCA GAACTATCTG GTACAATGGT ATCACCAGGA 5400 

CGGATGCCAA AACTTTCTGC CTCAGCCTTG TCCGCAAAAC CACCATCAAA AACGATATCC 5460 

GCAATGGCTG GCATCGTTGG TCCCCCCTTT CCACGAGTCA AATGCGGAGG AACAGAACCT 5520 

GAAATCACAG GAATTTCATG ACCATCACGA GTCAAGAGTT TGAAACCTTG CCTGCTAACC 5580 

ACCATGGGGT TCCAGCCACC GATTTCTACG ACACGGAAGG TACCATCTGG CTTGATTTCC 5640 

CTGACCATAA AACCAACTTC GTCCATATGA GAACCGACCA AGACGCGCGC TGCATCCACA 5700 

GCTTCTGAAT GTTTGATACC AAAAATACCA CCCAAGCCAT CTGTCACCAC TTCATCCACA 57 60 

TGCGGTGTCA ACTTTTCACG AAGATAAGCA CGGACAGGCG CTTCATGACC TGAGACTGCA 5820 

GCAAGTTCTG TTACTTCTTT AATTTTTGAA AATAATGTTG TCATTTCAGT TCCTTCTTTC 5880 

TTTCATCCAT TTTACCACTT TTTATAGGAC AAGGATAGTG GGAAGGTGGA TTTCTAAGTT 5940 

AGTATCTTAC TCCTGCTCTA TCTTAGAAAA GGATAGTATT CTCTTGCATG TAGTGCAAAA 6000 

TCTAGTAAAC ATTCCAAAAT TAACTCGAAT ATTTATTTCC AAACAAAAAA ACAATACACC 6060 

ATCAAAGTTG TTTGGATTTT TCATGAAATT TACAGAAAAT AGTTGACTTC CCTTTCTTCT 6120 

TTCTTTAAAT ATATAGTTCG TTGAGTTTGC AATAGTACGC TGTAGCTGCT AAAACATTTC 6180 

TAGAAATTAA TTTGACTTTC CTAATAGACT TCTTCATATC TTATTTCAAT TTACTATAGT 6240 

ACAAAACTAG AAAAGGAAAA AATCATGACC AGG 6273 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28171 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACAACCTTTT TCAAAAACTC ACCTTGGTAC GGAGATGTTT TGCTTTCTGC TATTATTTTC 60 

GCTTATATTC ATATCAATTT TGCTTTAACT CCTCTTGCTT TTTTCATTTA TCCTAGTGGA 120 

GGTCTTATTT TAGCTCTATT GTATCGCATG ACTAAAAATC TCTACTATCC AATACTAGTT 180 

CATATTCTCA TTAATATCAC TGCCTTCTGG GATGTGTGGT TGCTCCTATT TTCAGGAAGT 240 

TACCTTACTA AAATAATGTC GGAACTTTCC GGCATTTTCT TTTTTCACAA ATAGTCAACG 300 

TTTTTCTTTT CGATATTGTA CTGCTCTGTA TCCAGTTATT TTTTTGAATT GATTTTGAAA 360 



omefWift. ,\hir\ noioooi AO i «. 
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ATAAGGTTGA CTTGAGAAAG GCAGATAGTG AAGATAGTTA AGAACAATAG GATGTTCTTT 420 

TTTCCTTTTT GGAAAACTTC TAAAATATGG TATAATGAAA AGATAAAGAA GTTGGGGGTA 480 

GAAGATGAAC ATTCAACAAT TACGCTATGT TGTGGCTATT GCCAATAGTC GTACTTTTCG 540 

TGAAGCTGCT GAAAAGATGT ATGTTAGTCA GCCGACTCTG TCTATTTCTG TTCCTGATTT 600 

GGAAAAAGAG TTGGGCTTTA AGATTTTCCG TCGCACCAGC TCRGCGACTT TCTTGACCCG 660 

TCCTGGGATG GAATTTTATG AAAAATCGCA AGAATTGGTT AAAGGATTTG ATA T TTTTCA 720 

AAATCAGTAT GCCAATCCTG AAGAACAAAA AGATGAATTT TCT G TTCCTA GCCAGCACTA 780 

TGACTTCTTG CCACCAACTA TTACGGCCTT TTCAGAGCCC TATCCTGACT ATAAGAACTT 840 

CCCTATTTTT GAATCAACTA CTGTTCAAAT ATTAGATGAA GTGGCGCAAG GCCATAGTGA 900 

GATTGGGATT ATCTACCTCA ACAATCAAAA TAAAAAGGGG ATTATGCAAC GGGTTGAAAA 960 

ATTAGGTCTG GAGGTCATCG AATTGATTCC TTTCCATACC CATATTTATC TCCGTGAGGC 1020 

TCATCCTTTA GCCCAGAAAG AGGAATTAGT CATGGAGGAT TTAGCGGATT TACCAACGGT 1080 

TCGTTTCACT CAAGAGAAAG ACGACTACCT TTATTATTCA GAGAACTTTG TCGATACCAG 1140 

CGCTAGCTCA CAGATGTTTA ATGTGACAGA CCGTGCCACC TTGAATGGTA TTTT G GAGCG 1200 

GACGGACGCC TATGCGACAG GTTCTGCATT TTTAGATAGT GACACTGTTA ATGGCATTAC 12 60 

AGTTATTCGT CTCAAGGATA ACCTAGATAA CCGCATGGTC TATGTTAAAC GTCAAGAAGT 1320 

GGAGCTTACT CAAGCTGGGA CTCTCTTCGT AGAAGTCATG CAACAATATT TTGATCAAAA 1380 

GAGGAAATCA TGAAAAAAAG AGCAATAGTG GCAGTCATTC TACTGCTTTT GATTGGGCTG 1440 

CATCAGTTGG TCAAATCCTA TATCCTCCAG CAGATTCCAC TGGGTGAAGT GCGCTCCTGC 1500 

ATCCCCAATT TCGTTAGCTT GACCTACCTG CAAAATCGAG GTGCAGCCTT TTC T ATCTTA 1560 

CAAGATCAGC AGCTGTTATT CGCTGTCATT ACTCTCGTTG TCGTGATAGG TGCCATTT G G 1620 

TATTTACATA AACACATCGA GGACTCATTC TGGATGGTCT TGGCTTTGAC TCTAATAATC 1680 

GCGGGTGCTC TTGCAAACTT TATTCACAGG GTCAGTCACG GCTTT G TTGT GGATATCTTC 1740 

CACCTTGACT TTATCAACTT TGCAATTTTC AATGTGGC AG ATAGCTATCT GACGCTTGGA 1800 

GTCATTATTT TATTGATTGC AATGCTAAAA GAGGAAATAA ATGGAAATTA AAATTCAAAC 1860 

TGGTCGTCTG CGTTTGGATA AGG C TTTGTC AGATTTGTCA GAATTATCAC CTAGTCTCGC 1920 

GAATGAACAA ATTAAATCAG GCCAGGTCTT GGTCAATGGT CAAGTCAAGA AAGCTAAATA 1980 

CACACTCCAA GAGCGTGATG TCGTCACTTA CCATGTGCCA GAACCAGAGG TATTAGAGTA 2040 

TGTGGCTGAG GATCTTCCCC TAGAAATAGT CTACCAAGAT GAGGATCTGG CTGTCGTTAA 2100 

CAAACCTCAG GGAATGGTTG TGCACCCGAG TGCTGGTCAT ACCACTGGAA CCCTAGTAAA 2160 
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TGCCCTCATG 


f* k fT*f k *l* ft ft 

TATCATATTA 


AGGAlTi l\* i u 






w 1 vUnVJUUA *■ 


2220 


TGTTCACCGT 


nTTun inJ\s*\* 


r^tTf* a art 

JK 1 ALU 1 l_Ml*V* 




ATTCCTAAAA 

^\ A & A ^%#ft<^#* 


ACGATGATGC 


22B0 


GCATCTAGCA 


CTTGCCCAAU 


*. ft /*— pf* a A t~^f* a 








2340 


TGTTCATGGA 


ft ft #*I^**V* ft f+^+01\ ft 

AATCTACCTA 


k ***** ft Ht/*^<1VP 

ATGATCGTGG 






fWVSrt A/5HV1 A 




AAAAGACCGT 


% * ft ft ft f+ ft 

AAGAAACAGG 


CTGTAACTGC 


i*& a irrv*A a/* 




CGCGTTTXCA 




CGTCTTGGAA 


CGC TTTGGCG 


ATTATAGCTT 


ft ^*t» ft o ft > • 
AGTAGAGTTl* 








TCATCAAATC 


CGTGTCCACA 


JM*^* ^W%*f* ft iff! ft Mf% 

TGGCTTATAT 


0+f*f+f+f+ ft IH< »i ' ft 

CGGCCATwCA 


GTCGCTGGTG 


k tmf ■ m /■Villus!) ft 




TGGTCCTCGC 


AAGACTTTGA 


AAGGACATGG 


ACAAtitv. rr 


ft tV f*/"* ft ft^*ft 

CATGwCAAGA 


CTTTAGGTTT 


2 D4U 


TACTCATCCG 


AGAACAGGTA 


AGACCTTGGA 


ATTTAAAGCA 


^ ft K ft ft%^^^ ft ^ 

GATATCCCAG 


AGATTTTTAA 




GGAAACCTTG 


GAGAGATTGA 


GAAAGTAAGA 


ATGAAAAAGA 


ft ft JV*#V% ft ft *+*W* ft ^ 

AATTAACTAG 


Mum fk +mr* ft Mftfli 

TTTAGCACTT 


2760 


GTAGGCGCTT 


TTTTAGGTTT 


GTCATGCTAT 


GGGAATGTTC 


ft ft ft ft 

AGGCTCAAGA 


ft ft /^MA^ ft ft 

AAGTTCAGGA 


2820 


AATAAAATCC 


ACTTTATCAA 


TGTTCAAGAA 


GGTGGCAGTG 


ATGCGATTAT 


TCTTGAAAGC 


2880 


AATGGACATT 


TTGCCATCCT 


GGATACAGGA 


GAAGATTATG 


ATTTC CCAGA 


TGGAAGTGAT 


2940 


TCTCGCTATC 


CATGGACAGA 


AGGAATTGAA 


ACGTCTTATA 


AGCATGTTCT 


AACAGACCGT 


3000 


GTCTTTCGT C 


GTTTGAAGGA 


ATTGGGTGTC 


CAAAAACTTG 


ATTTTATTTT 


GCTGACCCAT 


3060 


ACCCACAGTG 


ATCATATTGG 


AAATGTTGAT 


GAATTACTGT 


CTACCTATCC 


ft > mwm > k k 

AGTTGACCGA 


^ 1 on 
3120 


GTCTATCTTA 


AGAAATATAG 


TGATAGTCGT 


ATTACTAATT 


CTGAACGTCT 


ft*f*^^^ftti*ft 
ATCGCATAAT 


1 1 OA 
J1BU 


CTGTATCGCT 


ATGATAAGGT 


TTTACAGACT 


ft. ^ ft ft ft 

GCTGCAGAAA 


AAGGTGTTTC 


AGTTATTCAA 


i o n 


AATATCACAC 


AAGGGGATGC 


ft miwiiili#* ft ^» 

TCATTTTCAG 


iiMMfi^^r 1 ^ ft ^ ft 

TTTGGGGACA 


•/•r »i<n (wnp » 
TGGATAi 1CA 


P^'T^ T IV T ft km 


nrtrt 

J JUU 


TATGAAAATG 


AAACT Z ATTC 


ATC CCCTG AA 


(f mi » » N^ft ft ft ft 




VpAAI Iw^AAl 


J JOv 


TCCTTGATTA 


GCGTGGTGAA 


AGTCAATuliC 


A Af"*A A 11TTT 


/\^W i 1 uwuuu 




3420 


AATGTTCATG 


GAGCAGAAGA 


CAAGTATGGT 


CCTCTCATTG 


GAAAAGTTGA 


TTTGATGAAC 


3480 


TTTAATCATC 


ACCATGATAC 


CAACAAATCA 


AATACCAAGG 


ATTTCATTAA 


AAATTTGAGT 


3540 


CCGAGTTTGA 


TTGTTCAAAC 


TTCGGATAGT 


CTACCTTGGA 


AAAATGGTCT 


TGATAGTGAG 


3600 


TATGTTAATT 


GGCTCAAAGA 


ACGAGGAATT 


GAGAGAATCA 


ACGCAGCCAC 


CAAAGACTAT 


3660 


GATGCAACAG 


TTTTTGATAT 


TCGAAAAGAC 


GGTTTTGTCA 


ATATTTCAAC 


ATCCTACAAG 


3720 


CCCATTCCAA 


GTTTTCAAGC 


TGGTTGGCAT 


AAGAGTGCAT 


ATGGGAACTG 


GTGGTATCAA 


3780 


GCGCCTGATT 


CTACAGGAGA 


GTATGCTGTC 


GGTTGGAATG 


AAATCGAAGG 


TCAATGGTAT 


3840 


TACTTTAACC 


AAACGGGTAT 


CTTGTT AC AG 


AATCAATGGA 


AAAAATGGAA 


CAATCATTGG 


3900 
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TTCTATTTGA CAGACTCTGG TGCTTCTGCT AAAAATTCGA AGAAAATCGC TGGAATCTGG 3960 
TATTATTTTA ACAAAGAAAA CCAGATGGAA ATTGGTTGGA TTCAAGATAA AGAGCAGTGG 4020 
TATTATTTGG ATGTTGATGG TTCTATGAAG ACAGGATGGC TTCAATATAT GGGGCAATGG 4080 
TATTACTTTG CTCCATCAGG GGAAATGAAA ATGGGCTGGG TAAAACATAA AGAAACCTGG 4140 
TACTATATGG ATTCTACTGG TGTCATGAAG ACACGTGAGA TAGAACTTGC TGGTCAACAT 4200 
TATTATCTCG AAGATTCAGG AGCTATGAAG CAAGGCTCCC ATAAAAAGGC AAATGATTGG 4260 
TATTTCTACA AGACAGACGG TTCACGAGCT GTGGGTTGGA TCAAGGACAA GGATAAATGG 4320 
TACTTCTTGA AAGAAAATGG TCAATTACTT CTCAACGGTA AGACACCACA ACGTTATACT 4380 
CTGGATTCAA GTGGTGCCTG GTTAGTGGAT CTTTCGATCG ACAAATCTCC TACAATTAAA 4440 
ACTACAAGTC ATTCAGAAAT AAAAGAATCC AAAGAAGTAG TGAAAAAGGA TCTTGAAAAT 4500 
AAAGAAACGA GTCAACATGA AAGTGTTACA AATTTTTCAA CTAGTCAACA TTTGACATCC 4560 
TCAACTTCAC AAACCTCTGA AACGAGTGTA AACAAATCGG AATCAGAACA CTAGTACAAA 4620 

AGAAGGTTTT AGGGCCTTCT TTTTCCTATC AACTCTTTTC TATTTCCTGT TATTCATCTT 4680 

ATAATGGATA AATATGAATA ATCGGAGTGA GACTATGAAA TACAAACGCA TTGTCTTTAA 4 740 

GGTGGGT ACT TCTTCTCTGA CAAATGAGGA TGGAAGTTTA TCACGTAGTA AGGTAAAGGA 4 800 

TATTACCCAG CAGTTCCCTA TGCTGCACGA GGCTGCTCAT GAGTTGATTT TGGTGTCTTC 4860 

AGGTCCCATT GCGGCTGGTT TTCGAGCCTT AGGATTTAAA AAGCGTCCGA CTAAGATTCC 4920 

TGATAAACAG GCTTCAGCAG CGCTAGGGCA GGGCCTTTTG TTGGAAGAAT ATACAACCAA 4980 

TCTTCTCTTG CCTCAAATCG TTTCTGCACA AATCTTCCTG ACCCAACATC ACTTT C TGGA 5C40 

TAAGCGTCGT TATAAAAATG CCCATCAGGC TTTGTCGGTT TTGCTCAACC CTGGGCCAAT 5100 

TCCTATCATC AATGACAATG ATACTCTCGT TATTGATGAC CTCAAGCTTG GGGACAATGA 5160 

CACTCTAACT GCTCAAGTAG CGGCGATGGT CCAAGCAGAC CTTTTAGTTT TCTTGACAGA 5220 

TGTGGACGGT CTCTATACTG CAAATCCTAA TTCAGATCCA AGAGCCAAAC GCTTGGAGAG 5280 

AATCGAGACC ATCAATCGTG AGATTATTGA TATGGCTGGT GGAGCTGCTT CGTCAAACCG 5340 

AACTGGGGGT ATGTTAACCA AAATCAACGC TGCAACTATC GCGACGGAAT CAGGAGTTCC 5400 

TCTTTATATC TGCTCATCCT TGAAATCAGA TTCCATGATT GAGGCGGCAG AGGAGACCGA 54 60 

GGATGGTTCT TACTTTGTTG CTCAAGAGAA GGGGCTTCGT ACCCAGAAAC AATGCCTTGC 5520 

CTTCTATGCT CAGAGTCAAG GTTCTATTTG GCTTGATAAA GGGGCTGCGG AAGCTCTCTC 5580 

TCAATATGGA AACAGTCTTC TCTTATCTGG TATCGTTGAA GCAGAAGGAG TCTTTTCTTA 5640 

CGGTGATATC GTGACAGTAT TTGACAAGGA AAGTGGAAAA TCACTTGGAA AAGGACGCGT 5700 
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GCAATTTGGA 


GCATCTGCTT 


TGGAGGATAT 


GTTGCGTTCT 




AGGGTGTCTT 


5760 


GATTTACCGT 


GACGACTGGA 


TTTCCATTAC 


Tt. U L \* AAaT t 


\~t\t\\. 1 All 


1 1 nwW\ 1 1 


5820 


TTAGAGCTAA 


ACTATGGTGA 


GTAGACAAGA 


ACAATTTGAA 






^ Ann 


ATCGATTAAC 


ACAGCTAGTG 


AAGAAGTGAA 


AAACCAAGCC 


TTGCTAGCCA 


TGGCTGATCA 




CTTAGTGGCT 


GCTACTGAGG 


AAATTTTAGC 


GGCTAATGCC 


CTCGATATCG 


CAGCubwTAA 


DUDD 


GGGGAAAATC 


TCAGATGTGA 


TGTTGGATCG 


TCTTTATTTG 


GATGCAGATC 


GTATAGAAGC 




GATGGCAAGA 


GGAATTCGTG 


AAGTGGTTGC 


CTTACCAGAT 


CCAATCGGTG 


AAGTTTTAGA 


6120 


AACAAGTCAG 


CTTGAAAATG 


GTTTGGTTAT. 


CACAAAAAAA 


CGTGTAGCTA 


TGGGTGTCAT 


6180 


CGGTATTATC 


TATGAAAGCC 


CTCCAAATGT 


GACGTCTGAT 


GCCGCTGCTT 


TGACTCTTAA 


6240 


GAGTGGAAAT 


GCGGTTGTTC 


TTCGTAGTGG 


TAAGGATGCC 


TATCAAACAA 


CCCATGCCAT 


6300 


TGTCACAGCC 


TTGAAGAAGG 


GCTTGGAGAC 


GACTACTATT 


CATCCAAATG 


TGATTCAACT 


6360 


GGTGGAGGAT 


ACTAGCCGTG 


AAACTAGTTA 


TGCTATGATC 


AAGGCCAACG 


GCTATCTAGA 


6420 


CCTTCTCATT 


CCTCGTGGAG 


GAGCTGGCTT 


GATCAATGCA 


GTGGTTGAGA 


ATGCGATTGT 


6480 


ACCTGTTATC 


GAGACAGGGA 


CTGGGATTGT 


C C ATCTCT AT 


GTGGATAAGG 


ATGCAGACGA 


6540 


AG A CAAGGCG 


CTCTCTATCA 


TCAACAATCC 


TAAAACCAGT 


CGTC CTTCTG 


TTTGTAATGC 


6600 


CATGGAGGTT 


CTCCTGGTTC 


ATGAAAACAA 


GGCAGCAAGC 


TTCCTTCCTC 


GCTTGGAGCA 


6660 


AGTGTTGGTT 


GCAGAGCGTA 


AGCAAGCTGG 


ACTGGAACCA 


ATTCAATTCC 


GCCTAGATAG 


6720 


CAAAGCAAGC 


CAGTTTGTTT 


CAGGTCAAGC 


AGCTGAGACC 


CAAGACTTTG 


ACACCGAGTT 


6780 


TTTAGACTAT 


GTCCTTGCTG 


TTAAGGTTGT 


GAGCAGTTTA 


GAAGAAGCGG 


TTGCGCACAT 


6840 


TGAATCCCAC 


AGCACCCATC 


ATTCGGATGC 


TATTGTGACG 


GAAAATGCTG 




07UU 


ATACTTTACA 


GATCAAGTGG 


ACTCTGCAGC 


GGTGTATGTT 


AATCCCTwAA 


CTCCTTTCAC 


070U 


AGATGGAGGA 


CAATTTGGTC 


TTGGTTGT G A 


AATGCGGATT 


TCTACTCAGA 


AATTGCACGC 


7020 


GCCTGGTCCC 


ATGGGCTTGA 


AAGAGTTGAC 


CAGCTACAAG 


TATGTGCTTG 


CCCGTGATGG 


7080 


GCAGATAAGG 


GAGTAAGAGA 


TGAAGATTGG 


ATTTATCGGT 


TTGGGGAATA 


TGGGTGCTAG 


7140 


CTTGGCAAAA 


TCTGTCTTGC 


AGACTAGGAC 


GTCAGATGAG 


ATTCTCCTTG 


C C AATCGT AG 


7200 


TCAAGCTAAG 


GTAGATGCTT 


TCATTGCAGA 


CTTTGGTGGT 


CAGGCTTCCA 


GCAATGAAGA 


7260 


AATGTTTGCA 


GAAGCAGATG 


TGATTTTTCT 


AGGAGTTAAG 


CCTCCTCAGT 


TTTCTGAACT 


7320 


GCTTTCTCAA 


TACCAGACCA 


TCCTTGAAAA 


AAGAGAAAGT 


CTTCTTTTGA 


TTTCGATGGC 


7380 


AGCTGGATTG 


ACCTTAGAAA 


AACTAGCAAG 


TCTTATCCCA 


AGTCAACACC 


GAATTATTCG 


7440 
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TATGATGCCT AATACCCCTG CTTCTATCGG GCAAGGACTG ATTAGTTATC CCTTCTCTCC 7500 

TAATTCCAGG CCTCAGCACA GTGAGCTCTT TTATCAGCTT TTAGCCAAGG CTGGTCTCTT 7560 

GGTTGAACTA GGAGAAAGTT TAATCGATGC AGCGACAGGT CTTGCAGGTT GTGGACCACC 7620 

CTTTGTCTAT C TTTTT ATCG AGGCCTTGGC AGATGCAGGT GTTCAGACAG GATTACCACG 7680 

AGAAATAGCA TTCAAAATCG CAGCACAAAC TGTGGTAGGA GCTGGGCAAT TGCTCCTTGA 7740 

AACTCACCAA CATCCTGGAG TATTGAAAGA CCAAGTCTGT AGCCCAGGCG GTTCGACTAT 7800 

CCCTGGTGTA GCAAGCCTAG AACCGCATCC TTTCCGAGGA ACAGTCATGG ATGCAGTTCA 7860 

TCAAGCCTAC AAACGAACAC AAGAACTAGG TAAATAAGAG GTAGTTTTGA CTGCCTCTTT 7920 

TATGGTGGCT GAAATGAGAA GACACAAAAA GATTGTCACA AACCCCTATT TTTTTGATAG 7980 

AATAGAAGTA GTAAAAAAGA AATGAGTTAG ACATGTCAAA AGGATTTTTA GTCTCTCTTG 8040 

AGGGACCAGA GGGAGCAGGC AAGACCAGTG TTTTAGAGGC TCTGCTACCA ATTTT AG AGG 8100 

AAAAACGAGT AGAGGTGTTG ACGACCCGTG AACCTGGCGG AGTCTTGATT GGGGAGAAGA 8160 

TTCGGGAAGT GATTTTGGAT CCAAGTCATA CTCAGATGGA TGCTAAAACA GAGCTACTTC 8220 

TCTATATTGC CACTCGCAGA CAGCATTTGG TGGAAAAAGT TCTTCCAGCC CTTGAAGCTC 8280 

GCAAGTTGGT CATCATGGAT CCTTTTATCG ATAGTTCTGT TGCCTATCAG CGATTTGGTC 8340 

CTCCCTTAGA TATTGAAGCC ATTGACTGGC TCAATCAGTT TGCCACAGAT CCCCTCAAAC 8400 

CCGATTTGAC ACTCT ATTTT GACATCGAGG TGGAAGAAGG GCTGGCTCGT ATTCCTGCTA 8460 

ATAGTGACCG CGAGGTTAAT CGTTTGGATT TGGAAGGGTT GGACTTGCAT AAAAAAGTTC 8520 

GTCAAGGCTA CCTTTCTCTT CTGGATAAAG AGGGAAATCG CATTCTCAAG ATTGATGCTA 85 80 

GTCTCCCTTT GGAGCAAGTT GTCGAAACTA CCAAGGCTGT CTTCTTTGAC GGAATGGGCT 8640 

TGGCCAAATG AAACAAGATC AACTAAACGC TTGGCAACCA CCTCACTTTG ACCGTTTTGT 8700 

CCGTATCTTA GAACAAGACC AGCTCAATCA CGCCTATCTC TTTTCAGGTT TCTTTGAAAG 8760 

CTTGGAAATG GCGCAATTTT TAGCTAAGAG CCTCTTTTGT ACGGATAAAG TTCGCGTCTT 8820 

ACCATGTGAG AAATGCCGAA CTTGCAAGCT CATTCAACAC GGAGAATTTC CCGATGTCAC 8880 

CTTGATTAAA CCAGTTAATC AGGTCATTAA GACGGAACGC ATTCGAGAAT TGGTGGGTCA 8940 

GTTTTCTCAA GCAGGGATTG AAAGCCAGCA ACACGTCT TT ATCATCGAGC AAGCGGATAA 9000 

AATGCATCCC AACGCAGCCA ATTCTCTGCT CAAGGTCATC GAAGAACCCC AGAGTGAAGT 9060 

TTATATTTTC TTCTTGACTA GCGATGAGGA AAAGATCTTA CCGACAATCC GAACTCGGAC 9120 

TCAGATCTTC CACTTTAAAA AGCAAGAAGA AAAACTTATC TTACTCTTAG AACAAATGGG 9180 

ACTTGTTAAG AAAAAAGCGA CTCTTTTACC TAAGTTTAGT CAATCGCGAG CTGAAGCAGA 9240 
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AAACTTCCCT AATCACGCAA GTTTTTGGAC CTTGCTCGAT GAAACTGAAC GCCTGCTGAC 9300 

TTGCTTAGTA GCTAAGAAAA AAGAAAGTTA TCTACAGGTT GCCAAATTAG CCAACTTGCC 93 60 

AGATGATAAG GAAAAACAGG ATCACCTTTT ACGGATTCTT GAAGTTCTCT GTGGGCAGGA 9420 

CCTCTTCCAG GTAAGAGTAA GACTGATTCT A CAAGATTTA CTAGAAGCTA GAAAAATGTG 9480 

GCAACCTAAT GTCAGCTTTC AAAATCCCAT GGAATATCTG GTCTTGAAAG AAATATAAAC 9540 

TCAAAAATGA ATGATAAAGA AAGGAAAGGG CTGTTTTATG GACAAAAAAG AATTATTTGA 9600 

CGCGCTGGAT GATTTTTCCC AACAATTATT GGTAACCTTA GCCGATGTGG AAGCCATCAA 9660 
GAAAAATCTC AAGACCCTGC TAGAGGAAAA TACAGCTCTT CGCTTGGAAA ATAGTAAGTT . 9720 

GCGAGAACGC TTGGGTGAGG TGGAAGCAGA TGCTCCTGTC AAGGCCAAGC ATGTTCGTGA 9780 

AAGTGTCCGT CGCATTTACC GTGATGGATT TCACGTATGT AATGATTTTT ATGGACAACG 9840 

TCGAGAGCAG GACGAGGAAT GTATGTTTTG TGACCAGTTG CTATACAGGC AGTAGGCATG 9900 

CAGATTCAAA AAAGTTTTAA GGGGCAGTCT CCCTATGGCA AGCTGTATCT AGTGGCAACG 9960 

CCGATTGGCA ATCTAGATGA TATGACTTTT CGTGCTATCC ACACCTTGAA AGAAGTGGAC 10020 

TGGATTGCTG CTGAGGATAC GCGCAATACA GGGCTTTTGC TCAACCATTT TCACATTTCC 10080 

ACCAAGCAGA TCAGTTTTCA TGAGCACAAT GCCAAGGAAA AAATTCCTGA TTTG ATTGGT 10140 

TTCTTGAAAG CAGGGCAAAG TATTGCTCAG GTCTCTGATC CCG G TTTGCC TAGCATTTCA 10200 

GACCCTGGTC ATGATTTAGT TAAGGCAGCT ATTGAGGAAG AAATTGCAGT TGTGACAGTT 10260 

CCAGGTCCCT CTGCAGGAAT TTCTGCCTTG ATTGCCAGTG GTTTACCGCC ACAGCCACAT 10320 

ATCTTTTACG GTTTTTTACC GAGAAAATCA CCTCAGCACA AGCAATTTTT TGGCTTGAAA 10380 

AAACATTATC CTGAAACACA GATTTTTTAT GAATCACCTC ATCGTGTACC AC ACACGTTG 10440 

GAAAATATGT TAGAAGTCTA CGGTGACCGC TCCGTTGTCT TGGTCACGGA ATTGACCAAA 10500 

ATCTATGAAG AATACCAACG AGGTACTATC TCTCAGTTAT TAGAAAGCAT TGCTCAAACG 10560 

CCACTCAAGG GCGAATCTCT TCTCATT C TT CAGGGTGCCA GTCAGGGTGT GGAGGAAAAG 10620 

GACGAGGAAG ACTTGTTCCT AGAAATTCAA ACCCGCATCC AGCAAGGTGT GAAGAAAAAC 10680 

CAAGCTATCA AGGAAGTCGC TAAGATTTAC CAGTGGAATA AAAGTCAGCT CTACGCTGCC 10740 

TACCACGACT GGGAAGAAAA ACAATAAAGG GAGACAGGAT GTAATAATTC TGTCTGTTTC 10800 

TGTTTAACTT AATTAGTGAT GATAATATAA AGATGTATCA CTTGGTATAG AAGCTTTGGT 10860 

ATTAAGTTTT TTATTAAGCC CATACGGAAT ACCGATGGTT GGAGCAGCAG TTATAGCGTT 10920 

CTTAGAAGCT ATAAATAGAA AAATAACGTC ATTTTAAATC AAAGGATTGA TAAATCAGAA 10980 
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AGAAGCTGAT 


TTTTTGCGAA 


CATACCAAAA 


TAAAGAAGAA 


CTAA^ACCTC 


AGATAGAGAA 


11040 


AACATTTCAG 


AAATATATTT 


TAG AATTTG A 


TAATATTCCA 


GAAAATTTAA 


AAGATAAGAG 


11100 


AGCTGATGAA 


GTTGACAGAA 


CTCCACCACA 


AAACCTTGCT 


TATCAGGTTG 


GTTGGACCAA 


11160 


CTTGGTTCTT 


AAATGGGAAG 


AAGATGAAAG 


AAAGGGGCTT 


CAAGTAAAAA 


CACCATCGGA 


11220 


TAAATTTAAA 


TGCAATCAAC 


TTGGTGAATT 


ATATCAGTGG 


TTCACAGATA 


CCTACGCTCA 


11280 


TTTATCTCTG 


CAAGAGTTGA 


AAGCAAAATT 


AAATGAAAAT 


ATTAATTCTA 


TCTCTGCAAT 


11340 


CATTGATTCG 


TTGAGTGAGG 


AAGAATTATT 


TGAACCGCAT 


ATGAGAAAGT 


GGGCTGATGA 


11400 


AG CG ACT AAA 


ACAGCGACTT 


GGCAAGTGTA 


TAACTTTATT 


CATGTAAATA 


CGCTTGCACC 


11460 


TTTTGGAACT 


TTCAGAACTA 


AAATCAGAAA 


ATGGAAGAAG 


ATAGTATTAT 


AAATTATATT 


11520 


TTTAACTTTA 


AAAAATTTCA 


TAAAAATGGT 


TACCAAAGGC 


GATAGAAGAA 


AAACTATCGT 


11580 


CTTTTTCTTT 


GCAAATTTTT 


AAGAAGGGAG 


GTGATCTTGC 


ATGGACTTTG 


AATATTTTTA 


11640 


TAACAGAGAA 


GCGGAAAGAT 


TTAACTTCTT 


AAAAGTACCG 


GAGATATTAG 


TTGATAGAGA 


11700 


AGAATTTCGG 


GCCTTATCAG 


CAGAAGCAAT 


TATCCTTTAT 


TCCATACTTC 


TTAAACAGAC 


11760 


AGGAATGTCA 


TTTAAGAATA 


ACTGGATAGA 


CAAGGAAGGC 


AGAGTATTTA 


TCTATTTTAC 


11620 


TGTCGAAGAA 


ATTATGAAAA 


GAAGAAATAT 


CTCAAAGCCA 


ACTGCCATAA 


AAAC ATT AG A 


11880 


TGAGCTTGAT 


GTAAAAAAGC 


AATAGGACTG 


ATCGAAAGAG 


TAAGCCTTGG 


ACTTGGTAAG 


11940 


CCGAACATCA 


TTTATGTTAA 


ACACTTTATG 


AGTATATTTC 


AGGTAAAAGA 


AAATGACTTA 


12000 


C AG AAGTCAA 


AAAACTTAAC 


TTCAGAACTA 


AAAGATTTTA 


ACCTCAGAAG 


TAAAGAAAAT 


12060 


GAACTTCAAG 


AGGTTAAGAA 


CCTTGACTCT 


AACTATATAG 


AGAATAATAA 


GAGTAAGTAT 


12120 


AGTAACAGAG 


AATATACTTT 


TGGTGAAAAC 


GGACTTGGAA 


CATTTCAAAA 


TGTGTTTTTA 


12180 


GCTGCTGAAG 


ATATATCGGA 


TTTACAAATC 


ATAATGAACT 


CACAGCTTGA 


CAATTACATT 


12240 


AGACTTCCTC 


CAAAACTAGA 


ATCCTAGTTC 


ATGATTGATA 


ATGCCAGCAA 


TCAAATTCAT 


12300 


TCCTAATCCG 


AAGCGTTTAC 


GATGATTTCG 


ATAGATTGTT 


GAAAACATTT 


TAAACGTTTT 


12360 


TACTTTGGCA 


AAGATGTTCT 


CAATCTTGCT 


TCTCTCCTTG 


CATAGCGCAT 


GGTTACAGGC 


12420 


TTTATCTTCA 


GCTGTTAGCG 


GCTTGAGTTT 


GCTGGATTTA 


CGTGGAGTTT 


GTACTTGAGG 


12480 


ATATATCTTC 


ATGAGCCCTT 


GATAACCACT 


GTCAGACAAG 


ATTTTACCAG 


CTTGTCCGAT 


12540 


ATTTCTGCGA 


CTCATTTTGA 


ACAACTTCAT 


ATCACGACAA 


TAGTTCACAG 


CGATATCCAA 


12600 


AGAAACAATT 


CTCCCTTGAC 


TTGTGACAAT 


CGCTTGAGCC 


TTCATAGCGT 


GAAATTTCTT 


12660 


TTTACCAGAA 


TGATTCGCTA 


ATTCTTTTTT 


TAGGGCGATT 


GATTTTTACT 


TCCGTCGCAT 


12720 


CAATCATTAC 


CGTGTC CTC A 


GAACTGAGAG 


GAGTTCTTGA 


AATCGTAACA 


CCACTTTGAA 


12780 
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CAAGAGTTAC 


TTCAACCCAT 


TGGCTCCGAC GGATTAAGTT 


GCTTTCGTGA 


ATACCAAAAT 


12840 


CACCCGCAAT 


TTGTTCATAA 


GTTCGATATT 


CTCCCACATA 


TTGAAGAGTG 


GCCATAAGAA 


12900 


CGTCTTCTAG 


GCTTAATTTA 


GGTTTTCGTC 


CACCTTTTCC 


GTGTTTAAGT 


TGATAAGCTG 


12960 


TTTTTAATAC 


AGCTAATATC 


TCTTCAAAAG 


TCGTGCGCTG 


AACACCAACA 


AGACGCTTAA 


13020 


ATCGTGCATC 


AGTTAGTTGT 


TTACTTGCTT 


CATCATTCAT 


AGAACTACTA 


TACCATATTT 


13080 


TGTTTCGCAG 


GAAGTCTATT 


GGAAAGTAAG 


AAATATTGAA 




TTACAACAAA 


13140 


TTGTGAGCCT 


GGTGCTATTT 


TTTCAGGTAA 


AATAAAATAT 


CACCAAGA.TT 




13200 


AGGAGATCAC 


TATGTTGAAT 


GTTATGCTGT 


TCTAGATAAT 






13260 


AATAACAGTC 


CCTATCGATC 


CCTTATCTGG 


AAAAGATTTT 


JK 1 Awnu 1 AUV. 


ATATA AfTYlA 


13320 


TTCTTAACTG 


GAATACTCAC 


T ATCT CTVT A 


CATCAAGAAA 






13380 


GCCTTCTTCC 


C11TTTTTGT 




GAAGAAAAAA 


1 I AuAAnuA 1 


TTGTGGGTOT 


13440 


CAAACAGCCC 


AGTGGGGTGT 




ACTTAGGTCC 


CACCCAAAGA 


GGTATTAGTG 


1 J3UU 


TCGTGTCTCA 


ATCTTATATC 


AATCTTATCG 


GTGCTGGTTT 


GGCAUC l IV! 






AC C AAATCGC 


AGAGCGTGGT 


ATTCCAGTTA 


AACTATATGA 


AA rULu 1 1 


u 1 1_ r\J\\j 1L In 


13620 


CACCCCAGCA 


TAAAACAGAC 


AATTTTGCTG 


AGTTGGTTTG 


TTCCAA TltT 


l l \*\m\» I UUVIU 


13680 


ATGCTTTGAC 


AAATGCAGTT 


GGTCTTCTCA 


AGGAAGAAAT 


GCGTCGCTTG 


GGTTCTGTTA 


13740 


TCTTGGAATC 


TGCTCAGGCT 


ACACGTGTTC 


CTCCAGCTGG 




gPCGACCCTC 


13800 


ATGGTTTCTC 


TCAAATGGTG 


ACCGAAAAAG 


TTGCC AAC C A 


CCCC TTG ATT 




13860 


GTGATGAAAT 


TACAGAATTG 


CCGACAGATG 


TTATTACCGT 


I A U.UL 1 Ml- 1 




13920 


-AACT jATCC 


CTTGCTTGrt.**. 


AAGATTCATC 


CTTr-.VATCA 




T 1" i ♦ ATTTCT 


13980 


ACGATGCGGC 


AGCGCCTATT 


ATCGATGTCA 




T & TY^ Al^ & Afi 


GTCTACCTCA 


14040 


AATCACGTTA 


TGATAACGGA 


GAAGCGGCCT 


ACCTCAATGC 


CCCTATGACC 


AAGCAAGAVT 


14100 


TTATGGATTT 


CCATGAAGCT 


TTGGTCAATG 


CAGAACAAGC 


ACCGCTTAGT 


TCTTTTGAAA 


14160 


AAGAAAAGTA 


CTTTGAAGGA 


TGTATGCCTA 


TCGAAGTCAT 


GGCCAAACGT 


GGCATTAAAA 


14220 


CTATGCTTTA 


TGGCCCTATG 


AAGCCAGTCG 


GTCTTGAGTA 


CCCAGACGAC 


TATACAGGAC 


14280 


CTCGTGATGG 


AGAATTTAAA 


ACACCTTATG 


CCGTTGTGCA 


ACTTCCTCAG 


GATAATGCAG 


14340 


CTGGTACCCT 


CTACAATATT 


GTTGGTTTCC 


AGACCCACCT 


CAAATGGGGA 


GAACAAAAGC 


14400 


CTCTCTTCCA 


AATGATTCCG 


GGTCTTGAAA 


ATGCGG AG TT 


TGTCCCTTAT 


GGTGTGATGC 


14460 


ATCGCAATTC 


TTACATGGAT 


TCACCAAATC 


TTCTTGAGCA 


GACTTACCCT 


TCTAAGAAAC 


14520 



WO 98/18931 



PCT/US97/19588 



282 



AACCAAATCT 


CTTCTTTGCT 


GGTCAAATGA 


CGGGTGTGCA 


AGGCTATGTT 


GAGTCGGCGG 


14580 


CTTCAGGCTT 


AGTTCCGGCA 


ATTAACGCAG 


CTCGTCTCTT 


CAAGGAAGAA 


AGCGAGGCTA 


14640 


TTTTCCCCGA 


GACGACAGCG 


ATTGGAAGCT 


TAGCTCATTA 


CATTACCCAT 


GCCGACAGCA 


14700 


AACATTTCCA 


ACCAATGAAT 


GTCAATTTTG 


GGATCATCAA 


GGAGTTGGAA 


GGCGAGCGTA 


14760 


TCCGTGATAA 


GAAGCCTCGT 


TATGAAAAAA 


TTGCAGAGCG 


TGCCCTTGCC 


GACTTAGAGG 


14820 


AATTTTTGAC 


TGTCTAATTT 


TTTTGAAAGA 


ATTGCTCATC 


ATACTATAAA 


AATCTTAGAA 


14880 


ATTGTGATAA 


AATAGGTAGG 


ATGAAAGAAG 


GAGACTGAAA 


ATGGCGAATC 


CCAAGTATAA 


14940 


ACGTATTTTA 


ATCAAGTTAT 


CAGCTGAAGC 


CCTTGCCGGT 


GAACGTGGCG 


TAGGGATTGA 


15000 


TATCCAAACA 


GTTCAAACAA 


TCGCAAAAGA 


CATTCAAGAA 


GTTCATAGCT 


TAGGTATCGA 


15060 


AATTGCCCTT 


GTTATCGGTG 


GAGGAAATCT 


CTGGCGTGGA 


GAACCTGCAG 


CAGAAGCAGG 


15120 


TATGGACCGT 


CTTCAGGCAG 


ATTACACAGG 


AATGCTTGGG 


ACTCTTATGA 


ATGCTCTTGT 


15180 


GATGGCACAT 


TCATTGCAAC 


AAGTTGGGGT 


TGATACGCGT 


GTACAAACAG 


CTATTGCCAT 


15240 


GCAACAAGTG 


GCAGAGCCTT 


ATGTCCGTGG 


ACGTGCCCTT 


CGTCACCTTG 


AAAAAGGCCG 


15300 


TATCGTTATC 


TTTGGTGCTG 


GAATTGGTTC 


ACCTTACTTC 


TCGACAGATA 


CAACACCGGC 


15360 


CCTTCGTGCA 


GCTGAAATCG 


AAGCAGATGC 


CATCCTCATG 


GCTAAAAATG 


GTGTCCATCG 


1S420 


TGTTTACAAT 


GCCGATCCTA 


AGAAAGATAA 


GACAGCTCTT 


AACTTTGAAG 


AATTGACCCA 


15480 


CCGTGACGTT 


ATCAATAAAG 


CTCTTCGTAT 


CATGCACTCA 


ACAGCTTCAA 


CCCTCTCAAT 


15540 


GGACAACGAC 


ATTGACTTGC 


TTGTATTCAA 


CATGAACCAA 


CCAGGCAACA 


TCAAACGTGT 


15600 


CGTATTTGGT 


GAAAATATCG 


GAACAACAGT 


TTCAAATAAT 


ATCCAAGAAA 


AGGAATAAGA 


15660 


AAGAATATGG 


CTAACCCAAT 


TATTGAAAAA 


GCTAAAGAGA 


GAATGACCCA 


GTCTCACCAA 


15720 


TCACTTCCTC 


GTGAATTTGG 


TGGTATCCGT 


CCTCGTCCTG 


CCAATGCAAG 


CTTGCTTGAC 


15780 


CGTGTACATC 


TAGAATACTA 


TGGAGTCGAA 


ACTCCTCTTA 


ACCAAATCGC 


TTCAATTACG 


15840 


ATTCCAGAAG 


CGCGTGTTTT 


GTTGGTAACA 


CCATTTCACA 


AGTCTTCATT 


GAAAGACATC 


15900 


GAACGTGCCT 


TGAACGCTTC 


TGATATTGGT 


ATCACACCGG 


CTAATGACGG 


TTCTGTGATT 


15960 


CGCTTCGTTA 


TCCCACCTCT 


TACAGAAGAA 


ACTCCTCGTG 


ACCTTGCTAA 


AGAAGTGAAG 


16020 


AAGGTCGGCG 


AAAATGCTAA 


AGTGGCTCTC 


CGCAATATCC 


GTCGCGATCC 


TATGGACGAA 


16080 


GCTAAGAAAC 


GAGAAAAAGC 


AAAAGAAATC 


ACTGAAGACC 


AATTGAAGAC 


TCTTGAAAAA 


16140 


G AC ATT C AAA 


AAGTAACAGA 


CGATGCTCTT 


AAAC AC ATCG 


ACGACATGAC 


TGCTAACAAA 


16200 


GAGAAAGAAC 


TTTTGGAACT 


CTAAAAATAA 


ACAGAAAAAC 


TCACTTGGCA 


TTGCTGGCTG 


16260 


AGTTTTATTC 


GAAAGAAGGA 


AATATGAATA 


CAAATCTTGC 


AAGTTTTATC 


GTTGGACTGA 


16320 
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TCATCGATGA AAACGACCGT TTTTACTTTG TGCAAAAGGA TGGTCAAACC TATGCTCTTG 16380 

CTAAGGAAGA AGGCCAACAT ACAGTAGGGG ATACGGTCAA AGGTTTTGCA TACACGGATA 16440 

TGAAGCAAAA ACTCCGCCTG ACAACCTTAG AAGTGACTGC CACTCAGGAC CAATTTGGTT 16500 

CCGGACCTGT CACAGAGCTT CGTAAGGACT TGGGTGTCTT TGTGGATACA GGCCTTCCTG 16S60 

ACAAGGAAAT CGTTGTGTCA CTCGATATTC TCCCTGAGCT CAAGCAACTC TGGCCTAAGA 16620 

AGGGCGACCA ACTCTACATC CGTCTTGAAG TGGATAACAA AGACCGTATC TGGGGCCTCT 16680 

TGGCTTATCA ACAAGACTTC CAACGTCTTG CTCGTCCTGC CTACAACAAC ATGCAGAACC 16740 

AAAACTGGCC ACCCATTGTT TACCCTCTCA AGCTGTCAGG AACTTTTCTT TACCTACCAG 16800 

AAAATAATAT GCTTGGTTTT ATTCATCCTA GCGAGCGTTA CGCAGAGCCA CGTTTGCGGC 16860 

AAGTATTAGA TGCGCGCGTT ATTGGTTTCC GTGAACTGGA CCCCACTCTG AACCTCTCCC 16920 

TCAAACCACC CTCCTTTGAA ATGTTGGAAA ACGATGCTCA GATGATTTTC ACTTATTTGG 16980 

AAAGCAATGG CGGTTTCATG ACCTTAAATG ACAACTCATC TCCAGACGAC ATCAAGGCAA 17040 

CCTTTGGCAT TTCTAAAGGT CACTTCAAGA AAGCTTTAGG TGGTCTTATG AAGGCTCGTA 17100 

AAATCAAGCA GGACCAGTTT GGGACAGAGT TGATTTAGGG AGGCTTATGA GAAAATCATT 17160 

TTACACTTGG CTCATGACCG AGCGCAATCC TAAAAGTAAC AGTCCCAAAG CAATTTTGGC 17 2 20 

AGACCTCGCT TTTGAAGAGT CAGCCTTTCC AAAACACACA GATCATTTTG ATGAGCTCAG 17280 

TCGCTT T TTG GAGGAGCATC CCAGTTTCTC TTTTAACCTA GGAGATTTTG ACAGCATTTG 17 340 

GCAGGAATAT CTAGAACACT AGCATTTATT CATTGGGTTT CGGCTAGTAA TTTCTCCATC 17 400 

CCTCTGCTAT AATAAAAAGA AATAAAAGGA TTAGAGAGGT TCTTTATTTG AAGGAACATT 174 60 

CAATAGACAT TCAACTGACT CATCCAGATC ACCTGTTTCA TCTTTTTGGT TCCAATGAAC 17520 

GCCATCTTCG TTTGATGGAA GAAGAGCTTG ATGTTGTCAT TCATGCTCGT ACGGAGATTG 17580 

TCCAC C TT TT GGGAGAAGAG TCTGCCTGTG AGGAAGCCCG TCAAGTTATT CAGGCTTTGA 17640 

TGGTCTTGGT AAATCGTGGG ATGACCGTTG CTACGCCACA TGTAGTCACT GCGATTAGCA 17700 

TGGTCAAAAA TCATGAAATT GACAAGTTTG TCGCCCTTTA CGAAGAAGAA ATTATCAAGG 17760 

ATAATACTGG GAAACCTATC CGTGTCAAAA CCCTAGGGCA AAAGCTTTAT GTGGACAGTG 17820 

TCAAACAGCA TGATGTGACC TTTGGAATTG GGCCAGCAGG TACAGGGAAG ACCTTCCTTG 17880 

CAGTGACCTT GGCAGTGACT GCCCTTAAAC GTGGGCAAGT CAAGCGAATT ATCCTAACTC 17940 

GTCCAGCGGT GGAACCGGGA CAGAGTCTTG GATTTCTTCC GGGTGATCTT AAGGAGAAGG 18000 

TGGATCCTTA CCTTCGTCCT CTTTACGATG CCTTGTATCA AATTCTTGGG AAAGACCAAA 18060 
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CGACTCGTCT CATGGAGCGT GAAATTATCG AAATTGCGCC CCTTGCCTAT ATGCGTGGCC 18120 
CCACCTTCCA TGATCCCTTT GTCATTCTCG ATGAGGCGCA AAACACGACC ATCATGCAGA 18180 
TGAAGATCTT CTTGACGCCT TTAGGTTTTC ATTCTAAGAT GATTGTCAAT GGAGATATTA 18240 
GTCACATTGA CCTGCCACGT AATGTCAAGT CCGCTTTGAT TGATGCTCAA GAGAAACTCA 18300 
AGAACATCCA TCAGATTGAC TTTGTTCATT TTTCAGCCAA GGATGTGGTT CGCCATCCTG 18360 
TTGTCGCTCA GATTATCCGA GCCTATGAAT ATTCTACTGA AGTTGCACAC CACTGATTTT 18420 

GACGAAGTTC GCCTGCAAAA GAATAGACTT GTTCGGTAAC TGTAAAAAGT CTTATACTAT 18480 

TTTTATGGAA ACAGTATACG ACAAAGCACA AAAACTTAAC TCAAAAAACT TCAAACTATT 18S40 

GATTGCTCTC AAAAAGGAAA CCTTTCAACT CATCCTAGAA CACCTCAATT CACCCTATCA 18600 

GATTCACCAC CGAAAAGGTG GACGTCCACG TAGTCTGCCC ATGGAAGACC AGCTCATTAT 18660 

CACCCTCCCT TACTTGCGAT ATTATCCCAC TCACCGTCTG CTGGC C TTTG ATTTTGCCGT 18720 

CGGTGTAGCT ACGGTAAATG CCATCATCAC TTCGGTGGAG GATACACTTC GTGCCTCAGG 18780 

TAGCTTTGAT TTGGACCATT TAGAAGCCCC GAGTCCTGCT GTGGCTATTG ACGTGACCGA 18840 

AAGTCCCATT CAGCGTCCAA ACAAAACCAA AGCAAAAATT ATTCTGGTAA AAAGAAACGA 18900 

CACACCTTAA AAACTCAAAT TATGCTGGAT TTGACGACAC ATAAAGTCTG TCAAATGCCC 18960 

TTTTCTGACG GACATACGCA TGATTTTACT CTCTTCAAAG AAACTATTGG ACAAAGTTTG 19020 

CCTGAAACGA CGCTTGCCTT TGTTGACCTA GGTTATTTAG GCATCTTGAA ATTTCATGAG 19080 

AATACTTTCA TTCCTGCTAA AAATTCCAAA AATCCCCGCC TGAGTCAGGA TCATAAGCAG 1914 0 

TTAAATAAAG AGATGTCAGC GATACCAATT GAAATTGAAC ATTTTAACGC TAAATTCAAG 19200 

ACCTTCCAAA TCATCTCAGT CCCTTATCGT AACCCCAGAA AACGTTTCGA GTTACCGGCC 19260 

GAATTAATTT GTGCCATCAT CAATTATGAA GTGAACTAGA TTCCGAACAA GTCTAATATA 19320 

CTTTTGAGAG AGGAAAATCC AGTTGTATAG GCTAAAGGTT TTATCCAAAG GTCTGAGACA 19380 

ACGATTAGGC ACCATGGAAA GAACTTTTAT GTGGCTGATC ACGATCAGTG CATCTTCCTC 19440 

TCTCATAATC ACAGGGCACA AGAAAGTAGG AATTTCAAAA GATGATTGAC CAACTATCTA 19500 

ACT ATT AC AG TTGTAGGATA CTAACTGAAA AGGATATTCC AACTATTTTA TCTTTATATG 19560 

AAAGTAATCC TCTCTATTTT CACCATTCTC CACCAGAGCC AAATTTTGCA ACTGTAAAAG 19620 

AGGACATGCT TTGTCTACCT GAAGGTAAAG CTAAGGCTGA TAAGTTTTTT GTTGGATTTT 19680 

GGAATCGATC TGACCTTGTG GCTGTTATGG ATTTTGTCTA TGCATATCCT GATGAGGAGA 19740 

CTGTTTTTAT TGCTTTGTTT ATGGTTGATC AAGCCTATCA GAGAAAAGGG ATTGGTAGTC 19800 

ATATTGTGAC AGAAGCACTA GCTTATTTTG CTAAGAACTT TCGAAAGGCA CGTTTGGCTT 19860 
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ATGTTAAGGG 


AAATCCGCAA 


TCTCAGCATT 


I TluOunAnA 


Viv.nuuui. ill 




19920 


GATGCGAGGT 


TAAGCAAGAA 


CTCTATACGG 


I I Oil ATCGC 


TGAACAW»AviW 


l» i nuAAun 1 I 


i aq an 


AGAAATGGCA 


TCAAGTAAGA 


ACTATTTGGA 


ATTTGTTTTG 




UUiuA 1 1 nun 


20040 


TCATCTGACT 


TACCGTTCCA 


TGATGGGGGA 


GTATATTCTT 


I r\\- i IlLuLu 




cwlWV 


TGGCGGCATT 


TATGACGATC 


GCTTTTTAGT 


TAAACCCGTG 


CAAGCAGTCT 


TAGATAAGAT 


<uieu 


TGACCAATCT 


TCTTTTGAGT 


TTCCATACAA 


AGGTGCCAAA 


GAAATGATTT 


GAGTGGAAGA 


2U22U 


ACTTGATAAT 


AAGATGTTTC 


TATAACACCT 


AATTTTAGCT 


ATGTATAACC 


AACTGCCAAC 


20280 


GCCCAAACCT 


AAAAAGAAAA 


AGCAAGGGTG 


AACGAAGTAA 


AAAAGAAGTC 


TGCTAAGCCC 


20340 


CTCTCTTTGC 


ACGGGTAAAA 


TTTTATATAT 


AAAAAGAAGC 


TGGGACTAAA 


G AGCTC AGCT 


20400 


TCCTTTGGTT 


TATATAATTG 


TCATTACAAG 


ACGAAGTCGT 


TGGGCGAAAC 


TCTGTTGACT 


20460 


TTATTCAATT 


TAGAGTTTCT 


TATGCACAAT 


TGAGTCTGGA 


ACGAAAGTCT 


CCAGTTGCAA 


20520 


AGTATACACT 


ACAATAAACC 


AACGATGTAA 


TAGCTGATGA 


CACAAAGCAC 


AGTGGGTAGG 


20580 


ACTTCCGAAG 


TCACCCTTTT 


CTTTTCAAAA 


TTTATACTAA 


ATCATTGATA 


TCAGTGTAGT 


20640 


CACGATTAAG 


TCCTTGACCA 


ACTCGTAGGT 


TAGTCAAGTA 


ACCTTGATAA 


GTAGTCACAC 


20700 


CTTGACGCAA 


CCCTTCATCT 


TCAGAGATTG 


CTTCTCCGAA 


TCCTTTGCCA 


GCCAAAGCTT 


20760 


CCATATAAGG 


AAGAGTGACA 


TTGGTTAGGG 


CGATGGTTGA 


AGTGCGAGCA 


ACCGCACCAG 


20820 


GGATATTGGC 


AACGGCATAG 


TGCAGAACAC 


CGTGTTTTTC 


ATAGACGGCT 


TCATCCTGCG 


20880 


TTGTCACACG 


CTCAGCTGTT 


TCGATAACCC 


CACCTTGGTC 


AACAGCAACC 


TCAACGATAC 


20940 


AGAGCCTGGA 


CGCATTTGTT 


TGACCATCTC 


ATCTCTCACC 


AATTCCGGTG 


CTTTTGCACC 


1 1 AAA 

2 luau 


AGGGATGAGA 


ATGGCTCCAA 


TCACCACATC 


AGCATCTCTC 


ACACTTGCTT 




A 1UOU 


TGAATTACAC 


ATAAGAGTTT 


GAATTTGACT 


TCCAAAGACT 


TCTTCTAGAA 






CTTGGAACTA 


ATATCTAAAA 


TAGTCACTTG 


AGCACCAAGA 


CCAAGGGCGA 


TGCGGGCA1C 


2X180 


ATGTGTACCG 


ACCACACCAC 


CACCGATGAT 


AGTTACTTTT 


CCTTTTGGAA 


CACCTGGTAC 


21240 


ACCACCAAGT 


AGAACACCAG 


AGCCACCAGC 


TTGCTTAGTA 


AGGAAGTCAC 


CTCCGATTTG 


21300 


AACAGCCATA 


CGACCTGCAA 


CCTCACTCAT 


AGGAACGAGG 


AGCGGTAGTT 


GTCCTTGATT 


21360 


CTCACGAACA 


GTTTCAGTTG 


TTTTTGCTGT 


TAACATAGCA 


TCTGCTAATT 


CTGGAGCAGC 


21420 


GGCCATGTGC 


AAGTAGGTGA 


AGAGAAGAAG 


ATCCTCGCGC 


AACTAACCGT 


ATTCAGAACT 


21480 


TAAAGATTCT 


TTTACTTTCA 


CAACCAACTC 


TGCTGCCCAA 


CCTTCACCAG 


CAGTAGCGAC 


21540 


AATCTCAGCT 


CCTTGCTTTT 


GATAGTCAGC 


ATCAGTAAAG 


CCAGAACCGA 


GACCAGCATT 


21600 
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TGTTTCCATA 


AGGACACGAT 


GACCACGACT 


AACTAAGCTA 


TGAACACCTG 


CAGGTGTGAG 


21660 


CGCGACACGG 


TTTTCGTTAT 


TTTTAATTTC 


TTTTGGGATT 


CCGATTAACA 


TTGAGATAAC 


21720 


CTACCTTTCA 


ATTGACGGTC 


TTGTTTTGGT 


TGTCACATTC 


CAGTTCATAA 


ATCAAAAATC 


21780 


TGACGCTTTC 


ATTGTATATG 


AAACCCCTTC 


AAAAATCAAG 


AAAAACTTGT 


CATCCAAATT 


21840 


TTTTTATGCT 


AGACTAGTGA 


AAATCAAGCT 


CTAATGGAGG 


GAAAACTATG 


GAATCAATAT 


21900 


TTGTGAAATT 


TGCCCAGTAT 


CCGTCTATAG 


AAACGGAGCG 


TTTATTGCTC 


AGACCTGTAA 


21960 


CTTTGGATGA 


TGCCGAACAA 


TGTTTGACTA 


TGCCTCGGAC 


AAGGGTAATA 


CACCTTACAC 


22020 


TTTTCCAACC 


AATCAAAGCT 


TGGAAGAAAC 


CAAGAATAAC 


ATTGCTCAGT 


TCTACTTGGC 


22080 


TAATCCCTTG 


GGACGTTGGG 


GAATAGAACT 


AAAAAGCAAT 


GGTCACTTTA 


TTGGAACCAT 


22140 


TGACTTCCAC 


AACATTGATT 


CTGTTCTTAA 


GAAGGCAGCT 


ATTGGCTACA 


TTATCAATAA 


22200 


AAAGTATTGG 


AATCAAGGAT 


TAACGACAGA 


AGCCAATCGT 


GCTGTGATTG 


ACCTAGCTTT 


22260 


TGAGAAGATA 


GGGATGAATA 


AGTTGACTGC 


CCTTCACGAT 


AAGGCTAATC 


CCGCCTCAGC 


22320 


AAACCTCATG 


GAGAAATCAG 


CCATGCCTTT 


TTCCCATGCA 


GAACCATATG 


CTTGTATGGA 


22380 


CCAGCATGAA 


AAAGGCCGAA 


TCCTGACAAG 


ACTTCATTAT 


GTCTTGACCA 


AGGAAGACTA 


22440 


TTTTCCAAAT 


AAATAAGCAG 


TTGAAAAGAA 


ATTTTTCGAC 


TGTTTTTTCT 


TCCTCTTACC 


22500 


AATAATCTAA 


CAGAGGAGAA 


AATATGGAAG 


CAATTATCGA 


GAAAATCAAA 


GAGTATAAAA 


22560 


TCATCGTCAT 


CTGTACTGGT 


CTGCGCTTGC 


TTGTAGGAGG 


ATTTTTCCTG 


CTAAAACCAC 


22620 


CTCCACAAAC 


ACCTGTCAAA 


GAGACGAATT 


TGCAGGCTGA 


AGTTGCAGCT 


GTTTCCAAGG 


22680 


ACTCATCGAC 


CGAAAAGGAA 


GTGAAGAAGG 


AACAAAAGGA 


AG AACCC CTT 


GAACAAGATC 


22740 


TAATCACAGT 


AGATGTCAAA 


GGTGCTCTCA 


AATCGCCAGG 


GAT7TATGAC 


TTGCCTGTAG 


22800 


GTAGTCGAGT 


CAATGATGCT 


GTTCAGAAGG 


CTGGTGGCTT 


GACAGACCAA 


GCAGAGAGCA 


22860 


ACTCGCTCAA 


TCTAGCTCAG 


AAAGTTACTG 


ATGAGCCTCT 


GGTTTACGTT 


CCTACTAAGC 


22920 


GAGAAGAAGC 


ACTTAGTCAA 


CAGACTGGTT 


CGGGGACAGC 


TTCTTCAACA 


AGCAAGGAAA 


22980 


AGAAGGTCAA 


TCTCAACAAG 


GCCAGTCTGC 


AAGAACTCAA 


CCAGGTCAAG 


GGACTGGGAG 


23040 


GAAAACGAGC 


TCAGGACATT 


ATTGACCATC 


GTGAGGCAAA 


TGGCAAGTTC 


AAGTCAGTAG 


23100 


ACGAGCTCAA 


GAAGGTCTCT 


GGCATTGGTG 


CCAAAACAAT . 


ACAAAAGCTT 


AAAGACTATG 


23160 


TTACAGTGGA 


TTAAGAATTT 


CTCTATTCCC 


CTAATTTACC 


TGAGTTTTCT 


ATTACTTTGG 


23220 


CTTTATTACG 


CTATTTTCTC 


AG C ATCTT AT 


CTTGCTTTGT 


TGGGCTTTGT 


TTTTCTGCTA 


23280 




TTATCCAATT 


TCCGTGGAAA 


TCTGCTGGTA 


AAGTTCTAAT 


AATTTGCGGA 


23340 


ATCTTTGGAT 


TTTGGTTTGT 


TTTTCAAAAT 


TGGCAACAGA 


GTCAAGCCAC 


TCAAAATCTG 


23400 
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CCGGATTCTG 


TTGAAAGGGT 


ACGGATTTTG 


CCTGATACTA 


TTAAGGTTAA 


TGGTGATAGT 


^ ^ A £ A 


CTATCCTTTC 


GTGGCAAGTC 


TAACGGTCGT 


GCTTTCCAAG 


TCTATTATAA 


ACTCCAGTCC 


* J J*v 


GACGACGAGA 


AAGAAGCCTT 


TCAAGCTTTA 


ACTGACCTGC 


ATCAGATAGG 


ACTAGAAGGG 


*t t c a a 


AAGCTTTCGC 


AGCCAGAAGG 


CCAGAGAAAT 


TTTGCTGGCT 


TTAATTACCA 


AGCCTATCTG 


23640 


AAGACTCAGG 


GAATTTACCA 


CACTCTCAAT 


ATCAAAACAA 


TCCAGTCACT 


TCAAAAGATT 


23700 


GGCAGTTGGG 


ATATAGGAGA 


AAACTTGTCC 


AGTTTACGTC 


GAAAGGCTGT 


GGTTTGCATT 


23760 


AAGACGCACT 


TTCCAGACCC 


TATGGGCAAT 


TACATGACAC 


GACTCTTGCT 


GGGACATCTG 


23820 


GACACCGACT 


TTGAGGAGAT 


GAATGAGCTT 


TATTCCAGTC 


TAGGAATTAT 


CCACCTCTTT 


23880 


GCCCTATCTG 


GCATGCAGGT 


AGGTTTTTTC 


ATGAATGGAT 


TTAAGAAACT 


TCTCTTGCGA 


23940 


TTGGGCTTGA 


CCCAAGAAAA 


GTTGAAATGG 


CTGACTTATC 


CCTTTTCCCT 


TATCTATGCG 


24000 


GGACTAACTG 


GATTTTCAGC 


ATCGGTTATT 


CGCAGTCTCT 


TGCAAAAGCT 


ACTGGCTCAA 


24060 


CATGGGCTTA 


AGGGCTTGGA 


TAATTTTGCC 


TTGACGGTGC 


TTGTCCTCTT 


TATTGTCATG 


24120 


CCAAACTTTT 


TCTTGACAGC 


AGGAGGAGTC 


TTCTCCTGCG 


CTTATGCTTT 


TATCCTCACC 


24180 


ATGACCAGCA 


AAGAAGGGGA 


GGGGCTCAAG 


GCTGTTACTA 


GTGAAACTCT 


AGTCATCTCC 


24240 


TTGGG CAT AT 


TGCCCATTCT 


ATCCTTCTAT 


TTTCCGGAAT 


TTCAACCTTG 


GTCTATCCTT 


24300 


TTGACCTTTG 


TCTTTTCCTT 


TCTTTTTGAC 


TTGCTCTTCT 


TACCGCTCTT 


GTCTATCTTA 


24360 


TTTGTCCTTT 


CCTTTCTCTA 


TCCACTCATT 


CAGCTGAACT 


TTATCTTTGA 


ATGGTTAGAC 


24420 


GGCATTATTC 


GCTTGGTCTC 


CCAGGTGGCA 


AGGAGACCAC 


TTCTCTTTGG 


TCAACCCAAC 


24480 


GCATGGCTTT 


TAATCTTATT 


CTTAATTTCC 


TTCGCTTTGG 


TCTATGATTT 


GAGGAAAAAC 


24540 


ATTAAAGGAT 


TAACAGTATT 


GAGTTTATTG 


ATTACAGCTC 


TCTTTTTCCT 


TACCAAGTAT 


24600 


CCACTGGAAA 


ATGAAATCAC 


CATCCTGGAT 


GTOGGCCAAG 


GAGAAAGTAT 


TTTCTACGGG 


24660 


ATGTAACTCG 


CAAAACCATT 


CTCATAGATG 


TAGGTGGTAA 


GGCAGAATCT 


TATAAGAAAA 


24720 


TCAAAAAATG 


GCAACAAAAG 


ATGACGACCA 


GCAATCCCCA 


GCGAACCTTG 


ATTCCCTATC 


24780 


TCAAAAGTCG 


AGGAGTAGCT 


AAGATTGACC 


ACCTAATTTT 


GACTAACACG 


GACAAGGAGC 


24840 


ATGTTGGAGA 


TTTGTCAGAG 


ATGACCAAGG 


CTTTCCATGT 


AGGGGAGATT 


CTAGTATCAA 


24900 


AAGACAGTCT 


GAAACAGAAG 


GAATTTGTGG 


CAGAACTACA 


GGCGACTCAA 


ACAAAGGTGC 


24960 


GTAGTATGAT 


AGTAGGGGAG 


AACTTGCCCA 


TTTTTGGAAG 


TCAGTTAGAA 


GTTCTATCTC 


25020 


CAAGGAAAAT 


GGGAGATGGA 


GGACACGATG 


ATACCCTACT 


TCTGTATGGG 


AAATTCTTGG 


25080 


ATAAGCAATT 


TCTCTTCACG 


GGAAATTTGG 


AGGAGAAAGG 


AGAGAAGGAC 


TTGCTGAAGC 


25140 
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ACTATCCAGA 


CTTGAAACTA 


AATCTTTTGA 


AAGCTAGCCA 


ACATCGCAAT 


AAAAAATCAT 


25200 


CAAGTCCAGC 


CTTTCTAGAA 


AAACTCAAAC 


CAGAGCTTAC 


TCTTATCTCA 


GTTGGAAAGA 


2S260 


GCAATCGAAT 


GAAACTCCCC 


CATCAGGAAA 


CATTGACACG 


ACTGGAACGT 


ATCAATAGCA 


25320 


AAGTTTATCG 


AACTGACCAG 


CAAGGAGCTA 


TACGTTTTAA 


GGGGTTGGAT 


ACTTGGAAAA 


25380 


TCGAAAGTGT 


TCGATAGGAA 


GGATAAATGT 


TGTAGATTAG 


TGAAATAAAC 


TAAAAATTTG 


25440 


TTGCATAATA 


ATGATAAAAA 


TGGTATAATG 


AAAACGTATT 


CAATATTGAG 


GATATAAAAT 


25500 


CATTAAAAAT 


CAGCAAAAGT 


TGTTTTATTA 


GTTAGTTTAT 


AATCTATTGC 


TCTTCTTCAG 


25S60 


TCCACTCTAT 


CTGCTGTGAC 


AGTCACTAAA 


AGTTACAAGT 


ATGATTGGAA 


TACGGTTTGG 


25620 


CAATATAGTA 


CCAACTATCA 


CGACCATCAG 


TATGCTTGGA 


TTCCGTCATG 


CTCTCGTTAT 


25680 


GACAGCTATT 


CTGAGTATAA 


AGTTGGCGGA 


CGCTGGAACT 


ACGCTCCTTA 


TGAGGTCATA 


25740 


AACTATTACA 


GCGGAGGCTA 


TTAATTCTTA 


AAGAGTGAGA 


AAAAGGAGGG 


CTAGATATGT 


25800 


TGCAGCTTAC 


TCATGTGACC 


TTAAAAACGC 


GACAAGTCAT 


CTTGCAAGAT 


GTGGATTTCA 


25860 


CCTTTAAAAA 


GGGTAGGGTT 


TATGGTCTTC 


TTGCTATCAA 


TGCCTCTGGA 


AAGACGACCC 


25920 


TGTTCCGTGC 


CATTAGCAAT 


TTAATTCCCA 


TAAGTAGTGG 


AAATATCGCA 


GCCCCTCCTT 


25980 


CTTTATTTTA 


TTATGAGAGT 


ATTGAATGGC 


TGGATGGAAA 


CTTAACTGGG 


ATGGACTACC 


26040 


TTCCTCTTAT 


CAAAAACATC 


TGGAAGTCAG 


GTCTGAACTT 


GACGGATGAA 


AT CGCCT ATT 


26100 


GGGAAATGTC 


TGACTATATC 


AGTCTTCCCA 


TTCGCAAGTA 


TTCCTTAGGC 


ATGAAGCAAC 


26160 


CCTTCGTGAT 


TGCCATCTAT 


TTCCTCAGTC 


AGGCCAAATG 


CTCGCTCATG 


GATGAGATTA 


26220 


CAAATGGCTT 


AGATGAGTAT 


TATCGACAGA 


ACTTTTTTGA 


TAGCCTAGCA 


CAAATCGATA 


26280 


GACAAGAACA 


GCTGGTTCTT 


TTAAGTTCCC 


ACTATAAGGA 


AGAGTTGGTT 


GATCTCTGCG 


26340 


ATAGAGTAGT 


AACCATTCAT 


CAGCGGCACA 


TAGAAGAGGT 


TTAGTTTATG 


AAAGATGTTA 


26400 


GTCTATTTTT 


ATTGAAAAAA 


GTTTTCAAAA 


GCCGCTTAAA 


CTGGATTCTC 


TTAGCTTTAT 


26460 


TTGTATCTGT 


ACTCGGTGTT 


ACCTTTTATT 


TAAATAGTCA 


GACTGCAAAC 


TCACACAGCT 


26520 


TGGAGAGCAG 


GTTGGAAAGT 


CGCATTGCAG 


CCAACGAGAG 


GGCTATCAAT 


GAAAATGAAG 


26580 


AGAAACTCTC 


CCAAATGTCT 


GATACCAGCT 


CGGAGGAATA 


CCAGTTTGCT 


AAAAATAATT 


26640 


TAGACGTGCA 


AAAAAATCTT 


TTGACGCGAA 


AGACAGAAAT 


TCTGACTTTA 


TTAAAAGAAG 


26700 


GGCGCTGGAA 


AGAAGCCTAC 


TATTTGCAGT 


GGCAAGATGA 


AGAGAAGAAT 


TATGAATTTG 


26760 


TATCAAATGA 


CCCGACTGCT 


AGCCCTGGCT 


TAAAAATGGG 


GGTTGACCGC 


GAACGGAAGA 


26820 


TTTACCAAGC 


CCTGTATCCC 


TTGAACATAA 


AAGCACATAC 


TTTGGAGTTT 


CCGACCCACG 


26860 


GGATTGATCA 


GATTGTCTGG 


ATTTTAGAGG 


TTATCATCCC 


AAGTTTGTTP 


GTGGTTGCTA 


26940 
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TTATTTTTAT GCTAACACAA CTATTTGCAG AAAGATATCA AAATCATCTG GACACAGCTC 27000 

ACTTATATCC TGTTTCAAAA GTGACATTTG CAATATCCTC TCTTGGAGTT GGAGTGCGAT 27060 

ATCTAACTGT GCTGTTTATC GGAATCTGTG GCTTTTCTTT TCTAGTGCGA AGTCTGATAA 27120 

GTGCTTTTGG ACAGTTAGAT TATCCCTACC CAATTTATAG CTTAGTGAAT CAAGAAGTAA 27180 

CTATTGGGAA AATACAAGAT CTATTATTTC CTGGCTTGCT CTTACCTTTC TTAGCCTTTA 27240 

TCGTCATTGT GGAAGTTGTG TACTTGATTG CTTACTTTTT CAAGCAAAAA ATGCCTGTCC 27300 

TCTTTCTTTC ACTCATTGGG ATTGTTGGCT TATTGTTTGG TATCCAAACC ATTCAGCCTC 27360 

TTCAAAGGAT TGCACATCTG ATTCCCTTTA CTTACTTGCG TTCAGTGGAC ATTTTATCTG 27420 

GAAGATTACC TAAGCACATT GATAATGTCG ATCTAAATTG CAGCATGGGA ATGGTCTTAC 27480 

TTCCTTGCCT GATTATCTTT TTGCTATTCG CAATTCTATT TATTGAAAGA TGGGGAAGTT 27540 

CACAGAAAAA AGAATTTTTT AATAGATTCT AGCTTTCCTA TAGGTAGGCA AAATAAGTAA 27600 

AAACTAACAT AGAGAGGGAA TCAACTTGAT TCTCTCTTTT TGATTCGAAA ACCAAACCAA 27660 

AATACAAACA CAAACTTTTC AAAAAATAAC TTTTTATCTT CACAAGAGCT ACAAAACTTG 27720 

GTATCATATA AAAGTTGAGA AAAGCAGAAG TGAGAGCTTC TCGCCTTGTG ACATTAACTT 27780 

GCCTGGCCCT ACCGATGAAA AGTTTCGAAC AAACCCTATC ATAACGTGCG GGCTTGTATA 27840 

TTT ACAAGTC CGCTATTGTT TTTCTCTAAT AAAACAAAAG AGCTGAAAAC CATAGCAAAG 27900 

CAAGACTTAT TCATCAATGA TGAGATTCGT CTACGTGAAG TTCCCTTGAT TGGTCTTGAA 27960 

GGAGAACAGC TAGGTATCAA CCCACTCACT GAAGCGCAAG CTTTGGCTGA TAACCCTAAT 28020 

CTTGACCTAG TATTGATTCA ACCCCAAGCC AAACCCCCTG TTGCAAAAAT TATGGACTAC 28080 

CGTAACTTCA AATTTC ACT \ CCAGAAGAA3 CA.-AAACAAC AACG ? AAAAA ACAAAwCCTT 29140 

CTTACTGTGA AAGAAGTTCG TCTAAGTCCG G 28171 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCGCTCAACT TTTGCAATCA AGGCTAAGTA GACAGCAGCA AATTTCATAT TGTATAATTT 
CTGACTCATA CTTCTCTCTT TCTATGTGTA CTAGTATAAA TAAGAAAAAG AAGG CCGTC A 



60 



120 
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AGCCTTCTTT 


TGATTTATTC 


TTCTGCTTCA 


TCTTCTGTAA 


ATTGACTATT 


GTACAAGTCA 


180 


GCGTAGAAGC 


CACCTTGCGC 


CATCAGTTCC 


TCATAGTTCC 


CTTCCTCGAT 


GATATTTCCA 


240 


TCTTTCATGA 


CCAAGATCAA 


GTCTGCATTT 


CGGATGGTTG 


ACAAGCGCTG 


GGCAATGACA 


300 


AAGGATGTCC 


CTCCTTCCAT 


CAAACGGTCC 


ATGGCTTTTT 


GGATCAATTC 


CTCTCTCCGT 


360 


GTCTCAACAG 


AAGAAGTCGC 


CTCATCCAAA 


ATCAAAAGCG 


GTGCATCCTT 


AAGAAGGGCA 


420 


CGAGCAATAG 


TCAATAGTTG 


TTTTTGTCTT 


ACAGACAAGG 


TCACGCTCTC 


ATCCAAGATG 


480 


GTATCATAGC 


CATCTGGCAA 


GCTCATAATA 


AAGTGCTGAA 


TTCCCACAGC 


CTTACTAGCT 


540 


TCCATCATTC 


GTTCATCACT 


AATCCCTATT 


TGATTATAGA 


TGAGATTGTC 


TCGAATAGTT 


600 


CCTTCAAAGA 


CCCAGCTATC 


CTGCAAGACC 


ATTGAAAAGG 


CATCATGCAC 


TTCTGAACGC 


660 


GTCATAGCCT 


TCCTATCCAC 


ACCATCAATG 


CCAATACTTC 


CCTTATCAAT 


CTCATAGAAT 


720 


TTCATCAAAA 


GATTGACAAT 


GGTTGTCTTA 


CCACCCCCAG 


TCGGCCCAAC 


AATGGCAACC 


780 


TTTTCACCAC 


CATGAGCTGT 


CGCAGAGAAG 


TCATAGTCTT 


GAACATTGAC 


ACCCTCCACC 


840 


AGAATTTCTC 


CTCCTGACAC 


CTCGTAGAAA 


CCTCGAATCA 


GATTGACCAG 


AGTTGATTTA 


900 


CCAGAACCTG 


TTGACCCAAT 


AAAGGCCACT 


GTTPGACCAG 


TTTCTGCTTT 


AAAGCTAACA 


960 


TGTTCAATAA 


CTGCCTCCGA 


ATTTGCCGCA 


TAGCGgAAGG 


TCACATCCTT 


AAACTCGACC 


1020 


TGACCTTTGA 


AGTTTTCATC 


AGTCAGCTGC 


ACTTGAACAG 


GGTTTTGGAT 


ACAAGAATGC 


1080 


AAATCTAAAA 


CTTGATTAAT 


CCGCTTAGCA 


GAGACCATAG 


TTCGGGGAAG 


AACCATGAAG 


1140 


ACTGCTCCCA 


TGAGAAGGAA 


GCCCATGACA 


ACCTACATGG 


CATAAGACAT 


GAAAACAATC 


1200 


ATGTCACTAA 


AG AGAGG C AG 


ACGCGCTATC 


GGACCAGCGT 


CGTTAATCAC 


ATAGGCCCCA 


1260 


ATCCAGTAAA 


TCGCCACACT 


CAAACCACTT 


GAAATCCCCA 


TCATCATAGG 


ATTCAAAATA 


1320 


GCCATAAGAC 


GGTTGACAAA 


CAAATTCAAA 


CGCGTCAATT 


CATCATTTAC 


TGCTGCAAAT 


1380 


TTTTCATTTT 


GATAATCCTC 


TGCATTGTAG 


GCACGAACGA 


CACCAATACC 


TGTTAAACTC 


1440 




1 At 1 V» i 1 UUj 






TCAAGGACTG 


TTTTGGAAAG 


1500 


GCTAGCGTCA 


TCAAAACGGT 


CGTCATCAGG 


ACGTTGATAA 


TCACTGCCAC 


AAGTACGGCC 


1560 


CAGAGCCACT 


ATTCTGAATG 


ACCTAAAATC 


TTCCCAATAG 


CCCAGATAGC 


CATAATTGAA 


1620 


CCACGCGTTA 


CCACTTGCAA 


GCCCATAGTA 


ATCAACATTT 


GAACTTGAGT 


AATGTCATTG 


1680 


GTAGTACGCG 


TCAAGAGGCT 


AGGAATTGAA 


AATTTCTTAA 




CGAGTAATCC 


1740 


AAAACTCGGT 


TAAAAATATC 


ACTTCTCAGC 


CTACTAGTAT 


AAGAAGCCGC 


CACTCGGGAT 


1800 


GCAAAAAATC 


CAACTG CAAC 


TACGGACAAG 


AAGGCAAGAA 


AGGACATTCC 


CATCATCATG 


1860 


CTTGCCGACT 


GCCACAACTC 


ATCTAAATTA 


GTTTCTTGAC 


TACCTAGCAA 


ATCCGTAATT 


1920 
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1 1 CGAG AT AT 


Abli I LVAiLAL 


TTCCAACTCT 








1 Qfift. 


GCTAGTAAAA 


TCATCCCCCA 


TTCTTTTCTA 


CTAATTCTTT 


TGGCTAATTT 


CTTTATTCTC 


O ft A ft 


TCCTCCTATT 


CCCTTGATAT 


TTTGCCTGTA 


GTTGACCGAG 




AAAATCAGTA 


2100 


ATT CATCTT C 


ft ft ft iiuwviMifi 

ATCAATCTCT 


TCCATCAACT 


GCTTGTCTAT 


GCLi 1 TCAAAA 


ft ft h ^^^wm ft ft 
AAACCCTTAA 


2160 


CCTGTTGCAT 


CTGAGAACGT 


GCTTTv/TCCG 


^ ^ ft ft ft 

TCAGACGAAC 


AAAL 1 1 AGCC 


CUWTTATCAA 


2220 


CAGCACTCGC 


CTCCAATTCC 


ACCAAACCAT 


TTTGCACTAT 


«l 0*0tmW% * % 0*0* 

ACGCTTAACC 


Ih 0* ft ft 0+ 

AGATTACTAG 


2280 


C AACAGGCTT 


GGTAATATTG 


AGTTCCTGCT 


CGATATCTTT 


AATCAAGACC 


AAGTCTTGGT 


2340 


TTTTCTCGCG 


ATTATCCAAA 


AAACGCACAA 


CCTGACCTTG 


CGGCCCACCC 


ftMtft ft ftMwn**ft ft 

ATAAATTCAA 


2400 


TGCCGCAACG 


TTTGGCTTCC 


TTTTGCACCA 


TCAGGTGAAT 


TTGATGACCA 


AAACGCTTAA 


2460 


AGACTAACAT 


CGGTTTATCC 


ATAATCTCCC 


CCTTCTAAAT 


AAAAATAGTT 


CTCTGGAGAA 


2S20 


TAATTAAATT 


TCTATGAGAA 


CTATTTTCTT 


CATTAAAAAA 


ATCCCAAGTG 


ATTTTCTCAC 


2S80 


TTAGGATCAT 


GTTCTATAGG 


TTAAATTAAA 


ACCCATCTAC 


GTTCGTATAA 


ATCTTTTGGA 


2640 


CGTCTTCGTC 


GTCTTCAAGA 


ACGCTGTAAA 


GTIIITCAAA 


GGTTTCAAGG 


TCTTCCCCTC 


2700 


ACAATTCCAC 


TTCTGACTGA 


GCAATCATTT 


CCAATTCAGT 


CACTTCGAAT 


TCTTCAATAC 


2760 


CAGACTCACG 


CACGGCAACG 


ATAGCCTTGT 


GAACGTCACT 


TCGCGCTGTG 


TAAACTCTGA 


2820 


TTGTACCTTC 


TTGTGCTTCT 


ACGTCATCCA 


CATCCACATC 


CGCTTCGAGC 


AATTGCTCAA 


2880 


AGACTGCCTC 


CGCATCTTCA 


CCTCCAAATA 


CAATAACACC 


TTTCTTGTCA 


AAGAGGTAAG 


2940 


AAACAGAACC 


TGAAGCCCCC 


ATGTTTCCGC 


CGTTTTTACC 


AAAGGCTGCA 


CGGACATTGG 


3000 


CTGCTGTACG 


GTTGACGTTA 


gaagtcaaag 


TATCCACAAT 


TAGCATAGAG 


CCATTTGGCC 


3060 


wAAMC w I I C 


GTAACGTCCT 


TCTGTAAAGG 


TTTCGTCTGT 


G I"l 'l l C w I -1 !' 


GCTTTATCAA 


3120 


TCGCTTTATC 


GATAATGTGT 


TTTGGCACTT 


GGGCTTGTTT 


tk 4h 0*0*0**tl0*0* 

AG CACGGTCG 


ft (t* » n ft ft imm 

ATAACGAATT 


3180 


TCAAAGCTGA 


GTTTGATTCT 


GGATCTGCAT 


CACCTTTTTT 


AG CTGCTACA 


TACATTTCTA 


3240 


CACCAAATTT 


TGCATATACT 


TTACAGTTAG 


CTCCATCTTT 


AGCCGTTTTC 


TTGGCTACGA 


3300 


TATTGGCCCA 


TTTACGTCCC 


ATTAGGAATC 


TCCTTTTTTC 


ACATTTTAAT 


CTTTCTTATT 


3360 


ATAACACAAG 


TTTTTTTGAT 


TTTCACTAGA 


GGAAATGGAT 


TTTATTAGCA 


AATCAAGCTA 


3420 


GGATAGCACT 


TTACCTGCTA 


AGATGCTCTT 


GCCTTTCTAT 


CTTTATCAAC 


AGGCACTCAT 


3480 


CCACATTCAA 


AAAACAAACT 


AGACCATTAT 


CTGCAAATAG 


AAAGTTTCAG 


CCAAGTTTGA 


3540 


CAAAGTCAGC 


TCAAATTACT 


GTTTGAAGTT 


TGTAGATATA 


AGCGACAAAA 


ACAATCATAC 


3600 


TCCACCTTTT 


GTTGACAGTC 


TACTCCAGAC 


ATATCATAGT 


TCAAGTAAAT 


ACTTTCAAAT 


3660 
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TCAACAGTTC TTATAGGCGC TATTGTATTC TAAGAAATCA ATAGAAQAGT TTCTAAGCAA 3720 
ACCTCTAATA CTCAATAAAA ATCAAAGAGC AAACTACAAA GCTAGCCTCA GGTTGCTCAA 37 80 

AACACTGTTT TGACGTTGCC GATGGGGCTG ACATGGTTTG AAGAGATTTT CGAACAGTAT 3840 
AATTTACGTG TTCCCAAGAT GGAGAAGTTA GACTAGTACA CTGGCACTTC TAAAACATTG 3900 
CTAGCAATTG ATTTGTTCAT ATTTAATTTC ATTTTTTCCA TAAATCGGTA TTAGATATAA 3 960 

ACAGCAAAAT ATTTCCGATA CGTGTCGTTC TTGAATTTCC AATCATCTAA AACAAGTAAA 4020 

GGATAATCAA TCCCCTGTAT ATCAAGGAAT TGGCTACCCT TTTTACTTTT TTACACATTC 4080 

TGTTTGATAG ATTC A TTT T A ACATCACGAG CATACTCCAA TGGAAATCGC TAGGCAAGAG 4X40 

ATAAACTTTC AGATATCCGC AGAGAGATCA TCCCCTCTTT TTGTCCCAAG CATTCTCCTC 4200 

TCCTAGTCAT TTTCTACCTT ATCTTCTACC TGAGGATAGA CACTTGTTCC CCAAATAGAA 4260 

ATCGTCCGCT TACGCACTAG TGGCAAATCG GTTTTTTCAT AAACCGTACG CCACCATTCC 4320 

CAGGCAAGCC CGGTACACTC TCTAATTTT G ACAGAGAGAT TACGAACATT CC C TTTT A AA 4380 

GGAATACTAG TGGTAAACTG AGCCGTTAAA TCCTGCCCAT TTCTGTCCCA AGCCTTAGGA 4440 

GTCAAGACTT CCTTACCTTG ATGATCATAG GATAATTCAT TCCAACTAAT ATAATATTGG 4500 

GCAACATAGG CACCACTATG ATCCAGCAGT AAATCTC CGT TTCTGTAAGC TCTAACCTTA 4 560 

GTCT C AACAT AGTCTGTACT ATTTTGAAAG GTCGCAACTA CATTGTCACG TAAAAAAGAA 4 62 0 

GTTGTATAGG AAATCGGCAA GCCTGGATGA TCTGCTGTAA AGCGACTGCC TTCTTGAATC 4680 

AACTCCTCTA CCATATCCAC CTTGCCTGTT ACAACTCCGG CACCCGAACT TGCGTCGCCC 47 40 

CCTAAAATAA CCGCCTTCAC TTCTGTATTC TCCAAAATCT CTTTCCACTC TGTCTGAGGA 4 800 

GCTACCTTGA CTCCTTTTAT CAAAGCTTCA AAACCACCCT CTACTTCATC ACTCTTACTC 4860 

GTGGTTTCCA ACTTGAGATA GACTTGGCGC CCATAAGCAA CACTCGAAAT ATAGACCAAA 4 920 

GGACGCTCTG CAGAAATTCC TCTCTCT T TT AAATCCTCTA CCGTTACAGT ATCTTGAAAC 4980 

ACATCTCCTG GATTTTTAAC AGCATCTACG CTG ACTGT AT AATAAATCTG CTTAAAATTA 5040 

ACAATCTCAA TCTGCTTTTC GCCTGAATGG ACAGAGTTAA AATCAATATC AAGAGAATTC 5100 

CCTGTCTTTT CAAAGTCAGA ACCAAACTTG ACCTTGAGTT GTTCCATGCT CTGAGCCGTG 5160 

ATTTTTTCAT ACTGCATTCT AGCTGGGACA TTATTGACCT GACCATAATC TTCATGCCAC 5220 

TTAGCCAACA AATCCTTTAC CGCTCCGCGA ACACTTGAAT TGCTGGGGTC TTCCACTTGG 52 80 

AGAAAGCTAT CGCTACTTGC CAAACCAGGC AAATCAATAC TATAAGTCAT CGGAGCACGA 5340 

TCGACCGCAA GAAGAGTGGG ATTATTCTCT AACAAGGTCT CATCCACTAC GAGAAGTGCT 5400 

CCAGGATAGA GGCGACTGTC GTTGGTAGCT GTTACAGAAA TATCACTTGT ATTTGTCGAC 5460 
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AAGCTCCGCT 


TCTTTCTTTC 


GATAACAACA 


AACTCATCGG 


GTAGCTCATT 


ACCCTCTTTG 


394V 


ATGAAACGAT 


TTTCAATACT 


TTCTCCCTGA 


TGGGTCAAGA 




ATCGTAATTC 


ce oft 
330U 


ATAGCTAGTA 


TAAAGTCATT 


TACTGCTTTA 


TTTGCCATCT 


TCTACCTCCT 


AATAAvj 




TGGATTGAGT 


TGCATAAACT 


CAGACTTGTT 


CAGCGAAATC 


AGCCGTGCTT 


GGACTAAGTA 


3 /uu 


ATCCAAAATT 


TCCTCCTACA 


ATTCTTCTGA 


GACATTGCCT 


CGCCGTCTGG 


CTAAATAAGA 




AGTCGGAATG 


ACCGTATTAT 


CCAACATAAA 


TACCTTATCT 


AAGTCAATCA 


AGGTTGGTCT 




TGTAAAAGGA 


TTACGAGCTA 


GATCCGGCTC 


TTCTATCATA 


AAGTl'C "1TG A 


CCAAACGTCT 


5880 


GGTCAAGAGA 


GCTGGTTTGA 


AGGTCTGATT 


TTTAACCAAC 


TCTTTGTTTT 


TAGTCATGCT 


5940 


GTTGTCAATA 


CAGATATACA 


TATGATTCTT 


CACAGCCAAA 


TCGCTACTAA 


TAGTCGGAAA 


6000 


AGGCAAATAA 


AGAGCTACAA 


CATCTCCTCT 


CTTAATCAAG 


CAAGAGCACC 


CCCTTTTCTC 


6060 


CTAATGTAAC 


ATACACAGGA 


TTGACCAAGT 


CTTCTGATTG 


ACTCAGAATT 


TCCAAAGTTT 


6120 


GAGTTTGGCG 


CGCTGTCAAT 


TTACTAGCAT 


CTTGTCTCTT 


CAATACAAAA 


TGCTTGTCGC 


6180 


CAATAACCTT 


GACAATATAA 


TCCTTCTCCA 


AACCTGACTG 


GTAAATCCAC 


ATCAGATCTT 


6240 


GTCTGTCCTG 


AGAACTCAAG 


AGAGAAGGAT 


TTTCAAGCCT 


CCCGATACTC 


TGATAAAAAT 


6300 


CAAAAACAGG 


AGCTAACTCC 


TGCCAATCTG 


ATTGGCTAGT 


TCTCAAGGCT 


AGAAAAAGGG 


6360 


CTTTGCGAGC 


TCATACTTCT 


TGGTTAGCCT 


TCAGAGTTAC 


TTTCCCCTCC 


AAG'ITTTTTA 


6420 


GAAATCGGGA 


AACTCCAGAA 


AGCAAATTTT 


TCTCTAACTG 


CGAGAAATAA 


AAACCTTTCG 


6480 


TTCCCAGACA 


TAAGTCTTTC 


ATGTCGCTTT 


CTCTAGCAAA 


TAAGAGCTCA 


AA C ATTTGAT 


6S40 


AGTAAAAGAA 


AAATATCTGG 


CACTGGGTCC 


CGCTCATCTT 


TTCCTTATCG 


GCTTCTTTTT 


6600 


TTAACCAGAG 


CAAGGGCGAC 


AGGTAGCTGG 


ATTGAGACAT 


TTCCTCTACC 


TCCTACTCTT 


6660 


TTTTAACTGG 


AGCATCTGCA 


CTACCTGCCA 


CTTCTTTTGA 


CTGGATACTT 


TCCCACTGGT 


6720 


TAATCTCCTC 


TGAGATAAGA 


CCTTCGCATG 


TCTTGACAAA 


TAGGGCAAAA 


GCCTTGGTCT 


6780 


TTCCTGCATA 


TTTCTCCGTT 


TGGCATTGAT 


AGAGGAATTT 


TTCTTTCTCC 


AGGAGTTGCG 


6840 


CAGTTTTTTG 


GTAAGAAATC 


CAATTTTCCT 


TTGCATTATA 


CAAATTGATA 


ATCCCCTCAC 


6900 


ACAGCAAGCC 


CAGACTGGAT 


AAGGCAACCG 


AAATCAAACG 


GTAGCGATCA 


CCTGGCATAG 


6960 


GAATAGCACA 


AAAGACAGCT 


ATGAGGAAAC 


CTGCCACGAT 


TTCTGTTATT 


TTTAATACCT 


7020 


TATAGCGCCT 


ACGATGTTGA 


ACGCTTTTCT 


TTAAAAAATC 


AGCTATCTGT 


ACGTCTAATC 


7080 


GCTCTGTCAG 


GTACATTTCT 


TCTGGCGTCA 


TATTCGTAAC 


TCCTTTCATT 


TACTTTGATA 


7140 


ATCAGGG 












7147 
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(2) INFORMATION FOR SEQ ID NO: 24: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



« 



CCGCATGGGA 


TTGGTGTCCT 


TTTGGGCAAT 


CTCTTTGACC 


AAACTGGAAA 


CATGTTTTAT 


60 


GCGCCTGCCT 


TTACTGCCCT 


TGTCGGCGGT 


ACGTCTATAT 


GATCCTAGTC 


GCAAAAGTTC 


120 


CGCGCTTTGG 


AGCCATTACC 


ACTATCGGCC 


TTCTCATTGC 


CCTCTTTTTC 


TTGGGAACTA 


180 


AACACGGTGC 




CTTCCTGGAA 


TTATCTGTCG 


CCTCCTACCA 


CATGGAGTAG 


240 


CTCATTTAGG 


AAAATACAAG 


GACAAAACAA 


AGAACTTCCT 


TTCTTTCATT 


ATTTTCGCCT 


300 


TTACTACAAC 


AGGACCAATC 


TTGCTTATGT 


GGATTGCGCC 


CAAAGCCTAT 


ATGGCTACTC 


360 


TTCTGGCAAG 


AGGAAAATCC 


CAAGAATATA 


TCGACCGTAT 


CATGGTCGCT 


CCAAACCCTG 


420 


GAACTGTCCT 


TCTATTTATC 


GCAAGTATTG 


TCATCGGAGC 


CCTAGTGGGT 


GCCTTGATTG 


480 


GACAAGCCTT 


GAGTAAAAAA 


TTTGCCCAGA 


AAATCTGATC 


AGTTAAAAAG 


AGCCACGCGG 


540 


CTCTTTTTTA 


TTTATGGCTC 


AATTTCTTAC 


TCAAGAAATC 


TCCCAAGAAT 


TCGATTGCAA 


600 


AGATAATCAA 


AATGATAATA 


ATGGTTGCCA 


AGATGCTCAC 


ATCGTGATTG 


TAGCGGTTAA 


660 


ATCCATAAGC 


GATGGCTACG 


TTACCGATAC 


CACCAGCTCC 


AACCGCACCG 


GCCATAGCTG 


720 


TTtcCCAACA 


AGGGa AC CAA 


GGTcACAGTC 


GTCAC 






755 



(2) INFORMATION FOR SEQ ID NO: 25: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTCAATTGGT ATCTCAATCA ACGGTCTTCA CATGCTTTCA ACTGGTTTGA CTCTTGAAAA 60 

AGCGAAAGCT GCTGGTTACA ACGCAACTGA AACAGGCTTT AACGATCTTC AAAAACCAGA 120 

ATTCATGAAA CATGACAACC ATGAAGTAGC AATTAAGATT GTCTTTGACA AAGATAGCCG 180 

TGAAATTCTT GGTGCCCAAA TGGTTTCACA TGATATTGCA ATTAGCATGG GAATCCACAT 240 

CTTCTCACTT GCTATCCAAG AGCATGTGAC AATTGATAAA TTGCCATTGA CAGACCTCTT 300 
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CTTCTTGCCA 


CACTTCAACA 


AACCATACAA 


CTACATCACA 


A luVK. 


««T»ft. f*mr**vf*\ 


360 


AAATTAAAAA 


TGAATGAGCT 


ATCTGGCCTT 


AAGTTAAGGT 


(.nviA 1 Awl 1 1 


TTAGCTAATT 


4">0 


TGTCCCCATA 


CAATTATAGT 


TTTTTTATCT 


TGTGCTTCAT 


TCTGTTCTGA 


Li l AAAATUn 




AAAGGTAGCT 


ACCAATACAA 


ATGATGAGGA 


TAAAACAAAT 


A ft ft ft ft ft fM 

GACTGAAAAT 


»«tM»ft MfcJ^ ft. ft A 

CGaTATGAAC 


e a a 


TAAATAAAAA 


CTTGGCACAG 


ATGCTCAAGG 


CTGGTCTTAT 


TATGGATGTG 


^ ft ^ ft ft m m^mm 


£ n a 
OUU 


AACAGGCTCG 


TATCGCAGAA 


GCTGCTGGTG 


CGGCAGCTGT 


GATGGCCTTG 


GAACGAATTC 


660 


CCCCTGATAT 


TCGTGCAGCT 


GGAGGACTTT CCCGCATCAG 


CGACCCAAAG 


ATGATTAAGG 


720 


AAATCCAAGA 


AGCGGTTAGT 


ATTCCAGTAA TGGCTAAGGT 


CAGAATCGGG 


CA'lTTltaTTU 


780 


AAGCTCAGAT 


TTTAGAGGCT 


ATTGAAATTG 


ATTATATCGA 


CGAGAGTGAA 


GTTCTATCTC 


840 


CACCTGATGA 


CCCTTTCCAT 


GTGGACAAGA 


AAGAATTCC.A 


AGTTCCTTTT 


GTCTGTGGTG 


900 


CTAAGGATTT 


CGGTGAAGCC 


TTGCGTCGTA 


TCGCTGAAtiG 


TGCTTCCATG 


ATTCCTACCA 


960 


AAGGAGAACC 


AGGGACAGGG 


GATATCGTCC 


AAUL. l\* 


TCATATGCGT 


ATGATGAATC 


1020 


AGGAAATTCG 


CCGCATTCAA 


AACTTACGTG 




TTATGTTGCT 


GCCAAGGATT 


1080 


TCCAACTCCC 


TGTAGAATTG 


GTCCAATATG 


TTCATGAACA 


TGGAAAATTG 


CCACTTGTAA 


1140 


ATTTCCCTGC 


TGGAGGTCTT 


GCAACGCCAG 


CAGATGCTGC 


GTTAATCATG 


CAATTAGGGG 


1200 


CAGAGGCGGT 


CTTTGTCGGT 


TCAGGTATTT 


TCAAGTCAGG 


AGATCCTGTT 


AAACGAGCGA 


1260 


GTGCCATTGT 


TAAGGCTCTC 


ACTAACTTCC 


CTAATCCTCA 


AATCCTAGCT 


CAAATCTCTG 


1320 


AAGATTTACG 


AGAAGCCATG 


GTTGGTATTA 


ATGAAAATGA 


AATCCAAATT 


CTCATGGCTG 


1380 


AACGAGGAAA 


ATAGATGAAA 


ATCGGAATAT 


TGGCCTTGCA 


AGGCCCCTTT 


GCAGAACATG 


1440 


CAAAAGTGCT 


AGATCAATTA 


GGTGTCGAGA 


CTGTAGAACT 


CAGAAATCTA 


GATGATTTTC 


1SO0 


AGCAAGATCA 


GAGTGACTTG 


TCGGGTTTGA 


TTTTGCCTGG 


TGGTGAGTCT 


ft /"» ft ft ft TV***"* 


X9DU 


GCAAGCTCTT 


ACGTGACCAG 


AACATGCTAC 


TTCCCATCCG 


AGAAGCCATT 


CTATCTGGCT 


1620 


TACCAGTGTT 


TCCCACCTGT 


GCGGGCTTAA 


TTTTCCTGGC 


TAAGGAAATC 


ACTTCTCAGA 


1680 


AAGAGAGTCA 


TCTAGGAACT 


ATGGATATGG 


TGGTCGAGCG 


TAATGCTTAT 


GGGCGCCAAT 


1740 


TAGGAAGTTT 


CTACACGGAA 


GCAGAATGTA 


AGGGAGTTGG 


CAAGATTCCA 


ATGACCTTTA 


1800 


TCCGTGGTCC 


GATTATCAGT 


AGTGTTGGTG 


AGGGTGTAGA 


AATTTTAGCA 


ACAGTGAACA 


1860 


ATCAAATTGT 


TGCAGCCCAA 


GAAAAAAATA 


TGTTGGTAAG 


TTCTTTTCAT 


CCAGAATTGA 


1920 


CTCATGATGT 


GCGCTTGCAC 


CAGTACTTTA 


TCAATATGTG 


TAAAGAAAAA 


AGTTGAGATT 


1980 


GAATTTCTCA 


ACTTTTTTAC 


ATGTAATAAA 


CAATAGCGAT 


GTATTGAAGT 


GCGGACGCAG 


2040 
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CTACGATAAA GAGATGCCAA ATCATGTCGA AATAAGCTTT TTTCTTGGCA TAAAATCCAG 2100 

CTCCAACTGT ATAACAGACT CCGCCAGTTA CCATGAGACT CCAGAAAACC GGTGTCGTTT 2160 

GACTGATAAT GGCAGGAATG ATAGCCAGAA CCAACCAGCC CATAATCAGG TAAAGAGCAA 2220 

CGCTAAATTT CTCATTGACC TTTTTACCAA AGATTTTATA GAGAATACCA AAGATGCTCG 2280 

TTCCCCATTG GATGACAATA ATCAGATAGC CAAACCAGTT ATTCATCAAG GTCAAGACAA 2340 

CCGGCGTGTA TGAGCCGGCA ATGGCAACGT AAATCATAGA ATGCTCAATG ATTCGCAAAA 2400 

CATATTTGTG CGTCGAACCA TAGGCCATAG AGTGATAAAT GGTGGATGAT AGGAACATGA 2460 

GAAAGAGACT GATGACGAAA ATGGAAACGC CGATAGAGGA TAAAAATCCG TGTGCTTCAT 2520 

AACTATAGAT CGATGAAATA GGCAGCAAGA TAAGCATGAT GACTCCACCC ACAGCATGGG 2580 

TCACGCTATT AGCAATCTCC TCTCCAAAAC TGAGTTGTTT GCTGAGTTTA AGACTAGTGT 2640 

TCATTGGATT ACCTCCTCTT GAGTATGATC GATTAAGTCT AGACTTTGAT GATAGAGTTT 2700 

AACGGTTTCG CACCTGGTTT GGATAATAGG GTTAGCTGCG TCAATTCCTT GGTTCATGTA 2760 

GTCCACAAAA GCATCGTAGA GTTCCTCTGA ACTTGCTTGA CTTTGTAGAG TATTAAGTGT 2620 

CTGGGCTATT TCTTGAATAG AAAATACAGA CTTGAGGGTT CTGATAGCAA TCAAACGCGC 2880 

AATCTGTTGC CGTTGGTATT TTTTTTTGTC AGGCTTTCTC AGGTAACCAT TTTTCACATA 2940 

ATTGTTGACC ATACATGCTG TTAGGCCCTT GTCTTTATTA GGAGAGATAG GGGCGCAGAC 3000 
CTGATTGACA 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15213 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



3010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CATAAATCGC TGCAAATAAC TTAATAGTGA AGTAGCCATT TCTTTCGTAT TTACCTGAGG 60 

CATATTCCCT AGACGAAAGA ATATTATTAT CAATCAAATC ATTGAATGAA CGTAGTCTTT 120 

CAACTTCTTC TACTGTTAGA TTTCTGACAA CATTTGTTGC ATAGACCTTA TTTCCATCAG 180 

GATCAGGATG GTACTCATTT GTAACTTTTC TAAGAAGTTG TTGTTTTTGA TTCGTATCCA 240 

ATTTAAGAAT TGAATTTCCT TCGAGATATT CCAACATATA AACAACGTCA AACATGTTCT 300 

GGACATATTG CTTCAAATCA TCTGCATTAT TAAATCTTGT AGTTGGATCA AGTACTTGTA 360 

ATCCTCGACT TTCTGTACTA TCAGATTTTG AATGTTTCAA GATGGACTTG ATGCTAATGG 420 
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TGCATGGTCT 


GGTGCTTGTA 


ATAATCCTTT 


AGCAAAGAAC TCTGCTCCCA 


480 




TCGACCATAT 


CCTCCAAGAT 


AAATGTCCTG 


ATCTGAGTCA TGTGTCATCT 


540 




AGTAATAGCT 


CCATCCTTAT 


CCAACATTCG 


ATAACCCATA TAATAAACTG 


600 


CATCALL 1 vj 1 




CCGTGTTGAT 


TATGCCCAAC 


TTTATTTCCA ACAGGTCCAA 


660 


AGAAATUTTW 


PATTCCJVGGA 


TTTGGATTAT 


CAAAATCTGC 


CACTTCTGTA GCTTTCCCTA 


720 


CGGTATTATC 


ATCGtUww* 1 


TTATAACCAT CGTAAAGCAA 


AATATTTCTA TAAAGTTTTT 


780 


CACGTGCATT 


GT CGTCT AAA 


ATACOA1 


AATAATCGTA 


GTGATCTCGC TGACCTTTGC 


840 


CTGTTTCACG 


LuuA 1 1 1 1 V. 1 


TWAALnAAAI 


CATTGAGAGC 


CTTGCCCGCT TTATGGTCAC 


900 


TACTGCGGTA 


GCwATtAl AA 


GCTCCAAA1L 


PT A.H ACT AG A 


CATGCTCGAG ATGACAAATA 


960 


CGGATC T CTC 


TGGCAAuulX 


AGGACAww* 


AGACCATATT 


GCGGTATTTC CATGTGGCAC 


1020 


TCGTGATACG 


ATCATAAACA 


CLuAJAuAAl 


ACTTGGTGCC 


AGCTAACCCT TGCTTCGTPT 


1080 


TCACCTCTTC 


GATAGTGGAT 


TTTTCTTCGA 


CAATGTAAGC 


CTTAGTCTCT GATTTAAACC 


1140 


ACTCATTATT 


GCTTGTATTT 


GOlAAAAAUA 


^ A * * *■ * 


ATGTTCCAGC GTGCTAAACA 


1200 


AATCTGTCGT 


ft *IW **l 1 ft. 

TCCATGTTt»A 




TGATACCATA 


ACTATCGACA TTATTCTTAC 


1260 


CTAGAAGATT 


GTT AAAGC C A 




ACTCAATCAG 


AGTATCTAAT GGTGAAGCAT 


1320 


TCCCCTTACC 


AAAGAAuTLu 


AAATGGTACA 


GAACTAvai * l. 


TTTGACATTC ACCTGACCAT 


1380 


AGCTAAAGTT 


ATACUAtUU 1 


TCCAGATAGG 


TCAAGCCAAG 


TAGCAAGGCT TCCTTGTTGC 


1440 


GTTTGATTTT 


ft t»v' "1" 

ATCTALAAtiA 


TAACCTTCAG 


TGACGGGGTT 


AGCACTAGCC AGTCCAGCAT 


1500 


CCGCTGACAA 


GAGTTTTTTC 


AAACTGTCTT 


CCAGTTGTTG 


'ITT'lUTTTTG GCGAACTGGT 


1560 


CTTCTAGATA 




TGCTTGACCT 


TTGGACAAAT 


ACCCAGCGTC TTTCTGATGG 


1620 


t_ 1 It l\inA Iu 




TTTTGTAAGT 


CAGGTAAGAC 


TTGCTTGATG ATAGACGTTT 


1680 


GGTCATACAG 


GAATTCGTTT 


GGCGTATAGA 


GAAGTCCACT 


ATTGCCCACA CTATATTCTG 


1740 


CTAATTTGGC 


GAAATCATTC 


TGGTATTTGA 


GATCCAGCTT 


CTCAGATAAA TCATCCTTCT 


1600 


AGTGAAGCAA 


GACTTTGTTT 


GCAGTCTGTT 


TGTTAGAAAC 


AATGTCTGTG ATCACTTGGT 


1860 


TGTCCTTCAT 


CATGACTCCT 


GACAAGAGTT 


CTTTTTGATA 


TAAAAGACTG TTCTCATTGA 


1920 


CCAGGTTTCC 


GTATTTGACG 


ATGGTTGCCT 


TGTTGTAGAA 


AGGTAGCAAT TTTTCAATGT 


1980 


TTTTATAAGT 


CAAGTTGCGC 


TTAGCTTGAT 


AATAGGCCAC 


CTTAGAAAAA TCACTGTCTT 


2040 


TTTTGCCACT 


TGTTGAAAGT 


GGCTCCACTG 


TTGGTAAAAT 


GAGAGGATTG ATTTCTGCTT 


2100 


TTTTGCTTGC 


AATTTGAGAA 


GCATCTAGCA 


TTGTTCCTCT 


TTCTTCAAAG GATTCCTTGC 


2160 
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TGACGACCTC 


ATCCTTGACC 


AAGGTGACAT TGTAGACTCT 


GTTGGCCTTG 


CTGCTGAATG 


2220 


TGTCCTTTAC 


CTTCATTTCG 


TTATAGTGGT 


AACCAGTGAT 


GGCATTTCCG 


TTGGTTACAT 


2280 


TAACATCGCT 


GAGAACATTG 


GTCAAACTTC 


CAGCATGCCT 


AACATCACCA 


GAAGTTCGAT 


2340 


CCCACAAATT 


GCCTGCCACT 


CCAGCGACTC 


TACCAAAGTG 


CTTGACATTG 


TTG AT ATCAC 


2400 


CTTCAGCATA 


GCTATCTTGG 


ATCTGTGCAT 


CTCGGTCTAC 


TAGGCCTGCA 


AGTCCACCCA 


2460 


CAGTCTCATC 


TGAAGTATTT 


GTGTTAGATG 


AAATGGCTAC 


TGTCGCTTTT 


GACTTAGTAA 


2520 


GTAAAGCCTT 


GTCACCTCTC 


AAATGACCGA 


CCATACCACC 


GATATTGTAG 


GCAGCAGTCG 


2S80 


TTTCATAAGT 


GTTGATAATT 


CTTCCCTTGA 


AACTGCTCTC 


TGTGATGCTT 


GATTGCTCAG 


2640 


CCTTAGCCAG 


CAAACCACCG 


ATACCACCTT 


CACCAGCCAG 


AACACCATCG 


ACGTGAACTT 


2700 


GCTTAATTTT 


TGTGTTATTC 


TGAGCTTCAT 


TTGCCAGTGA 


ACCGATATCA 


TCTT TCCCTG 


2760 


AAATAGCAAC 


ATTTTTTAGA 


CTCAGTTTTT 


CTACTGTAGC 


ACCACTCAAG 


TTTTCAAACA 


2820 


GAGGTTTTTT 


CAAATTATAG 


ATAGCATAAT 


TCTTGCCATC 


TTTTTCACCG 


ATTAAACGAC 


2880 


CAGTAAAGGT 


GTCCTTGATA 


TAGGATCTTT 


CATCAGGACC 


AAGCTCCACT 


TCCTTAGCAT 


2940 


TCAGGCTGGC 


CGCTAAATGA 


TAGCTTCCAG AGGGATTTTG 


GTTTATAGCT 


TTGACCAGAT 


3000 


T ACT AAAGG A 


AGTAAAGTTT 


GTTGTTTCTT 


CTGTTCCCTT 


CTTAGCT AG A 


TAGAAGGTAA 


3060 


AATTATCTTT 


ATATCTGCTT 


TCTATCTCCT 


GCTGAAGCTT 


CTCTACTTTT 


GCTGTGATTT 


3120 


TATAAAGGAT 


TTTATCATTT 


TTTCTTTCCT 


CTGATATTGA 


TGCTACTGGT 


AGGTATACAT 


3180 


CTTTGAATGA 


AGAAGATTTC 


ACTTTAACAA 


AGTAGCTATT 


TGGATTGCTT 


GGAACTTGCT 


3240 


CTAACGAAAT 


GTGTTGTTTA 


TAAGTACCAT 


TTCACAAACT 


GTATAACTCT 


AGGTCGGAAA 


3300 


CATTTCTTAA 


TTCAAGTCTT 


TTCTCTGGTT 


CTTCTACCTT 


TTTATCAGGG 


TCTAGTTCAT 


3360 


TTTCTTGTTT 


AATTTCTTCG 


TTTCCATTTG 


AATTGGATGT 


GTTTGATTCG 


GTTGAAACAT 


3420 


CCTCAGTTGA 


ATTTCCGTTT 


GATGGTTCTG 


GTTCTGTTTG 


TCCATTCTCT 


GATGTTGTAT 


3460 


TACCTGAATT 


TTCTGGTTTT 


CTTGCAGTTC 


CGTTTTTTTC 


TGGTTGATTT 


GATTCTTCAA 


3540 


CTGGTGGTTT 


TGAATCACTA 


GGTTTATTGG 


ATACTTCTCC 


AGTATTTTCG 


TTAGCTATTT 


3 600 


TCCCAGAGTT 


TGTTTGTGTT 


TCTTCTGCAG 


GTTGAACTGG 


TTTTTCTGTT 


TCTTGATTTG 


3 660 


AGGTACCTTC 


TACTGTGCCT 


TCATTTGGAT 


TTACTGGAAC 


TTCTTCTACA 


GTTTTTTCTG 


3720 


AATTTTCATT 


TTTAGAGTCA 


TTATGTTCTG 


GTTTATTTGA 


TTCTCCAACT 


GAGGTTGTCC 


3780 


AATCACTAGG 


ATTACTGGAC 


ACTTCCCCAG 


TATTTTTGCT 


AGATGTATCT 


CGTGATACTT 


3840 


TCTCTGAATT 


CGTTGTTGAT 


TCTTCTGCAG 


GTTGAACTGG 


ATTTTCTGCT 


TCTTGAATTG 


3900 


AGGTTCCTTC 


TGTAGTACCT 


TCATTTGGAT 


TTACTGGTGT 


TTCTTCTGTT 


1 GGTTTTACTG 


3960 
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GAACTTCTTC 


AG J W u I L Tl L TtT 






CTCAACCGCA 


GTTTCAGGTT 


4020 


TTACTTGCTC 


AATATTACCC 


TTATATTCTG 


f^AJlflPfSCTGC 


T AC CTGCTGT 


GGTTCACCTT 


4080 


TATCACTTAC 


CACAGTATCT 


GGCGAGTCTu 


\j 1 lunAUL I L 


ACTCTf^ACCT 


TTGTCGCTCA 


4140 


CAACTGCTTC 


GGGTAATGTA 


GGTTGAACTT 






ACTACAGCTT 


4200 


CGGGCAACTC 


AGGCTGAATT 


GCGGGTTCAA 


CAATAGCTUv. 


nunv. lul nV.« 


1 x—V.. 1 l/>4 V>l A 


4260 


CTACACCAGT 


CTCACGTTGT 


TCCTTTATAA 


CTTGAGTTTT 


TTTAGTACCT 


'ITTTrflArTA 


4320 


TTCTTCGACT 


AGGCGCAGTC 


GTTGAAGTTG 


AAACAAJ, 1 iv. 




IV, 1 1 A A V* A 


4380 


TTACAGAGAA 


TATTCTGACC 


ATTTCAACTT 


TCTTACCTAA 


TTTAOLTTCT 


lU 14 11 HSm. A W 


4440 


TTACAGTTCC 


TTCAGCTAAA 


TCAGGATTTT 


CTTGAATTTC 


■MB J ilHWU'i ft ft ft ft 

TTCTTGAAAA 


TCTATTTTTG 




TCTCCATAGT 


TTCCTCACGA 


TATAAGAGTT 


CAGGTTTGTT 


CAATTGACT-T 


/-» ft ^ ft ft ft ft 


A* Aft 
V9QU 


CATCCTGTGG 


ATTTAATGTA 


TTTACCCCAG 


IX-TiTlX-TW 


TGGAGAAATC 


TTCTCCTCTT 




TCTTCGTTTC 


TAGATTCTTA 


TGTTCGGCTA 


ATTGTTCTTG 


AGAATCTGAA 


GAnTj'rrii. i 


a can 


CTTCTTTTCT 


TGGATTCATT 


AATTCAGTAG 


AGAAAGGTTT 


TTCAACTACT 


TGAACTTCXG 


* tHV 


TCGCCTTAGT 


TGAAGAAACA 


GGTGTTTGTT 


CCTGAATAGC 


TTGTACTGTT 


Vj A I\j(jA I 1 


4800 


CTACAAAATT 


CGGTGT AACA 


TTATAATCCA 


CCTTTTGTTG 


TTTTGTAGGA 


GTUoCAAHVj 


fi £ A 


AACTCTTTTG 


ATTACTTACT 


TCAGACTCAG 


AAGTCGTTTT 


TCCCTCTTTG 




a o?n 


TATAAGTGTA 


ACCTGAAATC 


TCTTTAGGAA 


GAGGTAATTT 


TTCTCCAGAQ* 




4980 


AGTCCGTATT 


GTAATTTACC 


AAAAGATGAT 


TTTCTAAAGC 


ft *tW"^ ft ^*TV~ & S 


ft /^^» ft ft |*~ ft f"*» 


S040 


CATTTCCTAT 


CCCTGCAACC 


AATACTAAAT 


GTAATACCGT 


TTTATTCTTA 


ACCTTTTTCT 


S100 


TGGAAAC AG C 


AAAAATTAAA 


ATTCCCATAG 








5X60 


GCC TCTCATT 


CTTGCTTCCA 


GTATTTGGCA 




TTGATTTTGA 


GAATTTAACT 


5220 


TATAAACAAG 


ATAATAAGTT 


TCATCATCAT 


TCTCCACGTA 


TGTCGGAATA 


TCATAGACAA 


5280 


GCTGCTTCTT 


TTCTTCTCAT 


GATAGCTCTG 


AATCTGCCAC 


ATATTTATAG 


TGAACTCCCG 


5340 


CAGTTTCTTG 


AGCATCCACA 


GATGAACTAG 


CTAATACAGA 


CATAAAAAAT 


AAACTTGAAA 


5400 


TCGTTGCAGA 


TACAAGTCCT 


ACTGATAATT 


TTCTAAATGA 


AAAACGCTCT 


TGTTTTTCAC 


5460 


CAAAATACTT 


TTCCATTATT 


CCTCCTTGAA 


ATAAAATTTA 


TATATGTTAC 


AAAGACCTTT 


5520 


ATT AT ATT AG 


TGTATTATCT 


ATTATCTATA 


GAAAAGGCAG 


TATACCTTAA 


TTATACTCTT 


5580 


AATTTACAAA 


AAAGTCTTAA 


AATTGAGATG 


CGCTTTCATA 


CTTTGTTTTA 


TATTATTTGG 


5640 


AGGTACAATA 


ACACCTACCA 


TGAAATTTAC 


ACGGTAGGTG 


TTACTCATAT 


CACTAATCGT 


5700 
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TCTAAAAATG 


GTTTGAGGCA 


GTTGAGGAGA 


ATTCCTTCTA 


TCCAGCTTCC 


TTGTCCTGAT 


5760 


GAGCGATGGT 


CTTCCTGCAG 


GCTTTTTTTT 


AGAAAATCTC 


CCACTTGTTC 


TGGTGCGATT 


S820 


TCAAATTCAA 


AGGCTTTCAT 


TTTATAGAAA 


AACTCGATGA 


GATGATCTCA 


CAGGTATTCA 


5880 


GTTGAAAAGG 


GTACTTCACC 


ACTTTTTCTA 


TATPCTAATA 


AGAGTCTAGA 


AAATCGAGCT 


5940 


TTTTCTTCAC 


GAAGCTCACG 


AAAATAGGAA 


TTGAGGATCC 


AXGTCTGCTT 


CTCTTTTCTT 


6000 


TCAATTCGAT 


CCTGACTGGC 


AATTCGTTGG 


TCTTTTTCCA 


GCTCTTTTTG 


CTATTCTTTG 


6060 


GCCTTCATAG 


CTCGTTCTGC 


TCTATTTTTA 


CCAAAAAGAA 


TTTTTTCCCA 


CTTGCGTTCT 


6120 


TCTTGAGTCA 


GGGTCTCTGT 


AAAGCCAAAG 


TAATCTTGAT 


AAGCACGCTC 


TGCGGGTCCC 


61S0 


ATGGCTAGAA 


CCAGATTGTC 


TGCATATTGC 


TTGGCGATTT 


TATCCCTCTT 


CTTGCGTTCT 


6240 


TTCTCTGCCT 


GGATACGGAG 


TTCTTGTTCG 


TAGTCAATTT 


TCTCCTTGCC 


TAGCTTGACA 


6300 


AGGTAGAGTT 


GGTCATCCGA 


TTTCCCAAGT 


AAAAAGGGTT 


TGATACACTT 


TTCAAGGACT 


6360 


TCTTCCATCC 


GAGCCTTTTT 


CTTTCGTTCC 


GCCTTGGTCC 


AACTTCCTCC 


CTGAAAGACT 


6420 


TCTAGGAAAA 


GCTGGTAGTC 


TCTCTCAGGC 


GCAAATTGAT 


TGCCACGATT 


GGGTTTGAAA 


6480 


ACACCTTTTT 


CCCAGAGCCA 


TTTTAGAAGT 


CGCTCGTCAA 


AGTTACTTTT 


ATTGACCTTG 


6S40 


ATTTTTTCCT 


TTTTCTGAGC 


TTTTCTGGTT 


AGATTTTCAA 


CCTTTCTGAG 


CAGTTTTTCT 


6600 


TCCTCTTCCA 


ATTGCTGGTC 


AAGGGACAAT 


CGATGAAAAT 


GACCAACACA 


GTCGCT ACC A 


6660 


ATTGGAAAGA 


GGCGTTGGCC 


TGTGACACCG 


TTAAAGAGTT 


CATAAGCGTA 


TTTGATGGCA 


6720 


TTTCCACAGA 


CACAATTGCT 


ACGGCCGATA 


CCGTTAAAAA 


TAAAGGAAAC 


TTCATTCCAT 


6780 


TCCTTGGTAG 


CTTGTTCCCA 


AGTATCCCCT 


TTCGAAGCCT 


GTAAAACTGC 


ATCGTGCAGG 


6840 


GATTTTCTAA 


CTGGAAGTGT 


CATGAGGTCT 


CCTTTCTAAT 


ACTCAATAAA 


AATCAAAGAG 


6900 


CAAACTAGAA 


AGCTAGCCGC 


AATCAGCTCA 


AAACACTGTT 


TTGAGGTTGT 


AGATAGAACT 


6960 


GACGAAGTCA 


GCtCAAAACA 


CTGTTTTGAG 


GTTGTGGATA 


GAACTGACGA 


AGTCAgTAAC 


7020 


CATATATACA 


GCAAGGCGAA 


GCTGACGTGG 


TTTGAAGAGA 


TTTTCAAAGA 


GTATAAGTTA 


7080 


TACTTTTACA 


ACTTGAACCT 


CGTCTTTACC 


GAGTAAAATC 


AAGTATTTTT 


CAATATTTTC 


7140 


AATCGAATAG 


GCTCGTGATA 


AAGCCTCTTC 


CTATAGAGCT 


AACTGACCAC 


GATAGCGGTC 


7200 


TACGAGTTGA 


CTTGGTTCAT 


CATAGCGGTC 


TGTCTTGTAG 


TCGAACAGAA 


CAATTTTGTT 


7260 


TTCGTAAAGC 


AGATAGCCAT 


CAAGGATACC 


ACGGACAACA 


AAGTCTTCCT 


GACTCTTTTG 


7320 


GTCTCGTTTG 


AGCATGGAGA 


AAGGTTGCTC 


GCGATAAAGA 


TGGTCGGTAT 


TAGCAAGAAT 


7380 


TTCCTGACCG 


AGTACTGTGT 


CAAAGAAAGC 


AAGAATTTTA 


TCAAGATTGA 




7440 


GACAGCTTGG 


CTAGTTTGAA 


CTTGTTTGAG 


TGTTTCTGTT 


AGGCTAGCAA 


GGGTTAGTTG 


7500 
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CTCGCTGAGC 


TCAATTCTCT 


GCATGAGTTC 


■■■■■■ ■ m % A 

CTGAGTAGCA 






7S60 


TACCTTTTCT 


TTGCTTGAAA 


AATCTGGCAA 


ATCCAAGCTvi 


ATTTTCTTGC 


CTACTGACTG 


7620 


ACCTTGACCA 


GCAATCTCGA 


CACCTTCCAT 


ATCCATAACT 




» 4 pM«pf TTGAT 

t\ g * AW** * 


7680 


TTGACTTGGG 


GTTTGAACAC 


TAGGAAGTTC 


AATACCTGCG 


CGGTG AAG Mj 




7740 


TTCCACCTCC 


TTCAGCATTT 


CCAGAGCTTC 


TTTGATGGTA 


TCTGACTGAC 




7800 


TTGGGAGCTA 


TCTTGGAGAG 


GACTCTTGGT 


TTCCAACTCT 


CCGATAGCTT 


LIVluO 4 


7860 


CTGATCTTCG 


CCAATAAAAC 


GATAACTAAA 


CTTGAGCTTG 


TCCTTAGTAA 


ACACTTTACT 


f 7 *w 


GATAGCCCAA 


AGCCAATCTT 


GGAAATTCCG 


TGCTTGCAGT 


CTAGTATTGC 


TATTTAGTTT 


7980 


CCCATTTTTG 


CCTGCTGGGT. 


ATTCCTTGGA 


TTCCAGCTTT 


TCACGAGAAC 


CCTTGCCGAt 




AAGATACACC 


TTTTTCTCAG 


CCCGCGTCAT 


AGCAACATAC 


AGCAAACGCA 


TCTGCTCACA 


a i n n 
a 1UU 


ATAGCTTGCT 


AGCTGTAATT 


CCTCTTCGTT 


CTGCCTATAG 


CTCAGACTAG 


GAATGGAGAG 


Of CO 


TTTGATGCTT 


TTAGCATACT 


GGTCTTCTAC 


TGCCCCTCTC 


TCCATCTTCG 


CAATATATTT 


a i ift 


CACACCAAGA 


CCATTCTCAC 


GACTGACAAT 


GACTTCTGAC 


ATAGAGTCTT 


GCT TGTTGAA 


a o aft 


ATCTTCATCC 


ATATTGAGGA 


TAAACACGTA 


AGGAAACTCC 


AGCCC TTT AC 


TCTTGTGGAT 


fi 1 A ft 


GGTCATGAGC 


TCTACTGCAT 


CTTTTGGCGG 


TGCCACGGCC 


ACCCTTGCCA 


AATCGTGCTG 


Q ii Aft 
D4UU 


GGCTTCTAAG 


ACTTGGTCAA 


TCATACGAAT 


AAAACGCCAC 


AAACCTTTGA 


AATTGCTCTT 


n j £ft 


TTCAAATTCA 


TCAGCACCCA 


GTGCTAGGGC 


ATAGAGATTG 


GCCTGCCTAG 


CAGGACtATT 




CCGCAAACCC 


CCAACATAGT 


CAT AAT AAAA 


ACCCTCGTTC 


TAAATCTTCC 


AAATCAAUK. 


O 3" J 


ATAGAGAGAC 


TGGGTTTTCG 


CATACAAGCG 


CCAAGAACCT 


AGGATATCCA 


TGAATTOUl 1 


8640 


TAGTTTTTCA 


GCTAGAGCTG 


TGTGAATCAA 


GCCTTTTTGA 


CT ACTTGCC A 


~4 till IVl i 


8700 


ATTGACCAGT 


TTCTCATAGA 


GATTTTCGTG 


.-.-;.rr-ATcr 


TCTCCTTTCT 




8760 


ACGTGCTAGC 


TCATCCTCAT 


CAAAACCAAA 


CATTGGAGAC 


TTCATAAGGG 


CAACCAAGGC 


8820 


CTAGTCTTGC 


AGGGGATTCT 


GAATGACACC 


AAGAGTGTCT 


AGCATGACTT 


GCACTTCTAG 


8880 


GGATTGGAGA 


TAATTGTTTT 


GCTCTCCGTC 


AGTTTTGACA 


GGAATTCCGT 


ACTCAGACAG 


8940 


GGCCAGCACA 


- AT CTCGTCAT 


TACGACTGCG 


GCTGGAGGTC 


AGAAGGGCAA 


TTTCCTTAAA 


9000 


GCCAACACCT 


TTTTCTTGAT 


GAAGTTTCAG 


AATCTCCTTG 


ATAACTAAGC 


GCATTTCGCC 


9060 


TGTTAGTTTC 


GTTTCTGTTT 


GACTCTCTTC 


TTCCTCACCT 


GTATCGTCCT 


TGTCCTAGAG 


9120 


GAGAAATGCT 


G CCTTGTTGT 


CTGGATTGGG 


AGTC AGTTTG 


CTATTCCCAA 


AAACAAGCTG 


9180 


GT CCTTGTT A 


TC AT AGTTG A 


TTTCGCCGAC 


CTCTTGGTCC 


ATGAGACGTT 


CAAAGACATC 


9240 
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ATTCCTTGCT 


GACAGCACTT CTGAACTACT ACGGAAATTT 


TCCTTGAGGA 


TAATGAGCCT 


9300 


GCCTTCTTGG 


GGATTTTGCG 


CATAGCGTTG 


GAATTTCTCA 


TTGAAAATCT 


GCGGGTCTCC 


9360 


CTGACGGAAA 


CGATAGATGG 


ATTGCTTGAT 


ATCTCCCACC 


ATAAACCGAT 


TGTGGCCATT 


9420 


AGACAACAAT 


TCCAGCATCC 


GTTCTTGAAT 


ATGGTTGGTA 


TCCTGATACT 


CATCGACCAT 


9480 


GACTTCATGG 


AAGCGCTCCT 


GATAAGACTC 


ACGAACTTGT 


GGGAAATTCT 


CTAAAATCTC 


9S40 


AATGGTCTAA 


TGGCTGATAT 


CAGCGAATTC 


GAACCCATTT 


TCCTGTCGTT 


TTCTCTGACG 


9600 


ATAAGCCTCT 


ACAAAATCCC 


TCATGAAAGA 


TTCGAAGGTT 


TTAGCTAGTT 


TCCAAGTGTC 


9660 


TCCATGATAA 


CGTTCTTGAT 


AGTCGAGAAT 


CGCTATCTGG 


TCTGATAATT 


CTCCTAGTTT 


9720 


AGCAAACTGG 


GTCTTTCTCT 


CTTCCTTGTA GGCATCAGCC 


AGGGGCTTCA 


AATCAGCCTA 


9780 


CGGCTGCCAT 


TAGTCACAGC 


TCGACCGTTT 


TTCTCCTTAG 


AGATGGCGAC 


AACACGCGCA 


9840 


AGCACTGCCT 


GATAAGCCTG 


ACTATCGGAC 


TCCTGATTTA 


GGGAGCCAAT 


TTCATCCACA 


9900 


ATTAACTGAA 


CATTTTCTAA 


ATAGGCAGCC 


TTTCCAAACT 


CCTTGGCATC 


GTTATCCAGA 


9960 


TGGTAACGGA 


AAAAGCTTTC 


CAAATCCCAA 


ACGGCTTGTT 


TGATTTGCTC 


GCTCACTTTT 


10020 


tctttttcac 


TGGTAAAATC 


AGCTTTCTCA 


AATCCTTTGA 


GGAAAGATTC 


ACTCAGCCAC 


10080 


TTTTGAGGAT 


TACTGGTGGA 


TTGGAGGAAG 


TCATAGATTT 


TATAGACCTG 


CTCGCGCAGA 


10140 


CCCCGTTCCT 


CCTTGCCACG 


CCCAGCAAAG 


TTTTTCAGCA 


AATGACTAAA 


GGTCTCTTTC 


10200 


TGTTTACCTT 


GCTAATGCGC 


TTCAAAGACC 


TCATGAAAGA 


CTTCCTTTTC 


GAGAATAAGT 


10260 


TGCTCGCTTT 


GCTTTTGTAA 


AATACGGAAA 


TTACCTGCAA 


TATCAAGCAG 


ATAACCATCT 


10320 


TTGCCAAGGA 


ATTTTTGTCT 


GAAAGAATCC 


ATGGTTCCAA 


TGGCAGCGTT 


GGCTAGGTCT 


10380 


GCCAACTGGC 


GACCCAAGTG 


TTGTTTGAGG 


TCGACATCAT 


CTGTTTCTTG 


GATTTTCTTG 


10440 


CTGATTTTTT 


TCTCTAAACG 


TTCTTTAAGT 


TCAGTTGCAG 


CCTTGACGGT 


AAAGGTTGAG 


10500 


ATAAACAGTT 


GAGAAATTTC 


GACACCACGC 


GCCAATTGGT 


CCAGAATGCG 


CTCTGCCATG 


10560 


ACAAAGGTCT 


TTCCAGAACC 


ACCCGATGCT 


GAGACCAGGA 


TATTCTGGGC 


AGAAGTGTAG 


10620 


ATAGCTTCGA 


TTTGCTCGGC 


AGTTTTCTTC 


TGTTCCTTGC 


TCGAATTTGC 


TTCTCCTTCT 


10680 


TGCAGTTTTT 


GAATCTCCTC 


CTCACTTAAA 


AAGGGAATAA 


CCTTCATCGA 


TTCAACTCCT 


10740 


CTCTTATTTT 


TTCAAGCCAA 


GCTTGCTTGA 


CTTTTTCTCC 


GACCAGACGC 


TTGCCATCAG 


10800 


CTAGGTCCAA 


CTTTTCTAGG 


AAACGGGCTT 


GGCCCAGATG 


CTAATTGGCT 


TCAAAGCCTG 


10860 


TAATAGCCTG 


ATGTTGCTGG 


ACGTATGGGG 


CAATCCTTCT 


GCCATTTTCA 


GTATAAGGAT 


10920 


TGATGGCGAA 


CCGGCCTGCT 


AAAATCTTCT 


CAGCAGCTTT 


CTTGT AAAG A 


TAGGCATTCT 


10980 


AGTCCAGTAG 


GAGCTGAAAT TCCTCATCTG 


TCAGTTGATT 


AGCCTTGTTT 


TTGTTATAAA 


11040 
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ATTCGCCTAA 
TGCCTTCTAC 
CCATTTCCAA 
GATAGCTTGG 
GACTGGATTT 
CCACCTTGCC 
TTCCACCAAA 
CTCCACTTGT 
CTTGATAAAT 
GTTGGTCAAA 
AATTCCCGTG 
AGCGTAGGAA 
AAAACTCCTG 
TTGGACTGGT 
CACGCGACAG 
GGTGATAGGC 
CTCCTTTGTG 
GATAGGCAGA 
GCTTACGAGC 
TSCTGCCAAT 
CATCTGTCAG 
TAGCATAGAC 
TGTCCACTGT 
CTAGGAAGTC 
CATGGCAGAA 
TTGTCAAATC 
GTAGGAGTTT 
TTCTCACGCG 
AGGTTTGCTG 



ATAACTGCTT 
CACTGCTCCT 
GTACATGGCG 
TAACTGAGAA 
GTAGTCTACT 
TCGTACAAAG 
ATTTGCTTCT 
CCGTGCAACA 
AGCTTCAAAT 
GGAATCTTCA 
ACTACGGGCA 
ATAACTGTAT 
TTTGCCAGGA 
TGGGATAGCT 
AACCTTGACA 
AACCAGACTA 
ATTCATCCTC 
TTCCTTACTT 
AGAATTGACC 
CAGTWTTGA 
AAGACTGGTG 
AAAGTCACCA 
TGCTGGAATG 
TT CCAGACT A 
AGCCTTCCAA 
TTGTAACTGC 
TTCAGCCTTT 
GAGGACATTC 
AAAGGCTGGC 



TCTTTTTCCA 
GCCAGACTTT 
CCGAAAAAGT 
TTCAGCCCAT 
ACTCCTATCG 
ACACTGCGTC 
TCTTTGATCG 
TCAAGCAAAA 
TCGCGTTCTT 
TTAGGCAACT 
TCAGGATGCA 
TCATTGCGAT 
TAGAGAGCTT 
GGATTTTCCA 
AAAGTCAAAT 
GACAAAAGAC 
TTCTCTCTCC 
TCACTTTCGT 
AAGCAAAGCA 
ACGCCTTCTT 
TTTTGAGAAA 
GTCAATGGTG 
GTACGGTATT 
ACCTGTGAAC 
ACTTCGGCTT 
TTGGTCACAG 
TGTTTTCGGC 
AAACGCTCAA 
AAGCCATTGA 



303 
AGAAGAGCCC 
TTACCGCCAT 
TCTGCTCCCC 
TAAAGAAATG 
CTCCATTAGC 
CATTGTCTAA 
TTTCGATGGC 
CTTCCTTGGT 
CACTGGTTTC 
GTAAGGCGCG 
AACGTAATTC 
AAAACTCTGT 
GCAAGGTGTC 
GACCTTCCTC 
CTTGCTCAGT 
TGTGATAGGA 
GCCTAAATCC 
TAAAAAGGCT 
TAGTCTACCC 
CSGTCOCTTG 
TTTTTGGTAA 
CAATCAAATC 
GGCACAAACT 
CAGCAAAAAC 
GTCTTTCCTG 
CTCCTTCTTT 
TGGCAAAGAG 
GATTAAATTT 
TACCAAGATA 



TTCGTATTTC 
CAGAGATTGG 
TTCTCTTTTT 
AGGAAACTGG 
TTTCAAACGG 
TTGAATAAAG 
TGGATTGTGT 
AAACTGGGCT 
TTGAATAGCT 
TTCAAAGATA 
CTCCTGCAAG 
CAAACCCGAC 
CTTGGCTAAG 
ATCTAGTTTT 
ATCGCTCATC 
CCCCATATCC 
AAAATGGATC 
TGGAGCCCAC 
ATTTTTCTTG 



ATAGATTTGC 
ACAGGTTCAG 
AGGGCAGCAA 
AACTGAGTCA 
TCAATCCGGT 
GCTTGGTCTT 
CGGAGAATAT 
TCCAAACTTT 
TCTTCTAGAC 
CGATGCAAGA 
CCTAAAACGT 
CTAGACAGGT 
CTCTTGCTGC 
TTACCTATGA 
TCACCCTGCT 
TCCTTAGACA 
AACTCTTGAA 
AAGAACAACT 
AGATTTTCAC 



ATTGTCCTGA 
GTAACTCTGC 
CATTCCAGAA 
AGTCGCAAAT 
TTCTACAGCT 
TAGAAAGACA 
GGTTTCAAGA 
TCCATGGTGG 
GCGGATATAT 



GTTAGTCCAA 
ACCAGAACAG 
TGGAGCAAGG 
TGTTCTAAAA 
TCCAAAGTGG 
CTCCATTTTT 
GGTGCTAAAA 
GATTTGGTGA 
TGCTCAAAAG 



moo 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11620 

11860 

11940 

12000 

12060 

12120 

12160 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 
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CATCAATATC 


AGACTGACTG 


AGGTCAGTAT 


304 

ACAAATCAGT 




1 J tv\ I WUW\ 1 


12840 


CCTCCTGACC 


AAAACGGTAA 


CGTTTTAAAG 


CTAAAATAGA 


CTCGACAAAt. 




12900 


GATGATGACC 


CATCGCTTCG 


CTTCTACCAA 


GATAAAAAGG 


AATCTGATAC 


TGGTCAAAAA 


i ^Qfin 


TGGTTTTGAG 


ACATAACTGG 


TAAGAAGCTA 


CATCCCCCAA 


GAGAATACGA 


AAATGCTTGT 


1 J UaU 


AGCTCAGCTC 


TGAGTTCTCA 


TGTAATTTCT 


GACGAATACT 


ACGGCCTACT 


AGCTCCAACT 


.L JUOv 


CCTCCTTTTG 


CGTCAAACAA 


GACCAGATTT 


GTAAATTTTC 


ACGGTCTTTC 


TCATCGACAT 


13140 


CCAAAGCGAG 


TTCTGAAAAG 


TCATAAGAAG 


ACTCCAACAA 


ACGAGAGGCC 


TTGTCAAAAC 


13200 


TATCCATCTT 


CTCATGACTT 


TGAGAACAGT 


CCTGAGCAGG 


CGTTTGGTAT 


TTAGAAGCCA 


1 J2d0 


GATGATGGAG 


AAATTTTACG 


CTGGCTTCGT 


AGAGATTGCC 


CTCGCTAAAA 


GGACTGGTAT 


13320 


AGGCTTTCTT 


ACTAGCATAA 


GCCCCGATAA 


CAATCTCAAC 


ACCTTTGCCC 


TGAAGTAAGT 


13380 


CCACAACCCG 


CTCTTCCTCA 


GCAGAAAAAC 


GAGTAAAGCC 


GTCAATGACC 


AAGGCGATTT 


13440 


GATTAAAATC 


ACTACTTACC 


TTGTCATTCT 


CAATAGCCTC 


AATCAAATGG 


GACAACTGAC 


13S0O 


TTTCCTGGGC 


TAACTGACCT 


TGATTAAGAT 


AGGCTCTTAC 


TTTCTCAAAA 


ATCAAGAGTA 


13S60 


AATCCGCCCT 


CTTATCCTCA 


TCTGTTAAAT 


TCTCCAACTC 


CAAAAAACTC 


ATCTGAGATT 


13620 


TGGTCATCTC 


ATGCTAAACC 


TCAATTAACT 


GCTGGATCAA 


TTGAGGATCC 


TGCTTAATAG 


1 3680 


CGCCATAAAC 


ACGCAAGTCC 


TTGGGATCCA 


GTTCGGCAAG 


GCATTTCTAA 


AAGGCCAACC 


13740 


CAAGACCGAT 


ATCATCAACA 


GTAGTTTTAG 


CTGGTAAATC 


ATTCAAGACC 


AGATAGCGAG 


13800 


CCATTTGAGC 


AAAGCGCGTG 


ACGGTAATCG 


AAAAAGAAGC 


CTCCTGGGAC 


AAGTATTCCA 


13860 


GCACGGCGCG 


TTCCTTTTCA 


AAAGAAAGAG 


AGTTGGGGGC 


AATGTAGAAG 


ACCCGCTTGC 


13920 


CAGCTGCAAC 


TAGCTCTTCT 


GCCTCTCTTG 


TTAGAATTTC 


TGTCAAAGAA 


GTCCGAATAT 


13980 


CAGTATAAAG 


TAATTTCATC 


TCACCCTCGT 


TGGAATTTTT 


CATCACCCTA 


TATTATACCA 


14040 


TGATTAGCCT 


CGTAAATCTG 


TTAAAATATT 


TAGGCCATCC 


TTTCTTTTCT 


TCATCATCTG 


1 4 1UU 


CTAAATCTTA 


AATACTTAGC 


TTTACTTGTA 


TTAGATAGAA 


TAAGTCTGGC 


TACTGAAAAT 


11 1DU 


CACATAATAA 


AAAAGCCTCG 


CTAACAAGGC 


TTTGAGTTTT 


ATGATTGTTT 


CTTAGGTACC 


14220 


GAATACACTT 


CAATGTGTTC 


TCCCAGTATC 


TTAATGTCGA 


CTGGTAGATT 


GTCTGATTTA 


14280 


TCCCCATCAA 


CATCGGACTC 


TAATTCGATA 


TCAGAAGAAG 


TTTTAATATT 


ACCTGCCTTT 


14340 


ATATATTCAA 


TATTCTTGAT 


AGAATGATTG 


AACTATAGTA 


AATTGAAACT 


ATAATAGTAC 


14400 


ACCGTGGATG 


CTAAAATATT 


TCTAGAAATT 


AATTTGATTT 


CCCTAATCAA 


GCTATTCGTA 


14460 


TCTTATTTCA 


ATCTACTATA 


ATAAAATGAA 


CCAAAAATAG 


TACACAATGT 


GGTATAATCT 


14S20 


TCTTATGGCA 


TATTCAATAG 


ATTTTCGTAA 


AAAAGTTCTC 


TCTTATTGTG 


AGCCAACAGG 


14S80 
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TAGTATAACA GAACCATCAC ACCTTTTCCA AATCTCACCT AATACCATTT ATGGCTGGTT 14640 

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAACAA AACCAAGAAA 14700 

AGTTGATAGA CATAGACTTA AAAACTATCT TACTGACAAT CCAGATGCTT ATTTGACTGA 14760 

AATAGCTTCT GACTTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 14820 

CTACACTCGA AAAAAAGAAC CACACCTACT ATGAACAAGA CCCAGAAAAA GTAGCCTTAT 14880 

TTCTTAAGAA TTTTAATAGT TTAAAGCACC TAGCACCTGT TTAGATTGAC GAAACAGGAT 14940 

TCGATACTTA TTTTTATCGA GAATATGGTC GCTCATTAAA AGGTCAGTTA ATAAGAGGCA 1S00O 

AAGTATCTGG AAGAAGATAT CAGAGGATTT CTTTGGTTGC AGCTCTAACA AATGGTGAAT 15060 

TAATCGCTCC AATGACTTAC GAAGAGACGA TGACGAGCGA CTTTTTTGAA GCTTCCTTTC 15120 

AGAAGTTTCT CTTACCAACA TTAACCACAC CATCGGTTAT TATAGTAAAA TGAAATAAGA 15180 

ATAGGGGGGG GGGGGGAGGG GGGGCGAGGG AGA 15213 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



TTATTACCTG 


AAACATTAAA 


TTTAATTGGA 


CATCCCGTTA 


TC AATTTT AT 


AATATCATCA 


60 


AGATTTTTAT 


TATCTGATTC 


AGGAATTTTA 


TCTGATATAA 


CAACACCATT 


TTCAAGATAG 


120 


TTCATTAAAT 


TATTTGATTC 


ACTAACATTA 


GTGTTTTGAT 


CTCrATCAAG 


CCAAAAATAA 


1*0 


TGGTT AT CGG 


AATCTAAATA 


CGATGAGTTT 


AAAATATTAT 


TACAAATTAT 


TTCATTTGCT 


240 


CCACCAGGAA 


TATATCTCAC 


TACTAAATTC 


TGTTTAAGAT 


TCTCACTACC 


TCAATGAGTC 


300 


ATAACAAACT 


CTAGAATATA 


TTTAGCTAGT 


CTATCTTCAA 


CATAAATCAT 


CTTCCTAGAA 


360 


TGATACACAT 


CACCTAATTC 


AAAAAATGCA 


TCCTGATAAT 


CAATATTTTC 


AATAACATCT 


420 


ACCTTTTCTC 


CGTTTTTCAC 


TAAAAGTTTC 


ACGGCTTCTC 


TAGGAAAATC 


TTTTATAAGT 


480 


TGTGTACAAT 


CTGTAGTGAT 


AATAATTTGA 


TGTTTTTTAT 


TTAAACACTC 


TTGAAGTAAA 


540 


AACTCTTTAA 


ATTTATAGAT 


TGCACTCGGA 


TGAAGTGAGA 


TTTCAGGTTC 


ATCTATTAAT 


600 


ATTAATGAAT 


TTGATTGCGC 


ATTTACTATA 


TCATTTACTA 


ACAAAATAAT 


TCTAGCCTCA 


660 


CCTGTTCCTG 


CAAAAGCCTC 


GGAATATTCT 


TTTCCAGATT 


TTTTCATCCA 


AATAGTTTTG 


720 
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GAAGCTTTTA 


TAT CATCACC 


TTTTGAATAC 


AACTTATGTG 


TTAAAATTTG 


AATCTCTCTA 


780 


TAAGATTCAT 


CCATTATTTC 


ACTAATAATT 


TCACAAACTT 


TATCATCAAC 


TTTAACATTA 


840 


TCTATAACCA 


TTTCCTTTTT 


ATAACGCGTA 


TACCTACTTG 


TATTATTCTT 


TAAAATATCA 


900 


CCAACTGGCT 


TAGATCCTAA 


TCTTATAAAA 


TCTTGTTTAC 


TACCTTGAGT 


AGAAATTTTT 


960 


TTAAAATTAT 


AGTGATAGAA 


AAATAAATCA 


AAAGCAGAAA 


CATATTCTTT 


ACAATCACAA 


1020 


AAGACAACAT 


TTTTTTCAAT 


GCCATCCCAT 


CTGTCTGTCG 


AAGAACTTCC 


AATATATTTA 


1080 


TTTTTCGGTA 


ATCTTTCCAT 


CTCATATTGT 


TTTTGAGGAG 


CATATGGTTC 


CCAATAATCT 


1140 


AATCCTTTTT 


TTGTTCCAGA 


ACGGCCTTTA 


AGAACTTCTA 


CATTTCTAGA 


AGCTTTAATG 


1200 


TTATAATATG 


AATAGATTAA 


ACATTGTTTC 


CCATCCACTT 


CATCTATTTG 


ATCAACATTT 


1260 


GTACTAAACC 


AATATTCAGA 


CACACTTTTA 


TTGGCTGGAG 


AACCATATAA 


ACCTTGTAAA 


1320 


ATTGAAGTTT 


TATTTACTCC 


ATATCTATTA 


CAGACACCTC 


AGGATTATTT 


AACTTATAAG 


1380 


TTTTAACAGC 


TACGGAATCA 


ATTTCAACAG 


CAACTTGAAC 


ATCTATGCCT 


GATTTTTTAA 


1440 


CGCCACTTGT 


AGTGCCACCT 


GCACCCTTAA 


ATAAATCAAT 


ACCAACAATT 


TTCCCCATAG 


1500 


TATTCTCCTA 


AAGTTTCTCC 


TTTTTATTAT 


AACATTATCA 


AATGTAAAAC 


CCAACCCGAT 


1560 


AGGGTTAGGT 


TTTTAACATC 


ATTTCACCAA 


CTTCTTCATC 


TCATCAATAC 


GTCCGACCGT 


1620 


CGCGTCATAT 


TTACCTTGGT 


AGTCAGCTTG 


TTTGTCGCAT 


TCTTTTTGCA 


CGACTTCTGG 


1680 


TTTGGCGTTG 


GCTACGAAGC 


CTTCGTTAGA 


GAGTTTCTTA 


CCAACCATGT 


CCAGTTCTTT 


1740 


TTGCCATTTA 


CCAAGTTCCT 


TGTCGAGACC 


GGCCAGTTCT 


TCTTCAACAT 


TCAGGAGATC 


1800 


GGCCAGTGGC 


AGGTAGATTT 


CTGCTCCTGT 


GATGACACTT 


GACATAGCCA 


CTTCAGGTCC 


1860 


AGGGATGGTT 


GATGCCATTT 


CCAAGTCTTC 


TGGATTTCTA 


AAGCCTTTGA 


TATAGTTGAC 


1920 


ATTGCTGTTA 


AAGAAGGCTT 


CCAAGTCGCT 


ATCGCTTGTC 


TTAACAAGGA 


TGCTGATAGG 


1980 


CTTGCTTGGT 


GCTACATTTA 


CTTCCGCACG 


CCCATTCCGA 


ACAGCACGAA 


TCAACTCTTT 


2040 


GAGACTTTCC 


ACACCACTGT 


GAGCCGCAAG 


GTCTTCAAAG 


GCTAGATTAA 


CAGTTGGGTA 


2100 


TGCAGCTGTC 


ACGATAGAAC 


CTTCTGAGAT 


TTGTCCAAAG 


ATTTCCTCTG 


TCACGAATGG 


2160 


CATCATTGGG 


TGAAGGAGAC 


GAAGGATCTT 


CTCCAGCGTA 


TAGAGGAGAA 


CAGATCGAGT 


2220 


AATGACCTTA 


TCGTCTTCAT 


TGTCGCTCTA 


TAGAACTTCC 


TTGCTCAACT 


CAACATACCA 


2280 


GTTGGCAAAT 


TCTTCCCAGA 


TGAAGTTGTA 


AAGGATATGA 


CCAGCCACAC 


CAAACTCGAA 


2340 


CTTATCAAAG 


TTTTCAGTAA 


CTTTTGCAAT 


GGTTTCGTTG 


AGATTCTGGA 


GAATCCAGCG 


2400 


GTCCGTCACA 


TTACCACCCT 


CACCTGTTGC 


AACTTTTGTG 


ACATTGTCAT 


GCGCCACATC 


2460 


CAGCGTCAAA 


CCTTCATTGT 


TCATGAGGAT 


ATAGCGAGAA 


ATGTTCCAAA 


TTTTGTTAAT 


2520 
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AAACTTCCAT 




4 4 4 4 4 4 n 


AGAGAAACGA 


ACGTCTTGAC 


CTGCTGCGGA 


2S80 


ACCCTTTGAA 




CAAGCC^ATC 
unnvMuut 4 *«• 


AGCACCGTAT 


TTCTCGATGA 


CATCCATTGG 


2640 


CTCAATCCCG 




#** 1 1 A nunvA i 


CTTGCGTCCT 


TGCTCGTCAC 


CCATCAGACC 


2700 


GTGGATAAGC 


ACGTTTTGGA 


A luwC 1 \»/*.v*W 


ACCACTAAAT 


TCCAAGGACT 


GGAAGATCAT 


2760 


ACGAGACACC 


CAGAAGAAGA 


I t»A l\» 1 V_v# i *t\ 




AAGGTTGAAG 


TTGGCAAATA 


2820 


ACGTTTAAAG 


TCTTCTCACT 




LLnuV»V«Ln 4 V* 


CTTGAAAATG 


GCCAGAGGGC 


2880 


AGAACTGAAC 


CAAGTATCCA 


AV*Av_V» i l» I IV- 


rTCfTGAGTC 


CATCCGTCAC 


CTTCTGGAGC 


2940 


TTCTTCGCCG 


ACATACATTT 


CAt.CATv.AWU 


UTTfiT Af*£* AG 


GCAGGGATTT 


GGTGACCCCA 


3000 


CCAAAGCTGA 


CGAGAGATAA 


CCCAGTCGTG 


UALAl 11 1V.V, 




GGAAGGTATC 


3060 


GTTGAAACGA 


GGTGGCTAGA 


ATTCGACCTT 


GTCCTC TGTG 


i 1 wVj * i nw 


CAATGGCGTT 


3120 


CTTAGCCAAT 


TGCTCCATCT 


TGACGAACCA 


I 1 uAVi 1 MVj/*\» 


JLAGrTZTGGCT 

1 WWW 1 


CAACTACGAC 


3X80 


ACCTGTACGT 


TCTGAGTGAC 


CAACACTGTG 


GACACGTTTT 


1 V_l»#/\ 1111 


CAAGGCCACC 


3240 


GATTTCTTCC 


AACTTAGCAA 


CGACTGCCTT 




AAArCATCCA 
nnnv^un k wv»*f* 


TGCCTGAAAA 


3300 


TTCAAAGGCA 


AGCTCATTCA 


TAGTTCCGTC 


t» J Lu 1 1 V_M 1 w 


/\V.\J 1 1 • * 


GTGGCAAGTT 


3360 


ATGACGTTGG 


CCAACCAAGA 


«L Ml/a Ik IP|WfU1/« 


M 1 V-Vj 1 VjUULA 


ftm*GTGATTT 

WU 1 KJ 1 * * * 


TCACGAQACC 


3420 


AGTACCAAGC 


TCAGGATCTG 


CGTGCTCATv- 


& & fV^ & tt 


GCGATCACTT 


TATTAGCGAT 


3480 


TGGAAGGATG 


ACU , rri"i - rAC 


CAATCAAUTw 


l. 1 1 u 1 nuW wv< 




GATTAACCGC 


3540 


AACCGCAACG 


TCCCCAAACA 


TAGTCTCAUG 


Z.m fcfTTftTA 
ALunu i Ivln 


fiCAACTTCAA 


GGGCGCGTGA 


3600 


ACCATCTTCC 


AGCATGTAAT 


TuATu luu 1 A 




TCTACATCCT 


TGTGAATCAC 


3660 


CTCAATATCA 


GAAACGGCTG 




TGCGTCCCAG 


TTGAT^ATAA 


ACTCACCACG 


3**:o 


ATAGATCCAG 


CCTTTCTTGT 


AnAuu 1 I w*v> 


AAAGACCTTA 


CGAACAGCTT 


TTGACAAACC 


3780 


TTCATCAAGA 


CTGAAACGCT 


CACGAGAATA 


GTCTACAGAA 


AGCCCCATCT 




3840 


TTCCTTGATG 


GTAGTGGCAT 


ATTCGTCTTT 


CCATTCCCAG 


ACCTTCGTCA 


AGAAAGACTC 


3900 


ACGACCTAGG 


TCATAACCCG 


TAATACCCTC 


ACCACGTAAG 


CGCTCCTCAA 


CCTTAGCCTG 


3960 


AGTCCCAATA CCAGCGTGGT 


CCATACCTGC 


AAGCCAAACG 


GTATCAAAGC 


CTTGCATGCG 


4020 


TTTTTGACGG 


ATGATGATAT 


CCTGCAAAGT 


CGTATCCCAA 


GCGTGACCAA 


GGTGAAGTTT 


4080 


CCCAGTTACG 


TTTGGTGGTG 


GAATCACGAT 


TGAATAAGGC 


TTAGCCTTTT 


GATCGCCTGA 


4140 


AGGCTTGAAA 


ACATCCGCAT 


CAAGCCATTT 


TTGGTAACGA 


CCAGCCTCAA 


CCTCCGCTGG 


4200 


ATTGTATTTA GGTGAAAGTT 


CTTTAGACAT 


GTGTGTGTCC 


TTTCTC T ATT 


TTGTTTATTT 


4260 



WO 98/18931 



PCI7US97/19588 



308 



TATTTTGAAT 


TTGCTT AG C A 


GCTTCTTCTG 


CAGACAAATT 


CCTATTATTT 


ATTTTAAAGT 


4320 


AGTGGTGCAA 


CTCATTCCGT 


TGATGTTGGG 


AATTTAATTG 


AAGTGTTTCA 


CCGGTCTCTA 


4380 


AAATTTCTCT 


TTCAGATACC 


TCAATATGTC 


GTTTTAAGGG 


TTTGTGCTTT 


AATCGATTCT 


4440 


CCCTTCGATT 


TCGACCTATG 


CACTCTTCAA 


GACTTGTTTC 


CAATTCAACA 


AACAGAATCT 


4500 


CTTGATGAAA 


GTTATCCAAT 


AAATCCTGAA 


TTTGCTTTAA 


ATACATCAGC 


TCGTACTGAT 


4S60 


TTGAAAAATC 


AATTACGTCT 


GTTAAAATTA 


CTGATCGCTG 


ATTTCTTGCA 


CTTGCTCCAA 


4620 


GGAAAGAAAA 


GGTAATTCCA 


CGAACAAATT 


CCCACATCTC 


CTCGGTATAA 


TCCTGATAGA 


4680 


TCTCTAGTGC 


AAAATCAATG 


GCTTGATGGT 


TATAAAATAG 


GCTAGCATCC 


GTCAGTCGAG 


4740 


ATAATTCTTG 


ACCAATGGTC 


ATTTTTCCTG 


ATGCTGGACC 


ACCAATGATG 


AAAAGATGCA 


4800 


TCAAATCACC 


TCCCACTCAC 


TCCTCAGCAA 


GCCATATCTC 


AAATCATCAC 


AGCAGTTGCC 


4860 


TTGAGCATCT 


TTGCGGTCTC 


TTATGCGAGC 


TTCGAGGCTA 


AAGCCAACCT 


TTTCCGAGAC 


4920 


TCGTTGACTT 


TGAACGTTAT 


ATCCAAAGCA 


AGTTAGTTCA 


ATCTTGTGAA 


GACCAAGTTC 


4980 


TTTAAAAGCT 


ACATCAATCA 


AGGAACACGC 


TGCTTCTGGA 


ACATAACCTC 


GACCCCAATA 


5040 


GTCTGGGTGC 


AAGGTATAGC 


CAAGCTCTAG 


CACATCATCC 


GCATGAAGAT 


GGTTGAAGTC 


5100 


AACAGAACCA 


ATGACTTTAT 


CGGTTCCTTT 


GACGACAATC 


CCATAGCCAG 


CTGGGAGATT 


5160 


TTCCTTTTGA 


GTACGCTCCG 


GAAGAATGTG 


CTCCAGATAA 


TAAATCTCAT 


CTTCCAAGAT 


5220 


CTTGACTGGA 


GGAAAACCTG 


CTGGATAGGC 


GACCTCTGCC 


AAACTACCGT 


AGGTATGGAT 


5280 


ATCCTCAGCA 


TCCACCACTG 


TGCGGACTCG 


TAAAACGAGA 


CCTTCTGTTT 


CGATTTTATC 


5340 


TGGCAGCTCA 


GTTCTTGCCA 


TCCTTCTTCC 


TCGCTTTTTT 


GATGAAACTG 


CCCTTCATAT 


5400 


CTACACCCTT 


GTCCAGATAG 


CGATAAACGC 


GCTGATATCC 


ATCTCCCATG 


AAATAGGTTG 


5460 


GGGCAAACAG 


TTGATTTTTA 


AAATGTCCCT 


TTTCATCCAG 


GAGTTCTGGG 


GCAACAAGTC 


S520 


GCTCAAGAAT 


CTTGGCAAAG 


ATGTGGCAAA 


TACCGTCTTC 


CTCAACAATC 


CTATCTACCC 


5580 


GACAATCTAA 


AACAAGTGGA 


CACGCGTCTA 


AAATAGGAGT 


CTGAGTTCGT 


TCACAAAT IT 


5640 


CATAATGCAC 


TCCCAAACGT 


TCCAATTTCT 


CCTGATGACT 


GATAAAACCA 


GCCTGCTCCA 


5700 


TCGCAACCAT 


AGAAGTTTCA 


TCAGAAATAT 


TCACAGTAAA 


TTTTTGATAC 


TGTTTGATCT 


5760 


GCTCTCCGGC 


ATTCTCTCTC 


GCAACGACTC 


CAATCACAAC 


CCAATCTCCT 


AGACTATAAG 


5820 


AGGAACTACA 


GGTCGTGATG 


TTATAGCCAA 


AATTCTAATC 


TTGATATCCT 


AAAATAAAAA 


5880 


CAGGAAAACC 


ATAATATAGT 


TTACTTGTGT 


TAAAAGATTG 


CTTCATAACA 


ACCCCCTTTG 


5940 


ACTAAGACGT 


AAAAGAAAAG 


CCCTGCCATC 


TACATGACAG 


GGACGAATGT 


GTTTATCCGC 


6000 


GGCG 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5B57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



TGTAGAATTC 


ACGACAATGC 


TTCGTTGATT 


TCTGGGTTGA 


TTTCGTCGCG 


TTCTGGCAAG 


60 


CGAGTCAATG 


AACCAAAAAT 


AGTACACAAT 


GTGGTATAAT 


CCTTTTATGG 


CATATTCAAT 


120 


AGATTTTCGT 


AAAAAAGTTC 


TCTCTTATTG 


TGAGCGAACA 


GCTAGTATAA 


CAGAAGCATC 


180 


ACACGTTTTC 


CAAATCTCAC 


GTAATACCAT 


TTATGGCTGG 


TTAAAGCTAA 


AAGACAAAAC 


240 


AGGAGAGCTA 


AACCACCAAG 


TAAAAGGAAC 


AAAACCAAGA 


AAAGTTGATA 


GAGATAGACT 


300 


TAAAAACTAT 


CTTACTGACA 


ATCCAGATGC 


TTATTTGACT 


GAAATAGCTT 


CTGACTTTGG 


^ £ A 


CTGTCATCCA 


ACTACCATCC 


ACTATGCGCT 


CAAAGCTATG 


GGCTACACTC 


GAAAAAACAA 


420 


CCACACCTAC 


TATGAACAAG 


ACCCAGAAAA 


AGTAGCCTTA 


TTTCTT AAG A 


ATTTTAATAG 




TTTAAAGCAC 


CTAACACCTC 


TTTAGATTGA 


CGAAACAGGA 


TTCGATACTT 


ATTTTTATCG 




AGAATATCGT 


CGCTCATTAA 


AAGGTCAGTT 


AATAAGAGGC 


AAAGTATCTG 


GAAGAAGATA 


600 


TCAGAGGATT 


TCTTTGGTTG 


CAGCTCTAAC 


AAATGGTGAG 


TTAATCGCTC 


CAATGACTTA 


660 


CGAAGAGACG 


ATGACGAGCG 


ACTTTTTTGA 


AGCTTGGTTT 


CAGAAGTTTC 


TCTTACCAAC 


720 


ATTAACCACA 


CCATCGGTTA 


TTATTATGGA 


TAATGCAAGA 


TTCCATAGAA 


TGGGGAAGCT 


780 


AGAACTCTTG 


TGTGAAGACT 


TTGGGTATAA 


ACTTTTACCT 


CTTCCTCCCT 


ACTCACCTGA 


840 


GTACAATCCT 


ATTGAGAAAA 


CATGGGCTCA 


TATCAAAAAG 


CACCTCAAAA 


AGGTATTACC 


900 


AAGTTGCAAT 


ACCTTTTATG 


AGGCTTTTTT 


GTCTTGTTCT 


TGTTTCAATT 


GACTATATAA 


960 


ATTCTCTAAG 


CGAAACAACC 


GATAAGAATT 


GGCACAAAAG 


CGACCGTATT 


TTTCTT ACCA 


1020 


ATACAGGAAA 


AACAGTTCAT 


AGTTCTATCT 


TGACCAAGTC 


TCTCCAGCCA GCAAACGAAC 


1080 


GCCTTAAAAA 


ACCAATTCCC 


AAACATCTGT 


CCCCTCACAT 


CTTCAGACAC 


ACCACTATTA 


1140 


GCATCTTATC 


AGAAAATAAA 


ATTCCTTTAA 


AAACAATCAC 


GGACAGGGTT GGTCATCCCG 


1200 


ACTCTGAAGT 


CACTACTTCC 


ATCTACACCC 


ACGTCACAAA 


CAACATGAAA 


GATGAAGCAA 


1260 


TCAATGTACT 


GGATAAAGTT 


ATGAAAAAGA 


TTTTTTAAAA 


AGTTTTGTCC 


CTTTTTTGCC 


1320 


CTCTAAATAC 


AAAAATAGCC 


CTTCGGATAA 


AATCCGAGGG 


GCTAGAAACG 


TTGTTAAATC 


1380 
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AACGGCCGAA 


CTTTTGAATT 


TCATGGTTCG 


CGATAAAATA 


GTTCACTGAA 


CTATTTTATT 


1440 


TTTTAAGGTT 


ATCATAATAT 


CAAATACTTC 


AATTAAATAC 


GCTAAATTAC 


TAATATACTT 


1S00 


TTTACCTTTT 


TCATTCTAAA 


ATGTAAAGTA 


CAAACAATTA 


CAATATACTA 


GAGGGGGAGT 


1S60 


AAAAAACGTA 


TTAAATCGAT 


GAGTTCAGCA 


GGCAAGAAAA 
TTCATGAATT 


TAGCACCTTT 


ACGGGTGCTA 


1620 
1660 


TTTTTTAATT 


AACGCCACGT 


TAACTTTTGA 


TTATTGTTTG 


GCACTTCTTT 


CATTTCACGG 


TAAACATCGA 


TGAAATTCTT 


TCCAACATTA 


TTTTTGGAGT 


TAACTGCATT 


1740 


TATTTTTGTA 


TTAATAACTT 


TTTTAGTATC 


GAAAGAATGG 


TTTAAGAAAT 


CCATAACTAA 


1800 


CTCTCCTTTC 


TCATCCTCTA 


ATCAAGATTT 


TTATCAATGT 


CAAAATAGTA 


TTTTCTATCA 


1860 


ATCCAAATTG 


GTCCTTCTCC 


TTTAGAAATA 


GCAAGTACAT 


CTACCGGACC 


TCCTACTGTT 


1920 


TCAAGACTGT 


TGACAATTTT 


TCTCTTAAAT 


GAAGTTAATT 


CAATAAATGT 


TTTACCTGTA 


1980 


CTCCCCATTT 


CATTAAGTGG 


TTGCATTCCA 


ATAAGGTCTA 


TTATAGGATT 


TATATAATAT 


2040 


TTTTGCTGTA 


TAGATGATAT 


ATTTTCAAAT 


ATATTCTCAA 


TTTCATCACC 


CAATCCATTT 


2100 


TTCTCCATAA 


CTCATGATAC 


TTCCTCTGCG 


ATATATACAT 


TTAAGTTAGG 


ATCTATACCA 


2160 


TTCATAATCG 


TCTCAACCAT 


CTCTGACTGT 


GCAAAAGGGA 


TTATATGACA 


AGTTTTATGA 


2220 


TGATTTATCA 


CACTTTCATT 


AATAACTTTC 


CAAATTAATC 


GTTTAGAAAA 


AATTCCATAT 


2260 


AATTC AATTT 


GTCTTATAGA 


TGGAAATATC 


TCGTCTGTAC 


CATAACCTGC 


TATAACTAAT 


2340 


CCAGTTATGT 


TTGTTGAGTC 


ATATCCAATG 


AAAATCCCTT 


TATATAAACA 


TTTAGCAATA 


2400 


ACTTCAACCT 


CATCATCAGT 


ATGAGGAAAG 


GATTTAAAAA 


CATCGTCTAC 


AATGCTTTTT 


2460 


ATTAACTCTA 


ACTCAGCTTC 


AAAAAATTCA 


AAATTACTTT 


CAGCTTCTAC 


TTTTGAAATT 


2S20 


TCTAAACTAA 


AATTAGTTAT 


ACCATTTAAT 


AAAATTTTAT 


TAAAATCATC 


TAGAGTGATG 


2580 


GTTTCACCAT 


TAGAAACTCT 


TAAATCAGCT 


GTTTCTTGCG 


CTTCATAGGC 


AATGCTCTCC 


2640 


AAAATACTTC 


TTGTACTTCT 


GACAATATAA 


TTTCTTAATA 


AATCCTCAAC 


TTGTAGATGT 


2700 


TTAAAGGAAA 


TTAAAAATTC 


TATTAGCTTT 


TCAACGTATT 


GGGCAGTATT 


ATCTAATAAA 


2760 


TCTGTGCCAA 


TAGCCTGCTT 


AAACTCATTT 


AAAATTACCT 


CCCACGGAAT 


TTCCATAAAC 


2820 


GAAGCGTTCC 


CATATATCAT 


GATCCCCACG 


GAATGTTCTT 


TTGATAAACT 


GAATAATTTT 


2880 


CGGGCGCTAT 


TAAAAACTTT 


TGAATTTTTC 


CCGTCTGATA 


AGGTTACAGC 


GCTATCAGAA 


2940 


CCCAATACAA 


CACCATTTTT 


ATTTAATATT 


CCAATTTCTG 


CTCTCAAAAT 


ATCACCTAAA 


3000 


CTTTCTAAAC 


CTGCTCATGC 


TCTAATGGTA 


CAACAGCTAA 


GGTCTTACCA 


AGACTTGCCA 


3060 


ACACTTTTAA 


TACTGTATCA 


AGTTGTGGGC 


TTCTCTTTCC 


TGTTTCCATT 


CTAGCGATAA 


3120 


CTGGCTGACT 


AACACCGCTC 


ATCTCCTCTA 


GTTTCTTCTG 


ACTAATACCC 


TTTTCATTTC 


3180 



I 
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AAGCTCACTC 


atgataccca 


CGCGCATATC 


ACTTTCCAAA 


ATTTCCTCTT 


3240 


i WW 1 WAA 1 /»n 




ACATCTTTCC 


ACTTACTACC 


AATAGCATTA 


TTTTTCATTG 


3300 


t W 1 AAAWW 1 W 




ATCTGCAAGT 


TCACGTTTAG 


CTTGCTCAAT 


CTCTCTTTTG 


3360 


GGTGTTTTCT 


V> lu 1 WW lilt 


CATAAAATGA 


TCCAGTAAAA 


CAAAACTACC 


ATCCATCCAA 


3420 


at& 


f\£V\ 1 1 W i /\ a w 


TCTAAGTGCT 


CTC AGCTCC C 


AAATTTCAGC 


ATCTAAATGC 


3480 


TTAATATATw 


W» 1 lLuLl luL 


uUu lull W\.fl 


TCTTCCCTTA 

A W 1 4 WWW * * *■ 


ACAACTCAAT 


ATAATCATTA 


3S40 


ATTTTATTAA 


GCTTAATTCT 


GCTATCTTTC 


^ ^^n^ j *^^ j ^ J ^ A ^ 

WWII 111 I nV» 


TGGTAAGCTC 


TCGC AT ATAA 


3600 


TCAAAAACAG 




Willi 1A1\W 


TTftTAAAAAT 


AGATATTATG 


CACT ATT AAC 


3660 


m M«*gBUMtu«kMM«M 

ACCTCTTCCT 


AATAAlAATT 


nl AAWW 1 AAA 




TCTAAATACT 


*I*TT AAG TT AT* 


3720 


#*t * ft ft k #a ih k ft ft 

TAAAATAAAA 


AGCAwwTAGT 


11 LL 1 nwA 1 v* 


W I AwWAWA/* A 


CACACCCATT 


CGCACCGTGC 


3780 


CTACCTCTAT 


CAAG(jTGT AC- 


TCCTTCTATA 


Lin i WW W 1 1 w 


TCCTTTACAA 

1 wW ill AU/V\ 


TATTATACCA 

4 4 4 #■ 4 r^\+\mr\ 


3B40 


CACAATCAAC 


TAG AT ACCTA 


CCATCTWATtp 


A 1 A 1 AW WW WW 


ATTTWWirA 
nl 1 1 i uVfv)V.A 


AGGGTXCAXC 


3900 


GCTAAAATAC 


AAATCAGAAT 


AGATATTAAA 


C CACT T ATTT 


AAfTTATCAT 
rv\W 1 Inl W/\ 1 


AAGCTGGTGA 


3960 


TTGACTGATA 


AATAATATWW 


wL 1 UnLAAuL 


Tf rr. A T A Af* A 


TTCATCTGAT 


TGTACACATA 


4020 


AACCTCTTTT 


AC AG CCTw - A 


AAA iul WAwW 


W i WAW 1 lul 1 


TCTACCCTAA 
4 w & r\^> W * nn 


TATCTGTTAT 


4080 


CTGCTTGATA 


GTTGCGTATT 


I I l V*A l KMliU 


1 AW>W A 1 A 1 W 1 


•PnATTTTTAR 


CAGCATCAAA 


4140 


ft MMWI ft ^* ^ ^ 

CATTTTACGC 


TCAAGUACAL 


1 A 1 AW 1 X nVAl 


4 a %* * & W & A *. « 


TCTCCCATGA 


AATACCACTT 


4200 


GAGwCATAAA 


ATCTTTTCTC 


u u> 1 w ini 1 AW 


AfiAAATACGC 


TpAATTTTCT 

ft \* ^^^^ * 4 * * ^* * 


TCTTXGTCAT 


4260 


TGCTACCTCC 


I AAA I LA 1 UA 


ATTTJL&r&AT 
A 1 1 X AAW«A 4 


TCTAACCACT 


CACTTTTAGA 


AATAGTTCCA 


4320 


TAGATwTTGT 


1 Lwn lulnlu 


ATACAAACCT 


TCTAAATCTT 


TTTCCACCCT 


AATATAGTTC 


4380 


ATCTTATCCT 


WA1 Ww InviU 


AAACTATAGT 


ATTT c cgttt 


CATCCTCGTT 


TAGGATACGA 


4440 


TTGCACCAAT 


CATCAATAAT 


AACTGG CACT 


TCCCACTCAC 


CCCA'tlT'i'lT 


AAGGTTTTCT 


4S00 


AAAAGTTCAT 


TATCACTAAA 


TAGCTCGCCA 


TCTATTTGGA 


AAAATTCCCC 


TAAGTCATTG 


4S60 


TTTCCTTCAA 


CAATAATAAA 


CTCTGGCATA 


TTTCTATTAC 


TTAATAACTC 


CTTGAGTTCT 


4620 


TGTAACTCTT 


TGATTTCCTT 


TAGATACTTC 


CTCAATTTCC 


AACCTCAATT 


CTTCAATCTG 


4680 


CCTTACTACT 


CCaAAAATTT 


CATGGGTCTT 


ATAAGATTGT 


TCAAGTATAG 


CCTTTGCTGC 


4740 


TTGAGTTCTT 


ATAAACGGGT 


TGACCTTACT 


GTCCATCATA 


ATATCATTCA 


GTACAGAAAC 


4800 


AGCGTTACAT 


gatcctaaat 


AAAGCATTTG 


AGTTGTTTTA 


TCCATCATCT 


C AT CTTGCTT 


4860 


TATCCTCAAT 


GTCTTTTTAA 


CCGCTGCAAC 


TTTTAGATAC 


TTATGACCTC 


TTGCGCGTGA 


4920 
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TACCCCTCCT 


TTTTGACATG 


CTTTGTCTAT 


CGTTGGCTCG GTAAGCATGG 


CATCTATGAA 


4980 


TTTAATTTGC 


TTGGACCTAA 


GGTTATCATT 


TTCATTTCCT 


GCCATCTATT 


ACCTCCTCAT 


5040 


TATCAAAATA 


AAGGGTTGCC 


CCTTTATTTC 


CCTATGCTAG 


ATAATTCTGC 


AATTCTGCAT 


5100 


CCATTGCCTC 


TGAATTGCCC 


TCAACAATCA 


TTTCATCCTG 


TACTAAATCA 


ATCTTATCTC 


5160 


CGTTAATAAG 


TAAACCACCG 


TGGAAATAAT 


CAATTTTTCT 


ATCRAGGAAA 


TCTACTAGCT 


5220 


TTTCAAGGCG 


TTGCTGTTGG 


CTGAATTGCT 


CCATCTCAAT 


TTCGATATAA 


GCAAGGGTAG 


5280 


TATCATTATC 


CATAATATCT 


TCTAATTTTC 


TAAGAGCTAG 


AGGTTTATTT 


TTATATTTTT 


5340 


CTAGGTATTC 


TCTCATTTCT 


GCCACTCTTA 


ATTTGATACT 


AGATAATAAA 


CTTAGTTCAC 


5400 


CTCCATCATC 


TGCTGTAATA 


GGCTCTTCTT 


TTGATTCATC 


GTTTGCTAGT 


TCAGCATTTT 


5460 


TCTCTTTTTC 


TAGTTGCTGA 


TACAATAGCT 


GAGCAGTATT 


TTGGGAATAG 


TTTTCGCCCT 


5520 


CTTTTTTATA 


TTTTAAAAGT 


TCTTGCTCTG 


CATACACTTT 


CCCGATAATC 


ACTTCCTTAT 


5580 


AAACTAATTG 


CCCATCTTGA 


CCTTTTACCT 


TAATACTCCC 


ATGCTCTGCA 


ATTTCAATAT 


5640 


ACTTAATTAT 


ACCATTTTTT 


GAGTATAAAA 


CAAAGCCTTT 


CTCCATCATT 


TTTAATAATT 


5700 


TATCATCCTT 


GTTTTCAGTC 


ATGCTTTTCT 


CCTTTATTTC 


ATTTTATTAT 


AATCTGAATA 


5760 


CCCCTAGTCT 


ATTTATTTCA 


CTAGGTTTTT 


AGGGTTCGTA 


TGCTAAAATA 


CTACCCTTTT 


S820 


TGTGTACCTT 


ATGGCTGACT 


TTTCAAATTG 


GTTAGTT 






S657 



(2) INFORMATION FOR SEQ ID NO: 29: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10254 base pairs 

( B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



AAAATGATAG 


CAGGAGAGTT 


TTCCCCTCCA 


TCACACCCAG 


AACTGAGAGC 


CTTAGCTCAG 


60 


GCTTCTCGCC 


AAAAACAGGC 


CGCCTTTAAC 


AAGGAAGAGA ACCCCTTGAA 


GGGAGCCGAA 


120 


ATCATCAAGA 


CTTCGTTTGC 


CTCAACCGGG 


AAAAATCTTT 


ACATCAACAC 


TCGCTTGATG 


180 


GTGGACTACG 


GTCTCAACAT 


CCATCTAGGG 


GAAAATTTTT 


ATTCTAATTG 


GAACTTGACC 


240 


ATGCTGGATA 


TCTGTCCCAT 


TCCTATCGGG 


GACAATGCTA 


TGATTGGTCC 


T AATTGTC AG 


300 


TTTTTGACAC 


CCCTCCATCC 


ACTAGATCCA 


CAGGAACGCA 


ATTCAGGTAT 


CGAGTACGGA 


360 


AAGCCTATCA 


CAATCGGAGA TAATTTCTGG 


ACTGGTGGTG 


GCGTCATTGT 


CCTTCCTGGA 


420 


GTGACACTGG 


GAAATAATGT 


CGTTGCACGA 


GCAGGGGCAG 


TAATTACCAA 


ATCTTTTGGC 


480 
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GACAACGTTG 


TCCTAUtTW* 


W#w\ l WW x www 


CGCGTGATTA 


AGGAAATACC 


TGTTAAATAG 


540 


AAGTAAAAAG 




WW 4 »U« * .1 W k 




TTTCATCATT 


TTTTACCCAG 


€00 


TTCACATTTA 


CCTACTVi A I 


/T^TT A A A 


GTCTGTTTCA 


TTAAGCAAGT 


TCAAAGCATC 


660 


TCGTAAGTGG 


GATGTTTTTC 


i WV- i\Jtt> 1 lv 


ATCACCTTCC 

w\ i w nw w * * » 


TCCTTGACAC 


TCGGTCAGAT 


720 


TTTGATACAA 


•n ft mniAR IV IV A 

TACTACAAAA 




CAGGCTATGA 


TTCAGAAACA 


TGCGATTCCT 


780 


ATTTT AGAGT. 


TTGATGACAA 




CTTATCATGC 


CCAATCACGA 


GGGGCTGGAC 


840 


TTGCAGTTGC 


CAAAGAAGTG 


TGl iiATGLA 


1 1 i t i /*ww * w 


AGGAGATTGA 


CCGCTATGCG 


900 


AGGGAAGTAG 


GGGCGAACTG 


TGTTGGCGAA 


1 1 1 W 1 i 1 w * w 




CTATCCAGTT 


960 


TATGTCGTGA 


ACTACAAGGA 


CGAGuAGuTv. 


IVlVp Iwuv * 


awww i ww • w » 


A \*WW A WWXJW A 


1020 


CCAGCAGCCC 


AGTTTATGGA 


TTGGTTGATT 


ATWiTvi 


i wwAwWAwA i 


± A 4 W & W & «%W A 


1080 


GGGACCTGTG 


GTGTCCTAGC 


TGATATAGAG 


O lv & lv iv TPPPT 


1 IV. 1 n\» 1 Www 


1 W 1 4 WWW WW A 


1140 


CTGCGAGATG 


AAGGAGCCAG 


TTACCACTAT 




w 1 Ww 1 lAt Ai 


CCAAATCCAG 


1200 


CCAGAGGCTA 


TTGCTGCTAT 


TGAGGAAGTT 


TTwGAAGAwA 


wAwwwAI t WW 


TT ATfi AAHAA 

1 In IvinnwM 


1260 


GTCATGACCT 


GGACGACACA 


CGGTTTTTAC 


WwAwAAAWww 


W 1 w/WW*Ww 1 


GC CTTATCGT 


1320 


AAGGAACAAG 


GCTGTGCTGT 


TGTGGAGATw 


UAU lu I 


W 1 W 1 i WW WWW 


ACT AGCTCAA 


1380 


TTGCGTGGGG 


TTCTCTGGGG 


TGAATTGTTG 


TTVACAuUUj 


fc TTrrrr a fif* 

A i ILtwl nuv. 


wwr\w a Awwnw 


1440 


CAGTACGACA 


GTCCTGACTG 


GGGCTCwAA 


GCTTTTAATA 


AW WW WW * wn 


ACTGAGTTTA 


1500 


GCAAGTGTTC 


ACCACCTTTA 


GTTGTACTww 




G7*?TT ATCA7 


AAAATGTCTA 


1560 


GCTCATACTT 


TTCAAAAATA 


TwTTTAAAww 


4% WW i W/>*- W 1 1 


WW i W * i W 4 WW 


TAGGCATGTT 


1620 


GAGGTTGGGA 


AAAATCTTTA 




AACCTATCAT 


ATCAGGTGAT 


CAAAACTTTG 


1680 


ACACTATGCG 


TTTTATGTCG 


A*!* & n/~ M'TT A 
A 1 Anun IHA 


w/\w* w#%wn • w 


AAATGATACT 


CTTCGAAAAT 


1740 


CTCTTCAAAC 


CAGGTCACCT 


TCACCTTGCC 


GTACGTATAT 


GTTACTGACT 


TCGTCAGTCT 


1800 


TATCCGGCAA 


CCTCAAAACG 


GTGTTTTGAG 


CTGACTTCGT 


CACTTCTATT 


TGCAACCTCA 


1860 


AAACAGTCTT 


TTGAGCAACC 


TGTGACTAGC 


TTTCTAATCG 


ATGCCTTCGT 


TTTCATTGCC 


1920 


TATAATCAAA 


AAGAGAAATT 


TTCTCCTGAA 


AAGCATATAG 


AGTAGCTGGC 


GTTAAAAGCT 


1980 


CCTGTCTTGC 


TTTTTTGACC 


TATAGTCACA 


TCTATCAAGT 


ATTGTTCTTG 


CCTAAGCTAT 


2040 


CAATAAAAAG 


GTGGCATTTT 


TTAGGCTTGG 


TGTTAGTAGA 


TTTTGCCTTA 


TCCTATCTAA 


2100 


GTCATTTCGA 


ACTTTTTATG 


GTACAATGGA 


AACATGTTAT 


TCAAATTATC 


TAAGGAAAAA 


2160 


ATAGAGCTAG 


GCTTATCTCG 


TTTATCGCCA 


GCCCGTCGTA 


tttttttgag 


TTTTGCCTTG 


2220 
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GTCATTTTAC 


TAGGCTCTCT 


TCTTTTGAGC 


TTGCCCTTTG 


TCCAAGTTGA 


AAGCTCACGA 


2280 


GCCACTTATT 


TTGATCATCT 


TTTCACTGCT 


GTCTCTGCAG 


TCTGTGTGAC 


GGGTCTCTCA 


2340 


ACCCTTCCAG 


TAGCTCACAC 


CTATAATATC 


TGGGGTCAAA 


TAATCTGTTT 


GCTCTTGATT 


2400 


CACATCGCTG 


GTCTAGGGCT 


CATGACCTTT 


ATTGGGGTTT 


TCTATATCCA 


GAGCAAGCAA 


2460 


AAGCTTAGTC 


TTCGTAGCCG 


TGCAACTATT 


CAGGATAGTT 


TTAGTTATGG 


AG AAACTCGA 


2S20 


TCTTTGACAA 


AGTTTCTCTA 


TTCTATTTTT 


CTCACGACCT 


TTTTGGTTGA 


GAGCTTGGGA 


2S80 


GCTATTTTGC 


TTAGTTTTCG 


CCTTATTCCT 


CAACTTGGCT 


GGGGACGTGG 


TCTTTTTAGT 


2640 


TCCATTTTTC 


TAGCGATCTC 


AGCCTTCTGT 


AATGCCGGTT 


TTGATAATTT 


AGGGAGCACC 


2700 


AGTTTATTTG 


CTTTTCAGAC 


CGATTTACTG 


CTCAATCTGG 


TGATTGCAGG 


CTTGATTATT 


2760 


ACAGGCGGCC 


TTGGTTTTAT 


GGTCTGGTTT 


GATTTGGCTG 


GTCATGTAGG 


AAGAAAGAAA 


2820 


AAAGGACGTC 


TGCACTTTCA 


TACGAAGCTT 


GT ACT ATT AT 


TGACTATAGG 


TTTGTTGTTA 


2880 


TTTGGAACAG 


CAACTACTCT 


CTTTCTTGAG 


TGCAACAATG 


CTGGAACGAT 


TGGCAATCTC 


2940 


CCTGTTGCCG 


ATAAGGTTTT 


AGTTAGCTTT 


TTTCAAACAG 


TGACGATGCG 


AACAGCTGCC 


3000 


TTTTCTACGA 


TAGATTATAC 


TC AGGCT CAT 


CCTGTGACTC 


TTTTGATTTA 


TATCTTACAG 


3060 


ATGTTTCTAG 


GTGGGGCACC 


TGGAGGAACA 


GCTGGGGGAC 


TCAAGATTAC 


GACATTTTTT 


3120 


GTCCTCTTGG 


TCTTTGCACG 


AAGTGAGCTT 


CTAGGCTTGC 


CTCATGCCAA 


TGTTGCGAGA 


3180 


CGAACGATCG 


CGCCCCGAAC 


GGTTCAAAAA 


TCCTTTAGTG 


TC 'ITT ATT AT 


CTTTTTGATG 


3240 


AGCTTCTTGA 


TACGATTGAT 


TCTGCTAGGG 


ATAACAGCCA 


AAGCCAATCC 


TCCCTTTATC 


3300 


CACCTCGTAT 


TTGAAACCAT 


TTCACCTCTT 


ACTACAGTTG 


GTGTAACGGC 


AAATCTGACT 


3360 


CCTGACCTTG 


GGAAATTGCC 


TCTCAGTGTT 


ATCATGCCAC 


TTATGTTTAT 


GGGACGAATT 


3420 


GGTCCCTTGA 


CCTTGTTTGT 


TAGCTTGGCA 


GATTACCATC 


CAGAAAAGAA 


AGATATGATT 


3480 


CACTATATGA 


AAGCAGATAT 


TAGTATTGGT 


TAAGAAAGGA 


AAGAGCATGT 


CAGATCGTAC 


3S40 


GATTCGAA.TT 


TTGGGCTTCG 


GAATTTTTGG 


GAGCAGTGTC 


CTAGCTGCCC 


TAGCCAAGCA 


3600 


GGATATGAAT 


ATTATCGCTA 


TTGATGACCA 


CCCACAGCGC 


ATCAATCAGT 


TTGAGCCAGT 


3660 


TTTCGCGCGT 


CGAGTGATTG 


GTGACATCAC 


AGATGAAGAA 


TTATTGAGAT 


CAGCAGGGAT 


3720 


TGATACCTGC 


GATACCGTTG 


TAGTCGCGAC 


AGGTGAAAAT 


CTGGAGTCGA 


GTCTGCTTGC 


3780 


GGTTATGCAC 


TGTAAGAGTT 


TGGGGGTACC 


GACTGTTATT 


GCTAAGGTCA 


AAAGTCAGAC 


3840 


CGCTAAGAAA 


GTGCTAGAAA 


AGATTGGAGC 


TGACTCGGTT 


ATCTCGCCAG 


AGTATGAAAT 


3900 


GGGGCACTCT 


CTAGCACAGA 


CCATTCTTTT 


CCATAATAGT 


GTTGATGTCT 


TTCAGTTGGA 


3960 


TAAAAATGTG 


TCTATCGTGG 


AGATGAAAAT 


TCCTCAGTCT 


TGGGCAGGTC 


AAAGTCTGAG 


4020 
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TAAATTAGAC CTCCGTGGCA AATACAATCT CAATATTTTG GGTTTCCGAG AGCAGGAAAA 4080 

TTCCCCATTG GATGTTCAAT TTGGACCAGA TGACCTCTTG AAAGCAGATA CCTATATTTT 4140 

GGCAGTCATC AACAACCAGT ATTTGGATAC CCTAGTACCA TTGAATTCGT AAAGAGGGAT 4200 

GACCCCTCTT TTTTGATGCC TAAGATGGCA AATAGAGACA GAAGCCCCTT CTCTTCTAGT 4260 

AAAAGTTCTT CAAAGGCTGG ACTTTATGCT AAAATAGAAA GAACTGACAA GAGAGAGTAA 4320 

TACTCAATGA AAATCAAAGA TCAAACTAGG AAACTAGCTA COGGCTGCTC AAAACACTGT 4380 

TTTGAGCTTG CACATAGAAC TGACGAAGTC ACTAACATCT ATACGGCAAG CCGACCTTGA 44 40 

CGCGGTTTGA AGAGATTTTC GAAGAGTATA AGAAAAAATC AGTCCCCTAA AGGAGTAGAT 4500 

TATGAAGTTA TTGTCTATCG CAATTTCTAG CTATAATGCA GCAGCCTATC TTCATTACTG 4 560 

TGTGCACTCG CTAGTGATTG GTGGTGACCA AGTTGGGATT TTGATTATCA ATGACCGGTC 4620 

TCAGGATCAC ACTCACGAAA TCGCTGAGTG TTTAGCTAGC AACTATCCTA ATATCGTTAG 4680 

AGCCATCTAT CAGGAAAATA AATGCCATGG CGGTGCGGTC AATCGTGGCT TGCTAGAGGC 4740 

TTCTGGGCGC TATTTTAAAG TAGTTGACAG TGATGACTGG GTGGATCCTC GTGCCTACTT 4800 

GAAAATTCTT GAAACCTTGC AGGAACTTGA GAGCAAAGGT CAACACCTGG ATGTCTTTGT 4 8 60 

GACCAATTTT GTCTATGAAA AGGAAGGGCA GTCTCCTAAG AAGAGTATGA CTTACGATTC 4 920 

AGTCTTGCCT CTTCCGCAGA TTTTTGGCTG CCACCACGTC GGAAATTTCT CCAAAGGCCA 4980 

GTATACCATG ATGCACTCGC TGATTTATCG GACAGATTTG TTGCGTGCTA GCCAGTTCTA S040 

ACTGCCTGAA CATACTTTTT ATGTCGATAA TCTCTTTGTC TTTACCCCCC TTCAGCAGGT 5100 

CAAGACCATG TACTATCTGC CTGTCGATTT CTATCGTTAT TTCATTGGGC GTGAGGACCA 5160 

GTCTGTCAAT GAGCAAGTGA TGATTAAGTG CATTGACCAG CAACTCAAGG TCAATCGACT 5220 

CTTGATAGAC CAACTTGATT TGTCCCAACT CACTCATCCC AAAATCCGAG AATATCTCCT 5280 

GAATCATATT GAACTCACGA CGGTGATTTC CAGTACCCTG CTCAACCGAT CTGGAACAGC 5340 

GGAGCATCTC GCAAAAAAAC GCCAATTGTG GACCTATATT CAGCAGAAAA ATCCAGAACT 5400 

CTTTCAGCCT ATTCGTAAGA CCATCTTGAG CCGTTTGACC AAACATTCTG TCTTCCCAGA 54 60 

TCGCAAACTG TCCAATCTCG TCTATCAAAT CACCAAATCT GTTTATGGAT TTAATTAATA 5520 

TAAGTCTTTT ATAAGAGGGA TTTAAGAAAA ATTTTAACTT TTTCTTAGTC CTTTTTAATT 5580 

TCAGGAGATT ATACTAGAGT CATCAAATAA AGAAAGACTC TAAGGAGAAT CCTATGAAAT 5640 

TCAATCCAAA TCAAAGATAT ACTCGTTGGT CTATTCGCCG TCTCAGTGTC CGTGTTGCCT 5700 

CAGTTGTTGT GGCTAGTGGC TTCTTTGTCC TAGTTGGTCA CCCAAGTTCT GTACCTGCCG 5760 
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ATGGGCTCAA 


TCCAACCCCA 


GGTCAAGTCT 


TACCTGAAGA 


GACATCCGGA 


ACGAAAGAGG 


SS20 


GTCACTTATC 


AGAAAAACCA 


GGAGACACCG 


TTCTCACTCA 


AGCGAAACCT 


GAGGGCGTTA 


5680 


CTGCAAATAC 


GAATTCACTT 


CCGACACCTA 


CAGAAAGAAC 


TGAAGTGAGC 


GAGGAAACAA 


5940 


GCCCTTCTAG 


TCTGGATACA 


CTTTTTGAAA 


AAGATGAAGA 


AGCTCAAAAA 


AATCCAGAGC 


6000 


TAACAGATCT 


CTTAAAAGAA 


ACTGTAGATA 


CAGCTGATGT 


GGATGGGACA 


CAAGCAAGTC 


6060 


CAGCAGAAAC 


TACTCCTGAA 


CAAGTAAAAG 


GTGGAGTGAA 


AGAAAATACA 


AAAGACAGCA 


6120 


TCGATGTTCC 


TGCTGCTTAT 


CTTGAAAAAG 


CTGAAGGGAA 


AGGTCCTTTC 


ACTGCCGCTG 


6180 


TAAACCAAGT 


AATTCCTTAT 


GAACTATTCG 


CTGGTGATCG 


TATGTTAACT 


CGTCTATTAC 


6240 


TAAAAGCTTC 


GGATAATGCT 


CCTTGGTCTG 


ACAATGGTAC 


TGCTAAAAAT 


CCTGCTTTAC 


6300 


CTCCTCTTGA 


AGGATTAACA 


AAAGGGAAAT 


ACTTCTATGA 


AGTACACTTA 


AATGCCAATA 


6360 


CTGTTGGTAA 


ACAAGGTCAA 


GCTTTAATTC 


ATCAACTTCG 


CGCTAATGGT 


ACTCAAACTT 


6420 


ATAAACCTAC 


TGTTAAAGTT 


TACGGAAATA 


AAGACGGTAA 


AGCTCACTTG 


ACTAATCTAG 


6480 


TTGCTACTAA 


AAATGTAGAC 


ATCAACATCA 


ATGGATTAGT 


TGCTAAAGAA 


ACAGTTCAAA 


6S40 


AAGCCGTTGC 


AGACAACCTT 


AAAGACAGTA 


TCGATGTTCC 


AGCAGCCTAC 


CTACAAAAAG 


6600 


CCAAGGCTGA 


AGGTCC ATTC 


ACACCAGGTG 


TCAACCATGT 


GATTCCATAC 


GAACTCTTCG 


6660 


CAGCTGATGG 


CATGTTGACT 


CGTCTCTTGC 


TCAAGGCATC 


TGACAAGGCA 


CCATGGTCAC 


6720 


ATAACGGCGA 


CGCTAAAAAC 


CCAGCCCTAT 


CTCCACTAGG 


CGAAAACGTG 


AAGACCAAAG 


6780 


GTCAATACTT 


CTATCAACTA 


GCCTTGGACG 


GAAATGTAGC 


TGGCAAAGAA 


AAACAAGCGC 


6640 


TCATTGACCA 


GTTCCGAGCA 


AAyGGTACTC AAA CTT AC AG 


CGCTACAGTC 


AATGTCTATC 


6900 


GTAACAAAGA 


CGCTAAACCA 


GACTTGCACA 


ACATCGTAGC 


AACTAAAAAA 


GTCACTATTA 


6960 


ACATAAACGG 


TTTAATTTCT 


AAAGAAACAG 


TTCAAAAAGC 


CGTTGCAGAC 


AACGTTAAAG 


7020 


ACAGTATCGA 


TGTTCCAGCA 


GCCTACCTAG 


AAAAAGCCAA 


GGGTGAAGGT 


CCATTCACAG 


7080 


CAGGTCTCAA 


CCATGTGATT 


CCATACGAAC 


TCTTCGtAGC 


TGATGGTATG 


TTGACTCGTC 


7140 


TCTTGCTCAA 


GGCATCTGAC 


AAGGCACCAT 


GGTCAGATAA 


CGGTGACGCT 


AAAAACCCAG 


7200 


CCCTATCTCC 


ACTAGGTGAA 


AACGTGAAGA 


CCAAAGGTCA 


ATACTTCTAT 


CAATTAGCCT 


7260 


TGGACGGAAA 


TGTAGCTGGC 


AAAGAAAAAC 


AAGCGCTCAT 


TGACCAGTTC 


CGAGCAAACG 


7320 


CTACTCAAAC 


TTACACCGCT 


ACAGTCAATG 


TCTATGGTAA 


CAAAGACGGT 


AAACCAGACT 


7380 


TGGACAACAT 


CGTAGCAACT 


AAAAAAGTCA 


CTATTAACAT 


AAACGCTTTA 


ATTTCTAAAG 


7440 


AAACAGTTCA 


AAAAGCCGTT 


GCAGACAACG 


TTAAGGACAG 


TATCGATGTT 


CCAGCAGCCT 


7500 


ACCTAGAAAA 


GGCCAAGGGT 


GAAGGTCCAT 


TCACAGCAGG 


TCTCAACCAT 


CTGATTCCAT 


7560 
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ACGAACTCTT 




ftGCATGTTGA 


CtCGTCTCTT 


GCTCAAGGCA 


TCTGACAAGG 


7620 


CACCATGGTC 




/-hCGCTAAAA 


ACCCAGCTCT 


ATCTCCACTA 


GGTGAAAACG 


7680 


TGAAGACCAA 


AGGTCAATAw 


1TCTATCAAG 
4 4 w 4 n 4 


TAGCCTTGGA 


CGGAAATGTA 


CCTGGCAAAG 


7740 


AAAAACAAGC 


GCTCAnvAC 




CAAACGGTAC 


TCAAACTTAC 


AGCGCTACAG 


7800 


TCAATCTCTA 


TGGTAACAAA 


n a r*fir?r a a A£* 


CAGACTTGGA 


CAACATCGTA 


GCAACTAAAA 


7860 


AAGTCACTAT 


TAAGATaAAT 


CTTAAAUAAA 




AGCAAATGGT 


TCATTATCAC 


7920 


CTTCTAACTC 


TGGTTCTGGC 




TCaATCACaA 


TCATGCTACA 


CGTACTACAG 


7980 


ATAGCATGCC 


TGCTGACACC 


ATGACAAGTT 




CATGGCAGGT 


GAAAACATGG 


8040 


CTGCTTCTGC 


TAACAAGATG 


TCTG AT ACGA 


if*Cl aTT-Tf Afl A 


CCATAAAGCT 


ATGCTACCAA 


8100 


ATACTGGTGA 


GACTCAAACA 


TCAATuGCAA 


UlAl 1 UU 4 A 1 


CCTTGGGCTT 


GCGCTTCCAG 


8160 


GTTTACTCGG 


TGGTCTAGGT 


TTGAAAAACA 


A AA A AfTAAftA 


AAACTAATCA 


GCTAAGGAAA 


8220 


TAAATGATGG 


ATAGTGGGCT 


GACTAAGATT 


m MMWlk ft fh /"» m ft 

AGTTTAACAA 




AATCAGGACT 


8280 


TTCTTTCAAT 


AGCAGATTAA 


AATCATCGTA 


& A Af*A A*PA A A 


AATAGTGTTA 

4 Aw 4 w 4 * *"» 


TACTTAAAGC 


6340 


AGTATACCAC 


TGTTTTT AT C 


AAAGG AG AG A 






ACTCCTTGAC 


8400 


GACGAGGTAG 


AAATCACAGA 


T ATT C AT v. A L* 




TTCAGGCAGG 


TTATCAGGTC 


B460 


TTGGTAGCCC 


ATGATGGACT 


GCAAGCGCTA 


CI A RTTCTTCA 


AGAAAAAACC 


GATTGATTTG 


8S20 


ATTATCACAG 


ATGTCATGAT 


CCCTCGGAT^* 


CATfWTTATC 

1 VJM 4 I /% 4 V» 


ATTTAATCAG 


TGAGGTTCAA 


8580 


TACTTATCAC 


CAGAGCAGCC 


I HtCTAI I I 


ATTACTCCTA 


AGACCAGTGA 


ACAGGACAAG 


8640 


ATTTACGGCC 


TGAGCTTGGG 


AwUUj A 1 V</\ i 


TTTATTftPTA 

4 4 4 *■ 4 4 • 


AGCCTTTTAG 


CCCACGTGAG 


8700 


CTGGTTTTGC 


GTGTCCACAA 


TAi i i i vA.uL 




GTGGGGGCGA 


AACAGAGCTG 


8760 


ATTTCCCTTG 


GCAATCTAAA 


An 1 w/v\ 4 Wft 4 


AGTAGTCATG 


AAGTTCAAAT 


ACGAGAAGAA 


8820 


ATGCTGGATT 


TAACTGTTAA 


ATCATTTGAA 


TTGCTGTGGA 


TTTPtAGCTAG 


TAATCCAGAG 


8880 


CGAGTTTTCT 


CCAAGACAGA 


CCTCTATGAA 


AACATCTGGA 


AACAAGACTA 


CGTGGATCAC 


8940 


ACCAATACCT 


TGAATGTGCA 


TATCCATCCT 


CTTCGACAGG 


AGCTGGCAAA 


ATATAGTAGT 


9000. 


GACCAAACTC 


CCACTATTAA 


GACAGTTTGG 


GGGTTGGGAT 


ATAAGATAGA 


GAAACCGAGA 


9060 


GGACAAACAT 


GAAACTAAAA 


AGTTATATTT 


TGGTTGGATA 


TATTATTTCA 


ACCCTCTTAA 


9120 


CCATTTTGGT 


TGTTTTTTGG 


GCTCTTCAAA 


AAATGCTGAT 


TGCGAAAGGC 


GAGATTTACT 


9180 


TTTTGCTTGG 


GATGACCATC 


GTTGCCAGCC 


TTGTCGCTGC 


TGGGATTAGT 


CTCTTTCTCC 


9240 


TATTGCCAGT 


CTTTACGTCG 


TTGCGCAAAC 


TCAAGGAGCA 


TGCCAAGCGG 


GTAGCGGCCA 


9300 
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AGGATTTTCC 


TTCAAATTTG 


GAGGTTCAAG 


GTCCTGTAGA 


ATfTCA^CAA 


TTAGGGCAAA 


9360 


CTTTTAATGA 


GATGTCCCAT 


GATTTGCAGG 


TAAGCTTTGA 


TTCCTTGGAA 


GAAAGCGAAC 


9420 


GAGAAAAGGG 


CTTGATGATT 


GCCCAGTTGT 


CGCATGATAT 


TAAGACTCCT 


ATCACTTCGA 


94B0 


TCCAAGCGAC 


GGTAGAAGGG 


ATTTTGGATG 


GGATTATCAA 


GGACTCGGAG 


CAAGCTCATT 


9540 


ATCTACCAAC 


CATTGGACGC 


CAGACGGAGA 


GGCTCAATAA 


ACTGGTTGAG 


GAGTTGAATT 


9600 


TTTTGACCCT 


AAACACAGCT 


AGAAATCAGG 


TGGAAACTAC 


CAGTAAAGAC 


ACTATTTTTC 


9660 


TGGACAAGCT 


CTTAATTGAG 


TGCATGAGTG 


AATTTCACTT 


TTTGATTGAG 


CAGGAGAGAA 


9720 


GAGATCTCCA 


CTTGCAGGTA 


ATCCCAGAGT 


CTGCCCGGAT 


TGAGGGAGAT 


TATGCTAAGC 


9780 


TTTCTCGTAT 


CTTGGTGAAT 


CTGGTCGATA 


ACGCTTTTAA 


ATATTCTGCT 


CCAGGAACCA 


9840 


AGCTGGAAGT 


GGTCGCTAAG 


CTGGAGAAGG 


ACCAGCTTTC 


AATCAGTGTG 


ACCGATGAAG 


9900 


GGCAGGCTAT 


TGCCCCAGAG 


GATTTGGAAA 


ATATTTTCAA 


ACGCCTTTAT 


CGTGTCGAAA 


9960 


CTTCGCGTAA 


CATGAAGACA 


GGTGGTCATG 


GATTAGGACT 


TGCGATTGCG 


CGTGAATTGG 


10020 


CCCATCAATT 


GGGTGGGGAA 


ATCACAGTCA 


GCAGCCAGTA 


CGGTCTAGGA 


AGTACCTTTA 


10080 


CCCTCGTTCT 


CAACCTCTCT 


GGTAGTGAAA 


ATAAAGCCTA 


AAACCCCTTT 


ACAAATCCAG 


10140 


CTATTCATGG 


TAGAATAGAT 


TTTGTGTCAA 


ATATCAGCAG 


GAAAGCATGA 


AGCTCGTCAA 


10200 


CAGGTGTCTT 


ATGACAACTA 


ACCTTGGCTG 


TTTAGGCGAA 


CGGCATCTGC 


ACGG 


10254 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9769 base pairs 
<B> TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCGGCGACTA 


TCGATAACAC 


TTGACTTGGT 


AGCCCCACAT 


TTTGGACAAC 


GCATCCTTTC 


60 


CCTCCTTATC 


GTTTTCTTTT 


CATTATACCA 


TTTTTTAAGC 


GATTCCCAAA 


ACAATTCTTC 


120 


TTTTTGCTTG 


ACAAGTTTTT 


TGTTTTGTTG 


TATTATTTAA 


TTAAGACAAC 


AAGGTAAAAG 


180 


AAAGGAGACT 


AAGATGTCCT 


CGACATTTGA 


CAACAAAAAA 


CCCATCTATT 


TACAGATTAT 


240 


GGAGAAAATC 


AACCTTCAGA 


TTGTTTCCCA 


TACACTGGAA 


CCCAATCAAC 


AACTTCCAAC 


300 


CGTGAGGACC 


TAGCTAGCGA 


GGCTGGTGTC 


AATCCCAATA 


CCATCCAAAG 


AGCCTTATCA 


360 


GACCTTGAAC 


GAGAAGGATT 


TGTCTACAGC 


AAGCGAACAA 


CTGGACGATT 


TGTGACTAAG 


420 


GATAAGGAGC 


TAATCGCCCA 


GTCACGCAAA 


CAATTATCAG 


AAGAACAATT 


GCAACACTTC 


480 
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CTTTCCTCCA 




TGGCTATGAA 


AAAGAAGAAC 


TACCAGGCGT 


AGTCAGTGAT 


540 


TATATTAAAw 


unu ill nAUV« 


CTATGTCATT 


ACTAGTATTT GAAAATGTAT 


CCAAATCATA 


600 


TGGAGCAACA 




AAAATGTTTC 


TCTTGACATT 


CCAGCTGGAA 


AAATTGTCGG 


660 


CCTTCTTCGG 






AACCCTGATT 


AAACTAATTA 


ATGGCCTCTT 


720 


ACAACCAGAT 


l»AAkVV*MWl» i V* 


TCCTCATCAA 


CGACATGGAC 


CCAAGCCCAG 


CAACCAAGGC 


780 


CCTTGTAGCT 


TAl 1 iGCLTu 


A 1 /\1>\*/\V*V» * *\ 


TCTCAATGAG 


CAAATGAAGG 


TCAAAGAAGC 


640 


CCTAACCTAC 


TTCAAGACCT 


in 1 AAnw* 


TTGTCAGATC 


TTGAACGCGC 


CCATCATCTA 


900 


CTTGCAGACC 


TGGGCATTGA 




CGTCTCAAGA 


AACTATCAAA 


AGGAAACAAA 


960 


GAAAAGGTTC 


AACTGATTTT 




CGTGATGCTC 


GTCTCTATCT 


TTTGGACGAA 


1020 


CCCATTGGTG 


GGCTGGATCC 




GCTTATATCC 


TCAATACCAT 


TATCAACAAC 


ioeo 


TACTCACCAA 


CTTCTACCGT 


TTTGATTTCT 


ACC CACTTG A 


TTTCTGATAT 


CGAGCCAATC 


1140 


TTGGATGAAA 


TTGTCTTCCT 




AAAGTCGTCC 


GTCAAGGAAA 


TGTAGATGAT 


1200 


ATTCGCTACG 


AGTCAGGTGA 


ATCCAl I bAL 


CAACTCTTCC 


GTCAGaATTT 


AAGGCCTAAG 


1260 


C AAAG G AG AT 


TATTTATGTT 


1 1 uuAA J 1 l n 


GTTCGCTACG 


AATTTAAAAA 


TGTTAACAAG 


1320 


TGGTATTTAG 


CCCTCTACGC 


AQCCGTbL 1 A 


GTCCTTTCTG 


CCCTCATCGC 


AATACAGACA 


1380 


CAAGGCTTTA 


AAAATCTACC 


TTACCAAunA 


AGTCAGGCTA 


CTATGCTACT 


TTTTCTAGCT 


1440 


ACAGTCTTTG 


GTGGCTTGAT 


U^. 1 1 ALAw i 1 


GGGATTTCAA 


CCATTTTCTT 


GATTATTAAA 


1500 


CGCTTCAAAG 


GTAGTGTCTA 


V> IjAV. 1. \*£\\^+*\ 


GGCTATCTGA 


CTTTGACCTT 


CCCACTTTCT 


1560 


GAACACCATA 


TCATCACAGt 


t» A/W^t- 1 Aft 1 V. 


GCTGCCTTTA 


TCTGGTCATT 


GATTAGCACC 


1620 


GCTGTATTGG 




TGTT ATT ATT 


t • ' »VW • ■ » /V\ 


CAGCTCCACA 


ATGCATTCCT 


1680 


CTTTCTTATG 


IwAi inltfti l 


TCTAGAAACA 


CATCTCCCTC 


AGATCTTTCT 


TACACCTATA 


1740 


TCCTTCCTAC 


TAAATACTAT 


TTCAGGAATC 


CTCTGCATCT 


ACCTGGCTAT 


TTTCATTGCA 


1800 


CAGCTTTTCA 


ATCAATACCG 


TACAGCACTC 


CCTGTTGCAG 


TCTACATTGC 


TATCCAAATC 


1860 


GTCATTGGAT 


TTATTGAACT 


TTTCTTCAAT 


CTTAGTTCTA 


ATTTCTATGT 


CAATTCACTG 


1920 


GTAGGACTCA 


ATGACCATTT 


CTATATGGGA 


GCAGGTATAG 


CCATTGTTGA 


AGAACTCATA 


1980 


TTCATACCTA 


TCTTTTATCT 


CGGAACCTAC 


TACATCTTGA 


GAAATAAGGT 


TAATTTGCTT 


2040 


TAAATAATTT 


TTACCTAGAT 


ATGTAACATA 


CTCATAGAAC 


AAAAGAGACC 


AGGCAAAAAG 


2100 


TCTTTAAAAT 


TAGAAAACGC 


ATAGTATCAG 


GTGTTCAATA 


TGTACTGCcC 


CCCAAAAGTT 


2160 


AGATTTTTTC 


TGTCTAACTT 


TTGGGGGCAG 


TTCATAAGAA 


CCTTGGTAAT 


ATGCGTTTTT 


2220 
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TGTGAGCTGA CTTATTTCCT TTCACTATAT CGCAAAATGA AATAAGAACG GAACCATGGG 2280 
ATTTTGGAAT TCAAATCAAT TTATAAGAAT GTTTTAGAAG TAATATTATC CTATTCCAGA 2340 
TTCAGTTCAC TATACAATTG ACTTTTCAAG CAACCTCTTT ACATAATCTG TACATAATTA 2400 
GGTTCGTGAT TCCACCCTTT TCACCTTTAA AAACCTCGCT TTCGCAAGGC TCTTCTATTT 2460 
ATAAGATAAG GCACGTTTAA AGCTTTTCCA AATCCCTAAA TCATCCGTTT GAAGAACGAG 2520 
ACTAGCATAC ATGCGTCCGA TAAATCCTGT TCCTACCACC GCAAAAATCA CTGTAATAGC 2580 

AAGTGAAATC CATCCTTCTG CTCCCCCCGC ATAGTCATTA ATCGTTCGAA ACGGCATAAA 2640 

GAAGGTCGAA ATAAAGGGAA TATAAGAACC AATCTTCAAG ACGAGATTGT CACCAGCTCC 2700 

ACCTAGAGCT GTCACTCCAA AAAAACCACC CATAATCAAA ATCATCAAAG GCGACAAGGC 2760 

TTTCCCTGAG TCCTCAGGAC GAGAAACCAT AGATCCTAGG AAGCCTGCCA AGACTACCTA 2820 

CATGAAAAGA CTCATCAAAA TAAAGAGCAA CGTATTCACT CAGATAGCAT CTCCCAAGTG 28 80 

ATCCAAAATA CCAGACTGAG CCAAGAATGG CAAATCTTTA AAG AG CAAAA CCGCAGCCAG 2940 

ACCACCTACA ACATAGATCC CAATATGCGT TAAAATCACT AGAAACAGAG CCATCATCCG 3000 

CGCATAGAAA TAGTGACTTG CCCTTATGCT AGAAAAAACG ACTTCCATAA TTTTGGTGCC 3060 

TTTTTCACTG GCAACTTCCT CAGCTGTTAC ACCCGCATAG GTAATCAGAA TCATATAAAG 3120 

AAAGAATCCT AAGGCACCTG CTGCAATTGT TTCAATAAAC TTTTTATTTT CCTTGGCTTC 3180 

ATCAATCTTT TCTGTGAATT GAATTGTCTG CCCTAACCCT TTTTCCTGCT CTTGAGACAA 3240 

CCAAGCAGTT CAACCATTAA GCTGATTTTG CAGTTCATTG AGTGTACCTG TAACCTCAAA 3 300 

TTTAATTCCA TTTTCAAGCG ATGTTTCGCC ATGATAAACT GCCTTTAGAA CACTATCTTC 3360 

TTGATCAATG GTCAAATAAC CTTTTAATTT TTCTTCTTTA ATTGCTTCTT TGGCACTTGC 3420 

TTCGTCTTTA TAGTCGAAGT TAACACCATT TACATTCTTC AGTCCTTCTG CTACAGATGG 3480 

CACTGTTGTC ACTACTGCCA CTTTATTATT TTTAGCCATA GAAGAACCTT GGAGATGCCC 3540 

AATTCCTACA GAGATTCCTA AAAAGAGGAA CGGCGAAATC ACCATAAAGA AGAAACTCCA 3600 

TGACTCGACA TGTCGAAGAT AGGTTTCCTT GATTACAACC CACATATTTC TCATACTTCC 3660 

ACTCCTGATT CTAGTTTAAA GATTTCATCG ATAGTTGGCG CTT G TTCGTC AAATGTTGCG 3720 

ATATATTGAC CTTGAGTCAA GATTGAGAAG AGTTCCCTTC CAGCGCTCTC ATCCTCCAAA 3780 

ATCAATTTCC AACTGCCTTG TTTGGTCAAG CTCACCTCTT TGACATGAGG AAGATTTTCC 3840 

AATTCTTCCT TGCTTCGTTC ACTTGAAACA AAGAGACGCG TTTTCCCGTA TTGATTGCGC 3 900 

ACATCCTGAA CTCGTCCGTG CAAGACCACA CGGCCATCTC GGATCATCAG AATATCGTCA 3960 

CAAAGTTCCT CAACATTGGT CATGACATGG TCAGAAAAGA TAATGGTTCT CCGCGCTCTT 4020 
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TTTCCTCAAA 


AATGACTTGT 


TTGAGCAATT 


CTCTATTAAC 


TGGGTCCAAT 


CCACTAAAAG 


4080 


CCTCATCCAA 


GATAATCAGG 


TCTGGTTCAT 


GAATCAGAGT 


AATAATGACC 


TGAATCTTCT 


4140 


VfV* A VM\ A * A W W 


TTTTGACAGA 


CTCTTGATTT 


TATCTGTCAG 


CTTTCCTTTC 


ACTTCCAACC 


4200 


TCTTCATCCA 


TTGAGGGAGT 


TTTTCTTTGA 


CTTCTTTGGC 


ATCCATGCCT 


TTTAGAGTCG 


4260 


^w*w\u i nuV>Vj 


njw* 1 1 U A 4 * ■ ■* 


AGAACTGTCA 


ATTTAGGCAT 


GAGATGCCTT 


CTTCAGGCAG 


4320 




f*ftAfif*ATAGG 


TCTCCTGACG 


AATATCCTGA 


CCATCCAGAC 


CGATTTCTCC 


4380 


CTGATATTCT 




AAATACTATG 


GAAAATCGTT 


GTTTTTCCAG 


CACCATTTTT 


4440 


TCCGAUiAljl 






CGCTTGAAAG 


TCAATACCAA 


ACAAAACTTG 


4500 


CTTGGATCCA 


AAAt 1 t i i V_ l 


\^ 1 AVJAVa 1 tV>A 


TACTTCTAGC 


ATCTTTCACC 


TCCGAAATTT 


4S60 


CTTGCACTCA 


TTATAWTv-t 1 


111 IvlM 1 /*V**i— 


CTTTACAATG 


TTTTTTGTCC 


ATTTTTAGAA 


4620 


GACTATTGCT 


GTGTAAAATA 




C* A CTTTT AT A 


CTCAATGAAA 


AT CAAAG AGO 


4680 


AAACTAGGAA 


GCTAGClGTA 




AGTACAGCTT 


TGACGTTGCA 


GATAAAACTG 


4740 


ACCAACTCQA 


LTLAAAnUAU 


TftTTTTC AG G 

1 W 1111 VAU\J 


TTGTGGATAG 


AACTGACGAA 


JcCrTAaCTAT 


4800 


ATCTACCGCA 


AGGCGAACTG 




AAGAGATTTT 


CGAAGACTAT 


TACTGATAAA 


4860 


TCCATTATAC 




TTAATTTATA 

1 1 /^r\ 111 1 ** 


CCTTCCCCTC 


CTCAACTGTC 


TATTTTTAAT 


4920 


I bAA I 1\» i 




nwiwwi * * * * 


CCTCGTAAAG 


TTTTCTTCCT 


CTAAAACTTC 


4980 


TuuAAAAAuu 


V. 1 t\J\ k nVf ill 


CAGACAACAT 


TTTTATAAGA 


AACAAGTTCA 


TCTGTCATTT 


S040 


l_ AAuAAvjOAU 


1 f\f\ 1 1 1 1 /\ 


TCTACTAATG 


GACGGAACAC 


AATTCAACCG 


CTTGTCCGAT 


5100 


ATGTTTTCTA 


AfVt ATT AT AT 

AUVJ M 1 1 1 1 


AGTAAAATGA 


AATAAGAACA 


GGACAAATTG 


ATCAGGACAG 


5160 


TCAAATTGAT 


TTTTAACAAT 


CTTTTAGAAG 


TAGATGTATA 


CTATTCTAGT 


TTCAATCTGC 


5220 




ATGCACACCC 


CTATAGCATC 


TAATGAAAAT 


CACAACAGGC 


TCATTCATAG 


5280 


ATGCTTACCT 


AAGCCTAAGG 


GAACTAAGAA 


AACGAClAl, l_ 






5340 


AAAGTAGATT 


AACAACTATC 


CTAAAAAATG 


CTTGAACTAC 


AAGTCCCCCA 


GAGAAGACTT 


5400 


CTGGATGACT 


AACTTGAACT 


TGAAATTTAG 


CAATAATTAA 


TTCACTATCT 


AACTATATTT 


5460 


AGTAATTATT 


TCAGAACTGA 


TTAATATTAA 


AATTAACTAA 


CAATTCAAAG 


GATTCATACT 


5520 


AGCCATAAAT 


TACGTCCATC 


AGAGAGAGAC 


TCTTACTACT 


TTTAGATTTT 


AGTCTTTCTA 


5580 


GCTTCAGAAT 


ACATCTAAAC 


TTTAGGGAAA 


ATGACTATTC 


GAAAGCGCGA 


ATGCCTCAAA 


5640 


ATTATCTCAG 


ATAAGCTATT 


CGAAACTTAG 


AATGCTTTTA 


AATTTATGGA 


ATTGCG ATT A 


5700 


TTCGAAACCT 


AGAATGCATA 


TAACCTTTAG 


TTGACAGACC 


TATTCTAACT 


CTCGAAGGGC 


5760 
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CTCATTCCCC 






5820 


TATAGTAGAA 


ATATACTATC 


TATGAGGAGT 


TTACATCTCA 






5880 


TCTTTCTCCC 


CTTCTGCAGC 


GAGTTATCAA 


m "fe 4Vft**tf*t^* ft i****^* 

TATCTCATCG 




GGGTTGGGAG 


5940 


TTTGATTTTC 


TGTATTTGGG 


CTTATCAGGC 


TGGGATTTTA 


CAATCCAAGG 




6000 


TGCCTTTATC 


CAGCAGGGAG 


GCATCTGGGC 


TCCACCTCTC 




X AC AG A"l J 1 L , l u I' 


6060 


ACAGACTGTC 


CTCCCTATCA 


TTCCAGGGGC 


CTTGACCTCG 


GTGGCTGGGG 


TCTTTATCTA 


OX4U 


CGCCCACATC 


ATCGCGACTA 


TCTACAACTA 


TATCGGCATC 


GTGATTGGCT 


GTGCCATTAT 




CTTTTATCTA 


GTGCGCCTAT 


ACGGAGCTGC 


CTTTGTCCAG 


TCTGTCGTCA 


GCAAGCGCAC 


D44U 


CTACGACAAG 


TACATCGACT 


GGCTAGATAA 


GGGCAATCGT 


TTTGACCGCT 


TCTTTATTTT 


O JUU 


TATGATGATT 


TGCCCCATTA 


GCCCACCTGA 


CTTTCTCTGT 


ATGCTGGCTG 


_~L_n__r*LrH-)~i ft ^ft#«ft ft 

CCCTGACCAA 


o30U 


GATGAGCTTC 


AAGCGCTACA 


TGACCATCAT 


CATTCTGACC 


AAACCCTTTA 


CCCTCGTGGT 


£ J* "J A 


TTATACCTAC 


GGTCTGACCT 


ATATTATTGA 


CTTTTTCTGG 


CAAATGCTTT 


CACACGTAAA 


6480 


AAATCCGTTT 


GGTTTCCCAA 


GTGGATTTTT 


AAAGCGTAGA 


TTAACTATAG 


CTTGATACTA 


6540 


AATATACTTT 


GGTATGGAAA 


TCATGCATAT 


TTTTCGATAG 


TGAGGCGAGG 


ACTTACCTAG 


6600 


CCTTTCCGCC 


GTGATAGAAA 


CACCTGAAAT 


CTAATGGTTT 


CAGGTATTCC 


C AAACTTTG A 


6660 


GCCTAGTGTC 


TCAAAGTTTA 


GGTATGGAAT 


TTTGAAGAAA 


GTCGCTACCG 


TCCGTAATCA 


6720 


CTTAAGGAAA 


GGCTCAAAAA 


TATTGTTTTC 


AACCACAAAA 


TCCGTTTGGT 


TTCCCAAGCG 


6780 


GATTTTGTGC 


TTTATTTTGA 


AACTTCTTTT 


GCAAGAACAA 


AGTTCCCAAG 


TGTGGCAGAA 


6840 


CCATTTCCTG 


CGACTGCTGG 


CGTCACCATA 


TAGTCACGCA 


CATCTGGTAC 


TGGTAGGTAA 


6900 


CCATTAAGAA 


GAGATGTAAA 


TTTCTCACGG 


ACACGCTCCA 


GCATATGTTG 


TTGAGCCATG 


070U 


ACCCCTCCAC 


CAAAGACAAT 


CACGTCTGGG 


CGGAAAGTCA 


CTGTCGCATT 


AACCCCACCT 


<v«U 


TGAGCGATAT 


AGTAGGCTTG 


AACATCCCAA 


ACAGGGTTGT 


TGAGTTCAAT 


AGTTTCCCCA 


/UBV 


CGTACACCTG 


TACGAGCTTC 


CAAACTTGGA 


CCAGCTGCAT 


AACCTTCTAG 


ACATCCCTTA 




TGGAAAGGAC 


AAACACCCTT 


AAACTCTTTT 


TCAATATCCA 


TTGGGTGTCT 


AGCAACATAA 


7200 


TAATGACCCA 


TTTCAGGGTG 


ACCCACACCA 


CCGATAAACT 


CACCACGTTC 


CATGACGCCT 


7260 


GCACCGATAC 


CTGTACCGAT 


TGTGTAGTAA 


ACCAACTTTT 


CGATACGACC 


ACCAGCATTG 


7320 


TTACGGGCAA 


CCATTTCACC 


GTAAGCAGAC 


CTGTTTACGT 


CTGTTGTGAA 


CTACATTCGC 


7380 


ACGTTTAGGG 


CGCGACGAAG 


GGCACCAAGC 


AAGTCTACAT 


TTGCCCAGTT 


TGGTTTTGGA 


7440 


CTCGTCGTGA TAAAGCCATA AGTTTTTGAG 


TTTTTGTCAA 


TATCAATCGG 


CCCAAATGAA 


7500 


CCAACTGCAA 


GACCAGCAAG 


GTTATCCAAT 


TTTGAGAAGA 


ACTCAATGGT 


TTTATCGATT 


7560 
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GTTTCCATTG 


GAGTTGTTGT 


TCCAAATTGT 




CJUOTTTAAA 


GTTT TC A*TC A 


7620 


CCGACAGCAC 


AGACAAACTT 


TGTACCGCCC 




TTP rATATAA 


TTTTGTCATG 


7680 


ATAAACCTCT 


TGTTTTTATT 


TTCTTTATTA 


TAGCATAtTl 




AATGTCTCTA 


7740 


TTTTTTAGAT TTTCCTCTGT AAATCTTACT 


ATCTAATAAA 




CATGTCATTT 


7800 


GTTCGT TTTC 


ACATTAGAGA 


GGATTGATTA 


GATTTTCACT 




CATCCCCCTT 


7660 


AGCAACTGAA 


CCTGTTGCGA 


CTGGAGCTAC 


TGAAGCGTAG 


TCACCTuTAT 


TTGTAACGAT 


7920 


AACCATTGTT 


GTATCATCAA 


GTCCAGCTCC 


AGCGATTTTG 


jiui_ru~ n ^«ff*^M K 

TTTGAGTCAA 




7980 


AACATCGCCA 


GCTTTCACCT 


TATTACCTTG 


ACCAACTTTT 


GTTTCAAAAC 


CGTCACCGTT 


8040 


CATAGATACA 


GTATCAATAC 


CAACATGAAT 


CAAAACTTCA 


GCACCATTTC 


TlVl l i 1A.AA 


8 100 


ACCAAAACCG 


TCCCCTGTTG 


GAAAGGCAAT 


TGAAACTTCA 


GCATCAGCTG 


GTCCATAuAC 




CACGCCTTGG 


CTTGGTTTCA 


CAACGATACC 


TTGTCCCATA 


GCTCCACTTG 


AuAAuAw IuVj 


B220 


GTCATTGACA TCAGCAAGAG 


CGACAACATC 


ACCGACGATA 


GGAGTTACAA 


GTGTTTCATT 


8280 


TTGAAGAGCT 


GCTGGCGCAA CTTCTTCTTT 


TTCTTCAGCC 


ACTTCAGCTC 


GTTTTGCAGC 


8340 


TGCAGTTGCG 


TCTACTTCAT 


CTTCGTAACC 


AAACATGTAA 


GTAAGAGC/\A 




8 400 


AAATGATACA 


GCTACCATAA 


GAACCTATTG 


TGGAACTTGT 


CCGTTACCAA 




84 60 


TGTACCAGGG 


ATGATGGTGA 


TACCATTACC 


AGTACCAGCA 


AGTCCAAGGA 


TMiAAuLw Art 


8520 


TCCACCACCG 


ATTGCACCAG 


CAATCAATGA 


AAGGAAGAAT 


CC n i ACCGA 




8580 


CACCCCGAAG 


ATAGCAGGCT 


CTGTAATACC 


TAGGAACCCA 


bAAAUAuLALi 




8640 


AACTGTTTTC 


AGTTTTGGAT 


TTTTTGTTTT 


AAC ACC AAC C 


GCAAWh*TAU 




8700 


AGCTGTCATA 


GCAGCTGTGA 


TGATAGCGTT 


CAATGGGTTA 






8760 


TTGCACTTCA 


AGCAAGTTGA 


AGATGTGGTG 




ALwAUwn i 


ATTGCTGAAC 


8820 


CCCACCAATC 


AACAAACCAC 


CAAGACCAAA 


TGGCATGCTA 


AGAATCGCTT 


TTGTAGCAAT 


QQQA 

0 0 o\j 


AAGGATGTAG 


TTTTCAACAA 


CGTGGAAAAC 


TGGTCCAATG 


ACAAAGAGTC 


CAAGGATAGA 


8940 


CATGACCAAA 


AGTGTCACGA 


ATGGTGTTAC 


CAAGAGGTCA 


ATGACATCTG 


GAACAACTTG 


9000 


CGGACAGCTT 


TTTCAAATTT 


AGCTCCGACA 


ACCCCGATGA 


TGAAGGCTGG 


AAGAACGGAA 


9060 


CCTTGCAAAC 


CAACAACAGG 


GATGAAACCA 


AAGAAGTTCA 


TCGCTGTTAC 


TTCACCACCT 


9120 


TGAGCAACTG 


CCCAACCGTT 


TGGAAGTGAG 


CCAGAGACAA 


GCATCATACC 


AAGAACGATA 


9180 


CCAACGGCAG 


GATTTCCACC 


AAATACACGG 


AAGGTTG AC C 


ACACAACCAA 


ACCTGGCAAG 


9240 


ATGATCAAGG 


CTGTATCTGT 


CAAGATTTGT 


GTGTAAGTTG 


CAAAGTCACC 


TGGAAGTGGC 


9300 
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ATTTCAACAG CGTTGAAAAG ACCACGCACA CCCATGAAGA GACCTGTCGC TACGATAACT 9360 

GGCATGATTG GAACGAAAAC ATCACCAAAA GTACCGATAG CACGTTGGAA CCAGTTCCCT 9420 

TGTTTAGCAA CTTCTGCTTT CATGTCATCC TTAGATGATG TTGGTAATCC AAGTACAACA 9480 

ACTTCATCGT ACATTTTGTT AACTGTACCT GTACCAAAGA TAATTTGGTA TTGCCCTGAG 9540 

TTAAAGAAAG CACCTTGAAC TTTTTCCAAG TTCTCAATCA CTTCTTTATT GATTTTCTCT 9600 

TCATCTTTGA CCATGACACG TAGACGAGTC GCACAGTGGG CAACACTATT GACATTTTCA 9660 

CGTCCGCCCA AGGCATCGAT GACTTTTTTT GCAATTTCCT GATTGTTCAT TTGCAAAAAT 9720 

CTCCTTATAT AACATTTT G T TCTTGTTTGA AACCGATTTT ATTCGCCGG 9769 



(2) INFORMATION FOR SEQ 10 NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3149 base pairs 
<B) TYPE: nucleic acid 
<C> STRAND EONESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CCCTTGACTG 


CTAATTCATA 


GTTCTATTGT 


ATCACTTGCT 


CAGAAATAAT 


CAAGAAAAAA 


60 


CTCTCACTTT 


CTCAAGATAA 


AAAGCCTCAG 


ACCAACTCAG 


ACTTTTTAAT 


TCTTAAAATG 


120 


GCAATTCTTC 


CTCTTCCAAG 


ACCAAATCTG 


CCAAATCTTG 


GCCTGCATTA 


TTTTCACGCA 


180 


TAGCACGTTG 


GGCACGACTT 


TCCAAGAGTT 


GGAATCCTGT 


GACAAGTACT 


TCGGTCACGT 


240 


ACTTCATTTG 


GCCATTTTTC 


TCAAAGCGAC 


GGGTACCCAA 


TTCTCCATCA 


ACGGAAATGA 


300 


GACTACCTTT 


GCTTGCGTAC 


TTGCCAAACT 


TTCTGCTAGT 


CTGCCCCATA 


GGACCATATT 


360 


GACAAAATCA 


GCTTCACGTT 


CACCGTTTTG 


GTCTTTGTAA 


CGACGGTTCA 


CAGCGATAGT 


420 


TGCTCCCGCT 


ACCCACTTCT 


CATTGTTGGT 


TTTGTCCAAT 


TCTGGTGTAG 


ACGTTAAACG 


480 


TCCAATCAAG 


ATAACTTTAT 


TATACATATT 


TTCTTCCTCC 


TACTTATCTA 


TTCGTAGGAA 


540 


ATCAAAAAAA 


GTTACAGAAA 


TTTGTAACTT 


TTCGAGAAAA 


TTTTTTATTT 


TTTATGAACC 


600 


ATGAAACCTG 


TCGCCTGTTG 


ATTGGCCATA 


ATGGTCATAT 


CTGTAATCTG 


AACACCACCA 


660 


GGTTGACTAG 


TCACATAGAC 


TACTGTATCT 


CCAATATCCT 


GAGCTTGCAA 


AGCTTCTATT 


720 


CCTTGGTAAA 


CGGACGCAGC 


TCGTTCTTTA 


TCACCATGAA 


AACGCACTGT 


AGAAAAATCT 


780 


GTTTCGACAA 


TTCCAGGCTG 


AATGGTCGTC 


ACCTTGATAT 


CCGTTCCGAT 


GGTATCAATT 


840 


CGCAGTCCAT 


CTGAAAAGGT 


CTTAACTGCC 


GCCTTGGTGG 


CTGAGTAAAC 


AGCTGCACCA 


900 


GCATAGGCAT 


AAATTCCTGC 


GGTTGACCCC 


ATATTGATAA 


TATGACCTTG 


ATTGGCTTTT 


960 
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AwwAJ luwlsi 


WW rv%W#>rtrt W a 


GCCACTGACT 


GCCATCAAAC 


CTTTGACATT 


GGTATCCAAC 


1020 


A 4 v»w x waww A 


4 n * w w aa w 4 w 


TTCATAGTCT 


TGATAGGGAG 


CTAAGCCAAG 


AGCCAGTCCT 


1080 


GCGTTATTGA 


CCAftftJVTnTf* 
UknuVin i W a W 


AATCTGACCT 


ATCGTTTCTA 


AAATATCAGA 


GCAGACAGTC 


1140 


TTTACCATTG 


T ft A T* & *T*Cf*ft*P 
1 Ln 1 A 1 www 1 


GACATCTAGG 


AGAAAAGTCC 




ATTTGGAAAA 


1200 


v> 11 1 w lULnn 


nL I LvO^L 1 A 


AAGACCTTCT 


AGTCTGTCTA 


TCCGTCGTCC 


TGTTAGAACG 


1260 


ACATCLIVAC 


ftftrvft'^ftftAft 


ATAAGCACCC 


GCAATCGCTT 


CACCGATTCC 


TGATCTCGCT 


1320 


^1 ntkJW ft ft ft 

CCTGTAATCA 


CAACATTTTT 


TGCCATCTTA TTTCCTTCTA 


ftft*pftftrc^H at* 

WW 1 WW A W A A A 


CAGATATTAA 


1360 


CAACTTCTTA 




TGTTTCGCTG 


GGTCGAACGG 


a w 1 A wwwaw^* 


ACTTGGTCTT 


1440 


j WIV* a *f*ft ft umij'i 

CTGATAATTC 


& Kf*r* ^ftftftft & 


CGTTTTTGTG 


GAGCATTTGG 


CAGATGCAAT 
w aw a a wv/Vi a 


TCACGAGGAC 


1500 


TGCACATCAT 


m A A ft ft ft ft ^^V*^ 

ACCAAAACTC 


TTTTCACCAC GAAGTTCACC 


1 wwOAAAA 4 w 


AW A A 1 WWW A A 


1560 


TTGGCATCAT 


AGCTCCAGGA 


AGCGCGACAA 


TGGTTTTCAA 


V,Lv. LAUnLui. 


CftATTftfigAC 
wwA 1 1 www AW 


X D«U 


CTCCTGCAAC 


GATTTGTACA 


GTCTTATCAC 


TTGCGACTGC 


& A/TTftftft Aft 

AM. 1 1 WWW Aw 


r\ 1 W 1 A Waww 4 


1680 


GCTCACTATC 


TGGATGGGCT 


ACCATCTCAA 


CAATTTCACC 




1 1 Aww 1 A WW 4 


1740 


TATCATTAAC 


AATTTCTTCT 


GTAAAACCTT 


CCGCCTGCAA 


L 1L 1 1 WW 14V. 


AAACG AGCCA 

AAA WW A WW W « 


1800 


CTTGCTCATC 


TGTCAaAAaw 


ACTTGACCCC 


CCTCTGCAAT 


TTCAAATAAJV 

4 1 W AAA A AAA 


CTTCAAACTT 

1^ A A W^*^V»W A A 


1860 


CGAAAATATT 


CCAAGCCACT 


GTTTCCCCAT 


TATCTTTGAG 


AAAAAWAWWW 


ft^ **p & ft ftH**i*ftft 

WW A AWW A 4 WW 


1920 


CTTTGCGCTC 


^» ft ft WW****^* ft ^ Nft 

CACATCCAGT 


TTCCCATCTC 


CGCTATTTTT 


W AwwA A WAWW 


ATAACCATAT 

A 4 AAWwAWA A 


1980 


CACCCACATG 


T'l'CTTTATTA 


TATGTAAAAA TCATTGTTTC 


PTTTTTPTPP 

W l A * 1 1 W A WW 


*P A T*PTC AftTft 

A AAA W^^^J 1 V* 


2040 


ft ft ft ft ft 

CTGCTAAAAA 


GTCATTGATT 


TGTTGCTTGC 


TTTTACGGTC 


CCCATTCACA 


AAACGACCGA 

1 W %W W V AW w^#^^ 


2100 


TTTCCTTGTC 


CTTTTCTAGA 


ACAACAAGGC TAGGAATTCC 


GTAAACATCC 


CAGAGTTTGG 


2160 


ftft A & iTOCAT 


AT AfTRaTCT 

A 1 AW 4 W A 1 W 1 


CGGTCCATTC 


GAATAAAGGT 


GAACTCTGGA 


TTGGTCTCCT 


2220 


CAATCTCTGG 


TAAGGCACGA 


TAAATATAAC 


GACAATCGCT 


ACACCAGTCT 


GCCACAAAAA 


2280 


TGAAGACCTT 


CTTGCCCGCT 


TTTTCCACTA AAGATCCTAA 


TTCTTCTAAA 


CTTGCTGGCT 


2340 


GTATCATAAG 


ACTTCCTCCT 


CATAGACTAG 


GTCTTCATTT 


TCATAGACAA 


AGGTATAATG 


2400 


ACGGCCATCC 


TCAAAAATGA 


CGCCACCAAC 


CAAGCTCTCC 


AGACTGCTTT 


CGTAAACTTG 


2460 


AACATAAAGG 


GTCGCAATTT 


CCCCCATGTC 


GGAAAAATGG 


TCTCGCACAA 


TCTCTGTCAA 


2520 


CTCTTCCTGA 


GTCTTCATGA 


GCTTACGGTC 


ATCTGCAACT 


TTTTTCGTAG 


CAAGAGCAAG 


2580 


G CTTCCGAT A 


CCTAGCAGAG 


CCAAGCCTGC 


CATCCACATT 


TTTTTAGCTT 


TCATACCATT 


2640 


CATTTTAACA 


CAAAAAAGGC 


TTCAGGACAA 


ATGAGGAAGC 


AGCAGAAAAG 


CAAGTAAAAA 


2700 
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GCCTCTTCCT TTAAGGAAAA GGACTTCTTA TACTCAATGA AAATCAAAGA CCAAACTAGG 27 60 

AAGCTAGCCG CACGCTGCTC AAACCACTGC TTTGAGGTTG TAGATAGAAC TCACGAgTCa 2820 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AGCTGACGTG GTTTGAAGAG ATTTTCGAAG 2 880 

ACTATTATTC TTATTGCCAG GCACCTAAGT TGCCAACGTA GTAACTATCA GGTGTGTAGG 2940 

TATTGCGAGC ATCTTACCTG ATGAAGCCAG ATAATACTAC TTGCCATTGT CTTTGACCCA 3000 

ATCATTCGCA ATCATGGAAC CAGAAGAACT TACATAATAC CATTCTCCCT TGTCATAAAC 3060 

CCAAGTACTG ACTTTCATGG TTCCTGAGCA ATTAAAGGCA AAAAAACTGT CCAATAACAT 3120 

TC G TTTTTTA AAAGCATTTG ACACTACAT 3149 
C2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CCAAAAATTC AACCTTTAAG GGGAGTCCAG AGAGACTCAC AAGGTCTCAG AT AAAAG AAT 60 

GGTGCAATTT TCTAGAGGAG A C TTTTT G AG TGTGCTCTCT TGTGTTGTAC GATTTTAACT 120 

GAGGCCTTGC ACTAGCAAGG TCTTTTCTTT ATCTGGTCCC CTTAAAATTT AAGGAGGAAA 180 

AGTTATGAAT CCCACATGTA AGAAGCCTTT GGGTGTCATT CGGTTGGAAA CCATGAAGGT 240 

CGTTGCACAA GAGCAAATCG CGCCACAATC TTTGAATTAC TCCTAGAAGG AGAAATGGTT 300 

GAAGCCATGC GAGCAGCCCA ATTTCTTCAT CTGCGTGTAC CGGACGATCC CCATCTCTTA 360 

CGTCGTCCTA TTTCAATTTC GTCTATTGAC AAGGCAAACA AGCACTGTCA CCTCATTTAT 420 

CGGATTGACG GAGCTGGGAC TGCAATTTTT TCAACCTTAA GTCAGGGAGA CACTCTTGAT 4 80 

GTGATGGCGC CTCAGGGAAA TGGTTTTGAC TTGTCTGACC TTGATGAGCA GAATCAGGTT 540 
CTCCTTGTTG GTGGTGGGAT TGCTCTTCCA CCCTTGCTTG AGGTGGCCAA CGAATTGCAT 
GAACGTGGAG TGAAAGTAGT GACAGTCCTC GCTTTTGCTA ATAAGGATGC TGTTATTTTG 

AAAACGGAAT TGGCTCAGTA TGCTCAGGTC TTTGTAACGA CAGATGATGG TTCTTATGGC 7 20 

ATCAAGGGAA ATGTTTCCGT TGTTATCAAT GATTTAGACA GTCAGTTTGA TGCTGTTTAC 780 

TCGTGTGGGG CTCCAGGAAT GATGAAGTAT ATCAATCAAA CCTTTGATGA TCACCCAAGA 840 

GCCTATTTAT CTCTGGAATC TCGTATGGCT TGTGGGATGG GACCTTCCTA TGCCTGTGTT 900 

CTAAAAGTAC CAGAAAACGA GACGCTCAGC CAACGCCTCT GTGAAGATGG TCCTGTTTTC 960 



600 
660 
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4 



CCCACACCAA 

W WW r% w n Wnn 


W#%W • 4 W A A * 


ATAAGGACAA 


AATTATf!AfT 

■* A A A UAL A 


ACAAATCGAT 


TACAAtTTTTV* 
A A A AW 


1020 


1 W A nw W * WW 4 


A & WW** & I i w#% 


AAAATCCGAT 

/WV% 4 w wwr» « 






1 1 www A A A UVl 


1080 


AwAAwAw AAA 


wV. w AAV* t nV. 1 


A 1 wA 1 1 1 AV»#\ 


WW 1 i 1 1 Aww A 


IvlAl A A lun 


A LMbuUUAL 


1 1 A ft 


AAWWW I iw\A 


LLALU 111 1 W 


rtftAATfVAAf* 


TfV* A Af" AfiTYt 
A ^.wiVNUnv A W 


CZC* AC AH A^Vt f" 


W 1 WW luVti AA 


lAUU 


wV_ 1^.AA I WW A 


AT A\A*w I IwW 


a a a atomw? 

AAAA X V.V. 


AAA AuA vw i A 


GTTTTGGCTG 


AAAAGCTACC 


ou 




AuACAATATL 


LAAAILI 1 WW. 


AAA 1 A 1 J uLL 


AA Aw 1 AVfV> Aw 


Oil i l^lwAAA 




A A Af R /' >^ » « 
AwAAwAw TAT 


GCAGCTGTTT 






ft pwi ft A'tV A A 

ALTAATG1 AA 


A A ^^tfn ft fftA/a ft 

AAGCTATCGA 


1380 


ww 1 V.AA 1 A I i 




A Aw miAU.A 


W lu 1 AA J A_A 1 


GGACTTTTGA 


>l»«^^^»t»y» ft ft ft 

TTGGTCAAGA 


1440 


TCtAGATTTG 


GCTTATGATG 


TGGTGAAAGC 


AGvTTGTGGAA 


'* A ft A^ 

GLLlVAGAAG 


TGCCAGTTTA 


1500 


ft ft MMft ft 

TGTCAAATTA 


ACCCCGAGTG 


TGACCCATAT 


CGTTACTCTC 


GCAAAAGCTG 


CAGAAGATGC 


1560 


GGGAGCAAGT 


GGCTTCACCA 


TGATCAATAC 


TCTGGTTGGA 


ATCCCCTTTG 


ACCTCAAAAC 


1620 


TAGAAAACCA 


ATCTTGGCCA 


ATGGAACAGG 


TGGAATGTCT 


GCTCCAGCAG 


TCTTTCCAGT 


1680 


AGCCCTCAAA 


CTCATCCGCC 


ft ft fHWil/* /*^*»ft 

AAGTTGCwwA 


ft ft ft ft ft R ^ 

AACAACAGAC 


CTGCCTATCA 


TTGGAATGGG 


1*740 


AGGAUTGGAT 


TCGGCTGAAG 


CTGCCCTAGA 


A A "TPT A TPTf 

AATGTATGTG 


GCTGGGGCAT 


CTGCTATCGG 


1 800 


ft ^ A ft ft A ft 

ACT 1 I GGAACA 


GCTAACTTTA 


CC AATC Li" A 


TGCCTGCCCT 


GACATCATCG 


AAAATTTACC 


1860 


AAAAGTCATG 


GATAAATACG 


GTATTAGCAG 


«TW *«IVT ft ft ^ ft ft 

TUTGGAAGAA 


CTCCGTCAGG 


AAGTAAAAGA 


1920 


CTCTCTGAGG 


^ ft R 1 MtAM ft ft 

TAAACTGCAA 


TCAAiVl\il 1' 


CTTGATTTTT 


TATTAGTTTC 


TAATATGAAT 


1980 


*'ny> ft /■"•('■♦ ft /™ x ft m 
I A MiOAd AAT 


TilGGTALAA 


*T» A a A A T ft. A A ^» 
1 AAAA I AAA 1 


A aril f**Af* Af* 

AAGnA L> At* AG 


/■» » ft ft ft /V'TT 

GAAGAAGGTT 


ft * <ft<» ft ft *** ft » ft 
AATGAAGAAA 


2040 


Vj I AAw A 1 1 1 A 


n i rn i auv. 


TCTGCTATTT 


XTCTTAGCTA 


u 1 WwAwAwww 


*T*f*f ft h<ft^C^T 




Aw 1 wA Iwu l A 


w 1 l\jOV.AnuVj 


AAAAV.Au 1 A 1 


PnY!AAAriAfi 
V* 1 uAaAuaau 


A luuLMj A \. A 


Aww AuCAAAT 




*jnu A w ww ill 


TTvt ata r**pr* a 

i 1 nL A w A 


1 1 A A w. AA Iwi 


TftRTT("T ATA 
A VAJ A I v. a a a a 


1 AAAAwWAwA 


TVHT A &r*TKT 
1 WW A AAw A A I 




GCTGAAAATG 


AATGGCTAAA 


GCAAGGTGAC 


GACTATTTTT 


ACCTCAAATC 


TGGTGGCTAT 


2280 


ATGGCCAAAT 


CAGAATGGGT 


AGAAGACAAG 


GGAGCCTTTT 


ATTATCTTGA 


CCAAGATGCA 


2340 


AACATGAAAA 


GAAATGCTTG 


GCTAGGAACT 


TCCTATGTTG 


CTGCAACAGG 


TCCCAAAGTA 


2400 


ATAGAAGACT 


GGGTCTATGA 


TTCTCAATAC 


GATCCTTGGT 


TTTATATCAA 


AGCAGATGGA 


2460 


CAGCACGCAG 


AGAAAGAATG 


GCTCCAAATT 


AAAGGGAAGG 


ACTATTATTT 


CAAATCCGGT 


2520 


GGTTATCTAC 


TGACAAGTCA 


GTGCATTAAT 


CAAGCTTATG 


TGAATGCTAG 


TGGTGCCAAA 


2580 


GTACAGCAAG 


GTTGGCTTTT 


TGACAAACAA 


TACCAATCTT 


GGTTTTACAT 


CAAAGAAAAT 


2640 


GGAAACTATG 


CTGATAAACA 


ATGGATTTTC 


GAGAATGGTC 


ACTATTATTA 


TCTAAAATCC 


2700 
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GGTGGyTACA 




TCAATGGATT 


328 
TGGGATAAGG 


AATCTTGGTT TTATCTCAAA 


2760 


wm A #lft^*^*^ ft 

TyTGATGGGA 


AAAT luCiwA 


AAAAGAATGG 


GTCTACGATT 


CTCATACTCA AGCTTGGTAC 


2820 


TACTTCAAAT 


CCGGTGGTTA 


CATGACAGCC 


AATGAATGGA 


TTTGGGATAA GGAATCTTGG 


2880 


TTTTACCTCA 


AATLi wA 1 vVj 


RAAAATAGCT 


GAAAAAGAAT 


GGGTCTACGA TTCTCATAGT 


2940 


CAAGCTTGGT 




ATCTGGTGGC 


TACATGGCGA 


AAAATGAGAC AGTAGATGGT 


3000 


TATCAGCTTG 




TAAATCGCTT 

4 A WVi» * * 


GGAGGAAAAA 


CTACAAATGA AAATGCTGCT 


3060 


TACTATCAAC 


TAGTGCCTGT 




GTTTATGATX 


CAGATGGTGA AAAGCTTTCC 


3120 


TATATATCGC 


AAGGTAGTGT 


I.W Iniuwv * 


GATAAGGATA 


GAAAAAGTGA TGACAAGCGC 


3180 


TTGGCTATTA 


CTATTTCTGG 


jl\ji^.Aw\jv» 


TATATCAAAA 


CAGAAGATTT ACAAGCGCTA 


3240 


GATGCTAGTA 


AGGACTTTAT 


LU> 1 1 A 1 1 #\ A 


CAGAGTCATG 


GCCACCGTTT TTATCACTAT 


3300 


GTGGCTCAGA 


ATGCTAGTAT 


CCCAOTAUV. 1 


•Pf*T f*AT t**I*"l"r 


CTGATATGGA AGTAGGCAAG 


3360 


AAATATTATT 


CGGCAGATGG 


CCTGCAll * i 


w/\ 1 ww i i * * *» 


AGCTTGACAA TCCCTTCCTT 


3420 


TTCAAACATT 


TAACAGAGGC 


TACAAACTAt. 




AATTGGATAA GGTATTTAGT 


3480 


TTGCTAAACA 


TTAACAATAG 


CCTTTTGwAw 


AA£*AACGGCG 

S\/\ f» /V* WWW*- VI 


CTACTTTTAA GCAAGCCGAA 


3540 


GAACATTACC 


ATATCAATCC 


TCTTT ATCTC 




GTGCCCTACA AAGTAACTGG 


3600 


GGAAGAAGTA 


AAATTGCCAA 


AGATAAGAAT 


AA 1 1 1 * i w 


GCATTACAGC CTATGATACG 


3660 


ACCCCTTACC 


TTTCTGCTAA 


GACATTTwAT 


ATTVTCIG AT A 

A w • wwn « *» 


AGGGAATTTT AGGTGCAACC 


3720 


AAGTGGATTA 


AGGAAAATTA 


TATCGATAGw 


rrAAf.iACTT 

wwAAw/VtW I * 


TCCTTGGAAA CAAGGCTTCT 


3780 


GGTATGAATC 


TGGAATATGC 


TTCAGACCwi 


T A TTCCCGCC 
1 /\ 4 iwuwwww 


AAAAAATTGC TAGTGTGATG 


3840 


ATCAAAATCA 


ATGAGAAGCT 




n ATT ACT ACT 


ATAAGTGAAT ATCATTTGAG 


3900 


TGAATAGTAA 


GTTAAAAATC 




GTAAAATCAG 




3960 


TTTTTTGGAG 


TCTGGTGTGA 




CTTTTGTCCT 


GTGTAAGTGA CAAAGCCGGG 


4020 


TTTTCCACCA 




lunu i i x i i i 


GACTTCAATC 


ATATCTACCT GC ACC AGATT 


4080 


CGACAGGCGC 


CCTTGAGAGA 


AGTAGGCAGC 


TAACTCTGCT 


GCGTCTGTCT TGACTCCATC 


4140 


AGATGGGTCA 


AGATTTCCTG 


AGATGACAAC 


ATGGCTTCCA 


CGAATGTCCT TAGCATGGAA 


4200 


CCAAAGTTCC 


TCCTTGCGGG 


CCATTTTAAA 


GGTCAATTCC 


TCATTTTGAA GATTGTTTCG 


4260 


TCCGACATAC 


ATGATGGTTT 


TGCCATCCCT 


TGCTACATAT 


TGTTCTAGTT TTTT GCGTTT 


4320 


CTGGATTTTC 


TCC CGTTGTC 


TTCTGCGGAT 


AAAACCTGTT 


TGAATCAATT CTTCACGGAT 


4380 


TTCAGCGATT 


TCTTCCAGTC 


CAGCTTGGTT 


GAGGACGGTT 


TCTACACTTT CCAGATAGAG 


4440 


AATAGTGGCT 


TTGGTTTCTT 


CAATCAAATC 


AGTCAAGTAT TTCACACCTT CTTTGAGTTT 


4500 
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CTGATACCGT 


TTAAAATAGC 


GTTCCGCATT 


CTGGTTGGGA 




TATwAACCuW 


4560 


AATCATGATA 


GGTTGGTTGG 


TATAGTAGTT 


GTCTAGGATA 


ACCTGGTCTT 


Go i C\» i l Auu 




CACTTGGTGG 


AGGAAGGTTG 


TCACCAATTC 


TCCTTTTTGA 


CGAAATTCTT 


CAGCGTTGTC 


a can 


TGTCGCCAGT 


AACTCTTTTT 


CCTGTTTTTT 


GAGTTTGTGT 


CGGTTTTTCT 


GAAGTTCATT 


A "1 A f\ 


TTCAACACCA 


CGAATCAGTT 


CACTGGCCTC 


CTCTTTGACG 


CGGTCGCGCT 


CAGCCTTATC 


4800 


CTTATAGTAC 


GTGTCCAACA 


AATCAGAAAG 


ATTTGCAAAA 


GGCTCTCCCA 


CCTGATTTGC 


4860 


AAAAGGAACT 


GGACTGAAGG 


AACTCTCAGT 


CAAGCATGGC 


TTCGTTTCTT 


GATTGAAAAA 


4920 


ATTTCCGAAA 


GCGCAAACTT 


TTTCACTAAC 


CAGTATCCTT 


TCCAATTCAT 


TTGCCGTATC 


4980 


GCGTCCCAGA 


CCTTGAAAGA 


GGCTTTGAAG 


ATTTTTTGCT 


GTTAGTTCTT 


GGGTTTGCAG 


S040 


GATTTCAAAG 


AGCTTTTCAT 


CCTTGATACT 


AAAAGGATTG 


AGAGATTTTC 


TACTTGGCGC 


5100 


AGCGATATAG 


GTCGATCCTG 


GAAGTAAGGT 


GCCGTAGCTA 


TTTTGTGAAA 


AGCCGACCTG 


5160 


TTTGATAACT 


TCGAGGATTT 


TATGACTGCT 


TTTATCGACC 


AGTAGAATAT 


TACTGTGTTT 


5220 


CCCCATAATT 


TCGATAATCA 


AGCTAGCCTG 


GATATGGTCT 


CCAATCTCGT 


TTTTATTGGA 


5260 


AACTCTAATT 


TCCACAATAC 


GGTCATTTTC 


CACTTGCTCA 


ATCGACTCAA 


TCAGGGCCCC 


5340 


CTGCAAATAC 


TTTCTCAAAA 


CCATGATAAA 


CGTAGAACCT 


TGAGCTGGAT 


TTTCAAAACT 


5400 


CGTTTGGGTC 


AGCTGAATGC 


CTCCAAAAAC 


TGGATGGGCA 


GAAAGGAGCA 


GCCGATGGCT 


5460 


TTGGCCATTC 


CTGCGGATTT 


GCAACACCAA 


CTCTTGTTCA 


AAAGCCTGAT 


TGATTTTCTG 


5S20 


G ATGCG AC C A 


TTCACTAATT 


CCCTTCGCAA 


TTCCTCAACT 


ATGTGCTCTA 


AAAAAAATCC 


5580 


GTCAAATGAC 


ATCGTTCTCT 


CCTTGTCATT 


GTATTCCATA 


GTATTATATC 


AAAAAGGTAG 


S640 


AATAAAATCA 


TGGAAATGTG 


GTATAATAAA 


GCCAAGTAAA 


GAGAAACGAG 


AAGCACATGT 


5700 


ATATTGAAAT 


GGTAGATGAA 


ACTGGTCAAG 


TTTCAAAAGA 


AATGTTGCAA 


CAAACCCAAG 




AAATTTTGGA 


ATTTGCAGCC 


CAAAAATTAG 


GAAAAGAAGA 


CAAGGAGATG 


GCAGTCACTT 


5820 


TTGTGACCAA 


TGAGCGTAGT 


CATGAACTTA 


ATCTGGAGTA 


CCGTAACACC 


CACCGTCCGA 


5880 


CAGATGTCAT 


CAGCCTTGAG 


TATAAACCAG 


AATTGGAAAT 


TGCCTTTGAC 


GAAGAGGATT 


S940 


TGCTTGAAAA 


, TTCAGAATTG 


GCAGAGATGA 


TGTCTGAGTT 


TGATCCCTAT 


ATTGGGGAAT 


6000 


TGTTCATCTC 


TATCGATAAG 


GCTCATGAGC 


AGGCCGAAGA 


ATATGGTCAC 


AGCTTTGACC 


6060 


GTGAGATGGG 


CTTCTTGGCA 


CTACACGCCT 


TTTTACATAT 


T AACGGCT AT 


GATCACTACA 


6120 


CTCCGGAAGA 


AGAAGCGGAG 


ATGTTCGCTT 


TACAAGAAGA 


AATTTTGACA 


GCCTATGGAC 


6X80 


TCACAAGACA 


ATAAACGAAA 


ATGGAAAAAT 


CGTGACTTGA 


TATCCACTTT 


AGAATTTGCT 


6240 
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TTGACAGCTA TTTTTACTGC TATCAAGGAA GAACGCAATA TGe©*A.^CA CGCAGTGACG 6300 
GCTCTAGTGG TCATCCTTGC AG G TTTT G TT TTTCACCTCT CACGAATCGA ATGGCTCTTT 6360 
CTCCTATTGA GTATTTTCTT CGTAGTAGCC TTTCAGATTA TCAACTCTGC TATTGAAAAT 6420 
GTGCTGCATT TGGCCAGTCA CTATCACTTT TCCATGCTGG CTAAAAATGC CAAGGATATG 6480 
GCGGCCGGCC CGGTATTACT G G TTTCTCTT TTCGCAGCCT TAACAGGCGC ATTGATTTTT 6540 
CTCCCACGAA TCTGGGATTT ATTATTTTAA ACAGTAAGAG GAAATTATGA CTTTTAAATC 6600 

AGGCTTTGTA GCCATTTTAG GACGTCCCAA TGTTGGGAAG TCAACCTTTT TAAATCACGT 6660 

TATGGGCCAA AAGATTGCCA TCATGAGTGA CAAGGCGCAG ACAACGCGCA ATAAAATCAT 6720 

GGGAATTTAC ACGACTGATA AGGAGCAAAT TGTCTTTATC GACACACCAG GGATTCACAA 6780 

GCCTAAAACA GCTCTCGGAC ATTTCATGCT TGAGTCTGCC TACACTACCC TTCGCGAAGT 6840 

GGACACTGTT CTTTTCATGG TGCCTGCTGA TGAAGCGCGT GGTAAGCGCC ACGATATGAT 6900 

TATCGAGCGT CTCAACGCTG CCAAGGTTCC TCTGATTTTG GTCGTGAATA AAATCGATAA 6960 

GGTCCATCCA CACCACCTCT TCTCTCAGAT TGATGACTTC CGTAATCAAA TGGACTTTAA 7020 

GGAAATTGTT CCAATCTCAG CCCTTCAGGG AAATAACGTG TCTCGTCTAG TGCATATTTT 7080 

CAGTCAAAAT CTGGATGAAG GTTTCCAATA TTTCCCGTCT GATCAAATCA CAGACCATCC 7140 

AG AA CGTTTC TTGGTTTCAG AAATCGTTCG CGACAAAGTC TTGCACCTAA CTCGTGAAGA 7200 

CATTCCGCAT TCTGTAGCAG TACTTCTTCA CTCTATGAAA CGAGACGAAC AGACAGACAA 7260 

GGTTCACATC CGTGCAACCA TCATCGTCGA GCCCGATAGC CAAAAAGGGA TTATCATCGG 7320 

TAAAGGTGGC GCTATGCTTA AGAAAATCGG TAGCATGGCC CCTCGTGATA TCGAACTCAT 7380 

GCTAGGACAC AAGGTCTTCC TAGAAACCTG CGTCAAGGTC AAGAAAAACT GGCGCGATAA 7440 

AAAGCTAGAT TTGCCTGACT TTGGCTATAA TGAAAGAGAA TACTAACTAG AGGTAGGCTC 7500 

ATGCCTGCTT CTTGTTTTTA CAGAAGGAGG ACTTATGCCT GAATTACCTG AGGTTGAAAC 7560 

CGTTTGTCGT GGCTTAGAAA AATTCATTAT AGGAAAGAAG ATTTCGAGTA TAGAAATTCG 7620 

CTACCCCAAG ATGATTAAGA CGGATTTGGA AGACTTTCAA AGGCAATTGC CTAGTCAGAT 7 680 

TATCGAGTCA ATGGGACGTC GTGGAAAATA TTTGCTTTTT TATCTGACAG ACAACCTCTT 7740 
CATTTCCCAT TTGCGGATGG AGGGCAACTA TTTTTACTAT CCAGACCAAG GACCTGAACG 7800 

CAAGCATGCC CATGTTTTCT TTCATTTTGA AGATGGTGCC ACGCTTGTTT ATGAGGATGT 7860 

TCGCAACTTT GGAACCATGG AACTCTTGGT GCCTGACCTT TTAGACGTCT ACTTTATTTC 7920 

TAAAAAATTA CGTCCTGAAC CAACCGAACA AGACTTTGAT TTACAGGTCT TTCAATCTGC 7980 

CCTTGCCAAG TCCAAAAAGC CTATCAAATC CCATCTCCTA GACCAGACCT TGGTAGCTGG 6040 
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ACTTGGCAAT 


ATCTATCTCG 


ATGAGGTTCT 


CTGGCGAGCT 


CACCTTCATC 


CAGCTAGACC 


8100 


TTCCCACACT 


TTCACAGCAG 


AAGAAGCGAC 


TGCCATTCAT 


GACCAGACCA 


TTGCTGTTTT 


8160 


GGGCCAGGCT 


GTTGAAAAAG 


GTGGCTCCAC 


CATTCGGACT 


TATACCAATG 


CCTTTGGGGA 


6220 


AGATGGAAGC 


ATGCAGGACT 


TTCATCAGGT 


CTATGATAAG 


ACTGGTCAAG 


AATGTGTACG 


8280 


CTCTGGTACC 


ATCATTGAGA 


AAATTCAACT 


AGGCGGACCT 


GCAACCCACT 


TTTCTCCAAA 


8340 


CTGTCAAAGG 


AGGGACTGAT 


GGGAAAAATC 


ATCGGAATCA 


CTGGGGGAAT 


TGCCTCTGGT 


8400 


AAGTCAACTG 


TGACAAATTT TCTAAGACAG CAAGGCTTTC 


AAGTAGTGGA 


TGCCGACGCA 


8460 


GTCGTCCACC 


AACTACAGAA 


ACCTCCTGGT 


CGTCTGTTTG 


AGGCTCTAGT 


ACAGCACTTT 


8520 


GGGCAAGAAA 


TCATTCTTGA 


AAACGGAGAA 


CTCAATCGCC 


CTCTCCTAGC 


TAGTCTCATC 


8580 


TTTTCAAATC 


CTGATGAACC 


AGAATGGTCT 


AAGCAAATTC 


AAGGGGAGAT 


TATCCGTGAG 


8640 


GAACTGGCTA 


CTTTGAGACA 


ACAGTTGGCT 


CACACAGAAG 


AGATTTTCTT 


CATCGATATT 


8700 


CCCCTACTTT 


TTCAGCAGGA 


CTACAGCGAT 


TGGTTTGCTG 


AGACTTGGTT 


GGTCTATGTG 


8^60 


GACCGAGATG 


CCCAAGTGGA 


ACGCTTAATC 


AAAAGGGACC 


AGTTGTCCAA 


AGATGAACCT 


8820 


GAGTCTCGTC 


TGGCAGCCCA 


GTCGCCTTTA 


GAAAAAAAGA 


AAGATTTGGC 


CAGCCAGGTT 


8860 


CTTCATAATA 


ATGGCAATCA 


GAACCAGCTT 


CTTAATCAAG 


TGCATATCCT 


TCTTGAGGGA 


8940 


GCTAGGCAAG 


ATGACAGAGA 


TTAACTGGAA 


CGATAATCTG 


CGCATTGCCT 


GGTTTGGTAA 


9000 


TTTTCTGACA 


GGACCCAGTA 


TTTCTTTGGT 


TGTACCTTTT 


ATGCCCATCT 


TCCTGGAAAA 


9060 


TCTAGGTGTA 


GGGAGTCACC 


AACTCGCTTT 


TTATGCAGCC 


TT AGCAATTT 


CTCTCTCTGC 


9120 


TATTTCCGCG 


GCGCTCTTTT 


CTCCTATTTG 


CGGTATTCTT 


GCTGACAAAT 


ACGGCCGAAA 


9180 


ACCCATGATG 


ATTCGGGCAG 


GTCTTGCTAT 


GACTATCACT 


ATGGGACGCT 


TGGCCTTTGT 


9240 


CCCAAATATC 


TATTGGTTAA 


TCTTTCTTCG 


TTTACTAAAC 


GGTGTATTTG 


CAGGTTTTGT 


9300 


TCCTAATGCA 


ACGGCACTGA 


TAGCCAGTCA 


GGTTCCAAAG 


GAGAAATCAG 


GCTCTGCCTT 


9360 


AGGTACTTTG 


TCTACAGGCG 


TAGTTGCAGG 


TACTCTAACT 


GGTCCCTTTA 


TTGGTGGCTT 


9420 


TATCGCAGAA 


TTATTTGGCA 


TTCGTACAGT 


TTTCTTACTG 


GTTGGTAGTT 


TTCTATTTTT 


9460 


AGCTGCTATT 


TTGACTATTT 


GCTTTATCAA 


GGAAGATTTT 


CAACCAGTAG 


CCAAGGAAAA 


9540 


GCCTATTCCA 


ACAAAGGAAT 


TATTTACCTC 


GGTTAAATAT 


CCCTATCTTT 


TGCTCAATCT 


9600 


CTTT'ITAACC 


AGTTTTGTCA 


TCCAATTTTC 


AGCTCAATCG 


ATTGGCCCTA 


TTTTGGCTCT 


9660 


TTATGTACGC 


GACTTAGGGC 


AG ACAGAG AA 


TCTTCTTTTT 


GTCTCTGGTT 


TGATTGTGTC 


9720 


CAGTATGGGC 


TTTTCCAGCA 


TGATGAGTGC 


AGGAGTCATG 


GGCAACCTAG 


GTGACAAGGT 


9780 
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GGGCAATCAT CGTCTCTTGG TTGTCGCCCA GTTTTATTCA CTCATCATCT ATCTCCTCTG 9840 

TGCCAATGCC TCTAGCCCCC TTCAACTAGG ACTCTATCGT TTCCTCTTTG GATTGGGAAC 9900 

CGGTCCCTTG ATTCCCGGGC TTAATGCCCT ACTCAGCAAA ATCACTCCCA AAGCCGCCAT 9960 

TTCGAGCGTC TTTGCCTTCA ATCAGGTATT CTTTTATCTG GGAGGTGTTG TTGGTCCCAT 10020 

GGCAGGTTCT GCACTAGCAG GTCAATTTCG CTACCATGCT GTCTTTTATG CGACAAGCCT 10080 

TTGTGTTGCC TTTACTTGTC TCTTTAACCT GATTCAATTT CGAACATTAT TAAAAGTAAA 10140 

GGAAATCTAG TGCGACTAAA AATCAATCTC AAATGCTCCT CTTGTGGCAG TATCAATTAC 10200 

CTAACCAGTA AAAATTCAAA AACCCATCCA GACAgATTGA 10240 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13206 base pairs 
<B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CGCTTTATCC 


TGGACGTGGT 


CAACCCGACA 


ATTTCATCAA 


GGAGATGAAG 


G AGGGATT TT 


60 


TTGGCG AT AA 


AACGGATAGT 


TCAACCTTAA 


TCAAAAACGA 


ACTTCCTATG 


ATGATGAGCT 


120 


GTATCGCCTA 


CAATCTCTAT 


CTTTTTCTCA 


AACATCTAGC 


TGGAGGTGAC 


TTCCAAACTT 


180 


TAACAATCAA 


ACCCTTCCCC 


CATCTTTTTC 


TTCACCTGGT 


GGGAAAATCT 


CTTCGAACAG 


240 


GACCCAAGCA 


CCTCCTCAAA 


TTGTCTAGTC 


TCTATGCCTA 


TTCCCAATTG 


TTTTCAGCAC 


300 


TTTATTCTAG 


CATTAGAAAA 


GTCAACCTGA 


ATCTTCCTCT 


TCCTTATGAA 


CCACCTAGAA 


360 


GAAAAGCGTC 


GTTAATGATG 


CATTAAAGAA 


CAGTCGAGAT 


GAAAAAATCG 


TGTGACGCAC 


420 


CAAGGGAGGA 


GTCTGCCCTT 


TTGAGGAAAT 


CTAGCGAGGA 


AAAACGATAC 


TGGAACAGCA 


460 


GAAAGTAAAA 


CTGACCTCAT 


CAGGAGCAAG 


AAACTGGCTC 


ATGAGGTCAG 


GGGTTTTCl'A 


540 


AGTTACATCT 


ACTTGAGAGA 


GGTATGAATC 


ATTTGGGATT 


AATCATTTCT 


TGTTTTAAAT 


600 


CAGGAGAATA 


GTAACGATTT 


TTTCCTTTTT 


TCACGAACTC 


TATTCCGTAA 


CGATCAATCA 


660 


ATTTAATCAT 


GTACCTAATA 


TTACAATTGT 


TTATCCCAAA 


TTTATTTGAA 


AGCTTCTCTA 


720 


AGCTATATCC 


TTGTTTTCTA 


AGTTCATAGA 


TCTGAACTTT 


ATCATCATAA 


GTTAGTTTCA 


780 


TAATAAAAAC 


AC CCC AAAAG 


TTAGATTTTT 


TCTGTCTAAC 


TTTTGGCGGC 


CAGTTCATTC 


840 


AACACCTGAT 


ACTATGCGTT 


TTTCTTATTT 


GAAATACTTT 


TTACTCAACC 


TCTTTATACT 


900 


CAATGAAAAT 


CAAACTGCAA 


ACTAGAAAGC 


TAGCCTCAGG 


CTGCTCAAAA 


CAGTGTTTTG 


960 
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AGGTTCCAGA 


TGGAAGCTGA 


CGTGGI i Iwn 


A fl Xfl & TTTTC" 


GAAGAGTATT 


ACTTAATCTT 


1020 


CTTGATACTT 


TGACTAAGAA 


TAAATCCTAC 


AA fLA 1 1 


& (*CAT WITT 


GCATAAAATT 


1080 


CGGTAGAATT 


TCTGGGAGGG 


CTGCTGCCCA 


GCCAl l U\ 1 v. 


AAACCACAAC 


CCAAGGCGTA 


1140 


GCCTCCTACC 


ATGGC AAT AG 


TTGCTAAAAT 


m ft fff+f^* ft. f"» 

AAGGCCTAAW 


w-/\w l\*/\w i 4 1 


T*T*1^^*P*I > *P ft ft ft 


1200 


TCCTGCGAAA 


AATCCCTGCA 


AGCCATGGTT 


^M. — ft ft ^ ft. 

GACCAAGCTA 


/\MV*r\rtw>* * w w 




1260 


GCCTGATAAG 


AGGTCAATC A 


AGAAACTTGC 


TAGTCCTCCG 


Aw 1 /\wwOw 1 w 


rTTCACG ACT 


1320 


ACCAAAGTAA 


AAGGCCGCAA 


AGAAGACACC 


AGCATCTAAA 






1380 


TGTTGGGATT 


TTTAAGAAAT 


AACCTACAAC 


CACAGAAAGG 


GCGGTTAATA 




1440 


GGCGATTTTA 


CTTGTTTTTG 


TTTGCTTCAT 


ATTGTCTTAC 


TCCATACTGA 


TCTGCTTGTG 


1500 


CAATAGCACG 


ATAAACGAAA 


GCCTTAGAGC 


TTTCTACTGC 


JTfc^^* A ft ft ft ft ^**f* 

TGGCAAAAGT 


TTATCACCTT 


1560 


TAACCAGGTG 


ACTGGCAATG 


CTAGAGsCAA 


AGGTACAACs 


TGCACCAGCA 


TTTTGGCCTT 


1620 


GGATAACTGG 


ATTTTCTAGG 


ATAGTAAAGG 


TCTGTCCATC 


ATAAAAGACA 


TV.WACAuV.vT 


1680 


TGTCCTGACT 


AAGACGATTG 


CCTCCCTTGA 


TAATGACTGC 


_|— i j — J— ij — *MH^^f**ft 

GGCGCTCCTA 


AATCATGwAA 


174 0 


TTTCTCCGCT 


GCACTTTTCA 


TGTCTTCCAA 


GGTTTTAATT 


TCCTGACCGG 


A a ftn 1 /\A I 1 w 


1600 


TGCTTCTGGG 


AGATTAGGCC 


TAATCACACT 


GACATAAGGG 


A A ft ft ft ^ ft ft 

AAAAAGCGAA 


I LAnL 1 w 1 i w 


1860 


GCACAGCTCA 


CTGACAGCTA 


CATCATGCGT 


TTCCTTGCAG 


ACwAAGACAG 


UAl w>w*\/*W#*V 


1920 


CACACGTACT 


CCTGGGCGTT 


GTTTGATAAA 


GTCCAAGGCC 


TTCTCAGCCA 




1980 


ACGGAGAAGA 


CCAATCTTAA 


TTCCCCCAAA 


TTCCACATCA 


CGCAAGCTAl 


r"T A ATTf* ATG 


2040 


TTGAAAAATG 


GTATCATCAG 


TTGGAAAGAC 


TTCAAATCCT 


TTTTCTGTCA 


ACCCTCI CAA 


2100 


AtAAGTCACT 


GCTACAAACC 


CATGCAAGCC 


GTTCAAGGT . * 


• • ."*w*wV»/*. 


AATCAGCTGA 


2160 


CAGTCCACCA 


CCACTAAAAA 


TATCATTTCC 


m ft ft ft pnVf »y 

AGAAAGTGCT 


& & & AT AfVI AT 
iww 1 *\wv^rv i 


TATTCTTCAT 


2220 


AACGAATCTC 


CTTTAAATAC 


AAACCATTTG 


CTGCTGCAGT 


GGGACCTGCA 


AGTTGCCTGT 


lion 


CCTTCTTCTC 


CAAGATGAGA 


TCAATCTGCT 


CTACTGGCAT 


GCGGTTGTTA 


CCGATTTTGA 


2340 


GAAGAGTCCC 


CACCATATTG 


CGAATCTGTT 


TATACAAGAA 


ACCATTTCCT 


GAAAAGGTAA 


2400 


.AGCTCAAAAA 


TTCTCCTCTC 


TCATCGACTA 


TTAAACTAGC 


TTCTGTGATG 


GTGCGAACCT 


2460 


TATCCTCTAC 


ACTAGTCCCA 


GAGGCTGTAA 


AACCGGTAAA 


ATCATGGGTT 


CCCTCTAGCT 


2520 


TTTTCATTGC 


AATCTGC ATT 


CGTTCCACAT 


CGAGTGGGTA 


GGGAAAGTGG 


GTGGCATAGT 


2580 


CACGGCGCAT 


CGGATTTTTG 


GGACGTCCTC 


TATCCACAGT 


AAACTCATAG 


GTCTTGCTAT 


2640 


GCTTGGC ATA 


ACGGCAATGA 


AAATCATCTG 


CCACAAGCTC 


AATCGAAATC 


ACATCAATAT 


2700 



V 



DKicrwin- ^wn oniaonA? i 
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CTTCAGGAGA 


CTGGGTATCC 


AAGGCAAAAC 


GGAGTTTCTC 


CTCATCCATC 


TGATAAGGCA 


2760 


GGTCAAAATC 


AATCACCTGT 


CCCAGGGCAT 


GAACCCCACT 


ATCTGTCCTA 


CCACCACCCT 


2820 


CAACAGTAAT 


GGCTTGCCCT 


TTATTTAATC 


TGGTCAAGGT 


TTTTTCAATT 


TCTTCCTGAA 


2880 


CGCTACGCGC 


ATGAGGCTGG 


CGCTGAAAGC 


CAGCAAAGGC 


ATAACCATCA 


TAGGAAATAG 


2940 


TTCCTTTATA 


TCTCGTCATA 


GCCTCTATTT 


TATCAAGAAA 


TTACTCTGTA 


AACAAGGACC 


3000 


TAAAACAAAT 


ATTGTATCGG 


TATAAAAATC 


TCATACTCTT 


CGAAAATCTC 


TTCAAACCAC 


3060 


CTCAGTTTCC 


ATCTGCAACC 


TCAACACACT 


ATTTTGAGCA 


ACCTCCGGCT 


ACCTTTCTAT 


3120 


ACTAGATTGA 


AATAAGATAT 


GAACAACTCT 


ATTAGGAAAG 


TCAAATTAAT 


TTCTAGAAAT 


3180 


ATTTTAGCAG 


CTACACCGTA 


CTATTCCAAA 


CTCAATCAAC 


TATAGTTTGC 


TCTTTGATTT 


3240 


TCATTGAGTA 


TCAAAAGAAA 


AACTTAGGAA 


TCAATCCTAA 


GCTCTCTTCT 


GAAGTAGCTA 


3300 


CATGACAAAG 


ATAGAGATTA 


CAATCAACCA 


ACCTCCTAAC 


ATACTAAAGA 


CCAACATCCC 


3360 


ATTCTGAGTT 


AGTAAGCCAA 


TTGCACCTAG 


AACGAATGGG 


GTCGTAAAGG 


CTCCGAAACT 


3420 


ACAGCCTAAT 


ACAGCAAATG 


AAGTTGCTTG 


ATTGAGGAGT 


TTAGCTGGAA 


TTCGTTCACA 


3480 


GACAAGTTGA 


AAGACCGTCG 


TCAAGACTAC 


ACTATAGGCA 


AATCCAGCCA 


CAACACTTCC 


3540 


TGCTACTACC 


ACCCACAAGG 


ATGAAGACAA 


GGCAATCACG 


ATTTGCCCCA 


AGCCAAAGGT 


3600 


AAT AC C AG AC 


CAGAGGAGCA 


GTTTCTCTTT 


AAAGATAGAA 


ATCAAGAAAG 


AAAAACTCAC 


3660 


CCCAGCCACA 


ATCCCGATCA 


ACTGCATGAT 


ACTAAGAACA 


AAACTAGATA 


ACTGGGCATC 


3720 


CCCCAATCCT 


CTTTCCACCA 


TCAAACTTGG 


AATACGGATG 


GTAATAGCTG 


TATTGCTACA 


3780 


AACTACAACT 


GCCGCTTCGA 


TACCTAAGGT 


AAAAATCAAC 


CCTTTCATTT 


CTCGAGTTAA 


3840 


ACGACTTGCT 


TCCTTCGCTC 


TTTTCTl'CAC 


TTCTTTCTTT 


GATTTTCCAT 


AAGGGACAAA 


39C0 


GAGCAGATAA 


AGGGGCAGCA 


CCAAAAATCC 


AGCACTATAG 


CCTAGAAAGA 


TAGCTGTCCA 


3960 


ACCAAAGGCC 


AACAACTGAC 


CGACGGCCAA 


GGTAATGAGA 


GAAGCTCCAA 


CCACCTCTGC 


4020 


ACAAGCCCGT 


AGCCCTAACA 


TCTGAATTCG 


CCTTTTTCCT 


TGGTAGCGTT 


CACTGATAAT 


4080 


AGAAATGGCC 


TTGGCATTGA 


TCATCCCAAG 


ACCCAAACCA 


AAGAGAAGCC 


GTGTTCCAAA 


4140 


GACAAAGGGA 


TAGCCTTGGT 


ACCAGAAGGG 


AGCTGTACCG 


CTCAATCATA 


AAATCAGCAA 


4200 


GCCCAAACTA 


ATCTGTAAGC 


GCTCAGGAAA 


TATTTTTTCT 


AAGAAACCAT 


TTAGCAGTAA 


4260 


CATCATCATG 


ATTCCAAAGG 


AAGGCAAGCT 


CACCAAGAGC 


TCAATTTGTT 


CCTTAGAATA 


4320 


ACCCTGATAA 


TAGTCAAACA 


TGGCTGGTAG 


GGCACTCGAA 


ATGGAAAAGG 


AGGTAATCAA 


4380 


AACGAGGGAG 


ACAGCCAAAA 


TGCTGGCCCG 


TTCTAAAAAT 


TGTTTCATGA 


AATCTCTTTC 


4440 


TATATTTCTC 


TTAATCTTCT 


ACTTTTTTGA 


TACTTATCAA 


ATAAGCAAGA 


AAAGAAGAAG 


4500 
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CCTCATTGGT TTGTAGACTC CTTCTTAAAT TCCAAAATGA ATCCCTTCTA TCTTATACTC 4 560 

AATGAAAATC AAAGACCAAA CTAGGAAGCT AGCCGCAGCT TCTTCAAAAC AGTGTTTTGA 4 62 0 

GGTTGCAGAT GGAAACTGAC GTGGTTTGAA G AG ATTTTCG AAGAGTATTA GGATGACTTT 4680 

CTCTTGATTT GCTTGATAAA GTAGAAAATA AATCCTGCTA CCATATAGGC AACAAAGATA 4740 

ATCAGACACC ACTTAAACAC AACATTCCAA CCCTT G TTCA CATTCAAAAA GAAGTAAGGG 4 800 

AAAGGATTAT CCTTGGCATT TGGAATATTG A G TTTTAGAA CCAAGCCATT AAAAAGAGCA 4 860 

AACATCATAT ACAGAAAGGG TAAAATGCTC CACACTGCTG GATCCCAAAT CTTGTATTGA 4920 

CCCTGTTTGT CAAAAAAGAG CGTATCCCCT AAAAACCAGA TGGGAACCAT ATAGTGGCAA 4980 

AGGAAATTTT CTAGGGTATA GAAATTAGTC GCAATGGGCG CCAAGAGGAA ATGGTAAATC SO 40 

ACACAGGTAA TCATGATACT CATGGTGACC CCACCTTTTA AGCGCAAGAG ACTTGGCCTT Si 00 

TCCCAATTTT CACCTACACG GCTCATAACC TTTAGAAGAT AAAGGGTAAA AATAGTTACC 5L60 

AAGAGGTTGG ACAGAACCGT GTAATAGAGA ACCATCCCAA AACCACCATG CTTAGTAATT 5220 

TCAAGATAAA CTCCCCTAAA AGCCGCTAGA AACAAGAAGA TACGGCTATA AAATACAAGT 5280 

TTATAGTGTT TTGACATGCT TAAATCTTCC TCACAAACTC TGATTTAACT TTCATGGCAC 5 340 

CAAAACCATC AATCTTACAG TCGATATTGT GGTCGCCTTC TACGATGCCG ATATTTTTCA 5400 

CCCCCCTCCC TTGTTTCAAA T C T T TTGGCG CAC C TTTTAC TTTCAAGTCC TTCATGAGAG 5460 

TTACTGTATC ACCATCAGCC AATTTATTTC CGTTCGCATC GATAGCGACA AGACCTTCTT 5520 

CTACTTCTGC AACTTCAGCA GGATTCCACT CATGAGCACA CTCTGGGCAA ACCAGTAGGG 5580 

CACCGTCTTC GTAGACATAC TCTGAGTTAC ATTTTGGACA ATTTGGTAAA TTCTTCATGG 5640 

TTTCTCCTTA TCATCATTCA CTXTTCTTTG AAAATCAAAA TTTCTCCAAC ACCAACTATT 5700 

ATACCCTAAA ATCAGCATTT TGACAAATTT AGAAAAAAAC CGATATCAAT CTATCGCCTT 5760 

TTCTACATTT ACATTCTTTT TTCAGCTTCT CCTTTGATTT TTTCAACTAC TT C TTGAATG 5820 

TTCAAACCAG TTGTATCAAG GTAGACAGCA TCCTCTGCTT GTTTGAGAGG AGAAGTCTCA 5880 

CGATGACTAT CCTTGTAGTC ACGCGCAGCA ATTTCCTTTT TTAGCCTTTC AAGGTCTGTT 5940 

TCAATTCCCT TGGCAATATT TTCCTTGTAA CGACGCTCTC CTCTCTCATC AACAGAAGCT 6000 

ACTAGGAAAA TTTTCAATTC TCCTTGTGGC AATACAACAG TTCCAATATC GCGACCATCC 6060 

ATCACAATCC CGCCTTGCTG GGCAATTTCT TGTTGGAGAG AAACCAGTTT CTCACGCACT 6120 

TCAGGAATTG CTGCAATAGC AGAAACATGA TTGGTCACTT CATTTTCACC GATAGGATGG 6180 

GTAATATCCA CATCTCCTAC AAAAACAAGC TGGTCTCCAG TTTCTGAACG TCCAAAGCTG 6240 
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ATTGGATGCT 


GGTCCAACAA 


GGCTAGAAGG 


GCTTCGACTT 


CTTCAACTCC 


TAATTGGTTC 


6300 


TTAAGACCCA 


TATACGTCGC 


TGCACGATAC 


ATACCTCCTC 


TATCAACCTA 


GCTGAATCCA 


6360 


AAATCCTTAG 


CAATAATCTT 


TGCGACCGTA 


CTCTTACCGC 


TGCAAGCAGG 


ACCATCAATA 


6420 


GCAATTTCAA 


TTCTTTTCAT 


ATCGGCTCCT 


ATTTTATTTT 


TATAACATCA 


CCTCGATTAG 


6480 


CAAACCAAGA 


TCCTGTAGCC 


ATGTGCCCAG 


GATTCAAGGC 


CTCTAACTGA 


GCAATGGAGA 


6540 


TTCCTGCACG 


AGCGGCAATA 


CCTGCTTCCC 


CTTCTCCTGC 


GAGAACTTTA 


ATCGTTCCTT 


6600 


CAGGATTAGC 


AGCTTCTTCT 


GAACTACTAG 


AACTAGATTC 


TGGCTCTGAA 


CTCTGCTCAG 


6660 


GCTGAGAACT 


ACTTGAAGAT 


GAGATTTGTA 


CTACACTGGC 


ATCAGAATCA 


TGAAACCCTT 


6720 


TTAACGCTGC 


TGTGCGATTA 


CTCCCCCCCG 


ATGATACATA 


GATGAGAACG 


ATCACCATCA 


6780 


CCACCACAAT 


TACAAAGAAA 


ATACTAGCTA 


CGATCGTCAA 


AATACGATTA 


CCCATCCTAT 


6840 


CAGCCCCTCC 


GTGGTTTCGA 
CGGTTCTTTT 


TGCCGACGCT 
GCCATACCTT 


CTGCTCTTGA 


TTCTTCTTGA 


TCATAGATAT 


6900 
6960 


CTTCTTGCCA 


ACTCCTTGTT 


TTTTTTTACT 


TTTCTTATTA 


CAATATAAAT 


ATGAACATGA 


AAATCACACT 


TATACCTCAA 


CGATGTATCC 


CCTGTGGCCT 


7020 


TTGCCAAACT 


TATTCTGATT 


TATTTGATTA 


CCACGATAAT 


GGAATCCTGC 


GTTTTTACGA 


7080 


TGACCCTGAC 


CAACTGGAAA 


AAGAAATTTC 


TCCTAGTCAC 


CATATCTTAC 


AGGCTGTTAA 


7140 


AAATTGCCCA 


ACTCGCGCCC 


TGATTGGAAA 


CCACGAAGCC 


TAAATCAATG 


GCCATAATCC 


7200 


ACTCCCTCTA 


GTTTAGCACA 


TTTCCATGTA 


AAATTATAGT 


CTTTTCACTT 


TATTTTTTTC 


7260 


TGTAAAATCA 


GGAAGGTCAC 


TTTTTTCTTT 


CATAAGATAA 


AGTGCTCTTT 


TTTTAGTCTC 


7320 


TAAATAAATC 


TTACTGATAT 


ACTTGCCGAG 


AATCCCAATG 


GTCAAGAGTT 


GAATCCCTCC 


7380 


AAGAAAGAGA 


ATAACAGCCA 


TCAGAGAGGT 


CCAACCACAT 


CTCGCATTGC 


CCAAAATGAG 


7440 


GGTCCGAACC 


ACAACAAAAA 


ACGTCATCAC 


CAGAGAAAGA 


AAACAAG ATA 


GGAGACCAGC 


7500 


TACAAAGGCT 


ATAATCAAGG 


CAAAATCTGA 


AAAATTAATA 


ATCCCTTCAA 


TGGAGTAGAA 


7560 


AAAGAGTTGC 


CTAAAACTCC 


AACTTGTCTT 


CCCAGCCTCC 


CTTTCCACAT 


TTGGATAGTC 


7620 


CAAATACTAG 


GTTTTGAAAC 


CCACCCAGGC 


CAAGAGCCCC 


TTTGAAAAAC 


GATTGCACTC 


7680 


CGTCAACCTT 


AAAATGGCAT 


CGACTACAGA 


CCTTCTCATC 


ATACGAAAAT 


CACGGACACC 


7740 


CGACGCCAGA 


GCTAC1GGGC 


TGATTTTTTG 


CATGAGGCCA 


TAAAAGAGAA 


CACCACAGAA 


7800 


ACTGCGAAAG 


AAGGCTTCTC 


CCTCCCGACT 


AGTTCTCCCT 


GTCCCAACGC 


AGTCCAAGTC 


7860 


TACATTTTTG 


TCTAATACAT 


TTTTCATCTC 


AAACAACATA 


CTAGGAGGAT 


CTTGGAGGTC 


7920 


TGCATCCATC 


ACCACCACCA 


AATCTCCTGT 


CGCATATTGC 


AAGCCTGCAT 


AAACCCCTGC 


7980 


TTCTTTGCCA 


AAATTTCCAG 


AGAAAGAAAT 


ATAATGCACT 


GCCGGATTTT 


GCTC CCGAT A 


8040 
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GCCCTTTAAG 


AGTTCCAAGG 


TCCCATCACT 


TGATCCATCA 


TCGACAAAGA 


CATACTCGAT 


q i no 


TTCTGTTTCC 


AAATCTGGAA 


GTAAAGCTTC 


CAGACCCTGA 


TAAAAAAGAG 


GAAGTACTTC 


fi 1 £A 
O 1 OU 


CTCTTCGTTT 


AAACAAGGGA 


CGATGATTGA 


AATCATCATC 


TTAGTCTTCA 


AATCCATTTG 




GATGCTTGCT 


TTGCCAACGC 


CATGCGTCTT 


CACACATTTG 


GGTGATGTCG 


AGTTCTGCTT 


OiOU 


CCCAACCGAG 


TTCTGCTTTA 


CCTTTTGCCG 


GGTCTGAGTA 


GCAGGCAGCG 


ATATCACCTG 


a J4u 


GGCGACGTTC 


TACGATGCGG 


TAAGGAATAG 


GACGGCCCAC 


CGCTTTTTCC 


ATGTTTTGGA 


8400 


TAATTTCAAG 


AACTGACTAA 


CCTTTACCAG 


TTCCAAGGTT 


ATAAACGTTT 


AGTCCTGAAC 


8460 


CTTTTTGGAT 


TTTTTTCAAA 


GCTGCAACGT 


GACCCTTAGC 


CAAATCGACA 


ACGTGGATAT 


8520 


AGTCACGAAC 


ACCTGTTCCA 


TCTTCCGTAT 


CGTAATCGTC 


TCCAAACACT 


TGCACTTGCT 


8580 


CTAATTTTCC 


AACGGCTACT 


TCAGTCACAT 


ATGGCAAGAG 


ATTGTTTGGA 


ATACCGTTTG 


8640 


GATTTTCTCC 


CAAATCACCA 


CTCTCATGGG 


CTCCGATTGG 


CTTAAAGTAA 


CGAAGCAAGA 


8700 


CAACATTCCA 


TTCTGAGTCT 


GCTTTGTAAA 


TATCAGTCAA 


AATTTCCTCT 


AGCATGAGCT 


8760 


TAGTACGACC 


GTATGGGTTG 


CTCACTCAAA 


GTGGGAAATC 


TTCCAAGATG 


GGCACTGTGT 


8820 


GCGGATCCCC 


GTAAACTGTC 


GCAGAAGAAC 


TGAACATGAT 


GTTTTT AC AG 


TTGTTTTCTT 


8880 


CCATGGCTTT 


CAAAAGGCTG 


ACACTTCCAG 


CGATATTGTT 


GTCATAGTAG 


GCAAGAGGGA 


8940 


TACGTGTTGA 


TTCGCCAACA 


GCCTTCAAAC 


CACCAAACTG 


AATGACACCA 


GTCGGTTCTT 


9000 


CCTGCTTCAA 


AATATCTCTG 


ACGGTATCTC 


TCTCACGAAT 


ATCTGCCTCA 


TAGAAAGGAA 


9060 


TCTCAACTCC 


TGTGATTCCT 


TCAACAACTT 


CTAAACTCTT 


ACGATTGCTA 


TTGACAAGAT 


9120 


TATCCACCAC 


AACAACTTGA 


TCACCTCCTT 


GGATCAATTC 


AATAACAGTG 


TGGGTTCCAA 


9180 


TAAAACCGGC 


ACCACCAGTT 


ACCAAAATCT 


TTTCTTGCAT 


CTTTTTTCCT 


ZZ XTTCTCAG 


9240 


ATTATTTTTT 


CTTATTTTAC 


CATTTTTGAC 


AGGGAATGTC 


ATTTGCCATC 


CTAAACTACC 


9300 


TGATAAAATT 


TCAGTAAAAT 


GCTTATACTC 


TTCGAAAATC 


CAATTCAAAC 


T ACGT C AACG 


a i £ a 


TCGCCTTGCC 


ATGGGTATGG 


TTACTGACTT 


CGTCAGTTCT 


ATCCACAACC 


TCAAAACACT 


9420 


GTTTTGAGCT 


GACTTCGTCA 


GTTCTATCCA 


CAACCTCAAA 


GCAGTGCTTT 


GAGTAACCCG 


9480 


CCCCTAGTTT CCTAGTTTGT TCTTTGATTT 


TTATTCAGTA 


TTATTCGCTT 

i . . r j. w . p.. 


TTTACTCGTT 


9540 


TGACATAGTT 


TTCAATTGGG 


TAATTTAGAC 


CGTCCAAGCT 


CAACTCCTTG 


TCTTGGATCA 


9600 


GTTGGGCTAG 


ATGGTAACCA 


ATGATAGGAC 


CACTTGTGAG 


GCCTGATGAA 


CCTAGTCCAC 


9660 


TGGCTGCATA 


GACACCAGTT 


AAGTCAGGCA 


CCTGCCCAAA 


GAAAGGAGAG 


AAATCACTCG 


9720 


TCTAGGCACG 


GATTCCAACA 


CGCTCAGATT 


TTGAACTAGC 


TTCAGCCAAA 


ATCACATAGT 


9780 
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GAGTCAAGCT 


GGCCTCCTCC 


ATTTGTTGCA 


GCAACCTTTC 


ATCTACCGTC 


AAATCAAATC 


9840 


CCATGTCATT 


TTCGTGGGTA 


GCGCCTAAGG 


ATAATTTCCC 


ACCTGCAAAG 


GGAATCAAAT 


9900 


CCCACTCCCC 


TTCTGGCATG 


ACAACACCGT 


AATCTTCCAT 


GTCTTGGGCA 


AGCTGATAAT 


9960 


CTCCTAGTTG 


TCCTTTTTGA 


GGACGGACAT 


CCACTTCATA 


ACCTAAAGGC 


TCTAACATGT 


10020 


CCCCCAACCA 


AGCTCCCGTC 


GCCAAAATAA 


CCTGCTCAAA 


CTCCTCTTCA 


CCAATCTGGT 


10080 


AGCCTGATGC 


TAACGGTGTC 


AGAGTCACTT 


TTTCTTTGAC 


CAGCTTGACA 


TGACTGACTT 


10140 


CCAGCAAACG 


AGTCACTAAA 


AGTTCCCCAT 


CTACTCTCGC 


TCCACCAGAA 


GCATAGAGCA 


10200 


GGCGGTCAAA 


TCCCTGCAAA 


CCAGGGAATA 


ATTCATTAGC 


TGAGGCTTGG 


TTCAGAATGG 


10260 


CTAATTGCCC 


TATCAAGGGA 


GATTCTTCTC 


TGCGCTGGAG 


GGCCACTTGA 


TAAAGTTCTT 


10320 


CCAAATTGGA 


TTCATCCTTT 


TTCAAGAGAA 


AGACTCCCGA 


ACGCTGGTAA 


AAGTCGATTT 


10380 


CTTGTCCTGA 


TTTCTCTAAA 


TCAGCTAATA 


AATCCACATA 


AAAATCACCC 


CCCAAGCGCG 


10440 


CCATCTTGTA 


CCAGGCTTTA 


TTACGGCGTT 


TGGAAAACCA 


AGGACTGATA 


ATTCCTGCTG 


10500 


CGGCCTTGCT 


GGCTTGACCT 


TGCTCATGGT 


CAAAAACGGT 


CACCTCTACG 


TCACTTTCTC 


10S60 


TCGAGAGCTA 


GTAGGCAGCT 


GTTGCTCCCA 


CAATTCCTCC 


TCCAATAATG 


GCAACTTTTT 


10620 


TCATTCTCTT 


CACTTTCTAA 


CTAGATATGA 


TGCAAACGAT 


TGGTTGATGC 


CTGACTAGGC 


10680 


AAGATATCAA 


TAGACCACCC 


CTTATCTTCC 


TTCCATTGAC 


TAAGAACTGC 


TGCCATTTTT 


10740 


TCTACAAAAA 


TCACTTCGAT 


ATACTGACCT 


CGGTCCAATC 


CAAGCAACCC 


ATCAGATAGC 


10800 


ATATCCTGAG 


CACTATGGTA 


GTAGATATCA 


CCACTGATAT 


AGACATCTGC 


CCCCTTTGCC 


10860 


AAACCATCCT 


TATAGAAAGA 


CTGCCCCCTT 


CCACCACAAA 


TTGCTACTCT 


TGAAATAGGC 


10920 


TTCTGCAAAT 


CATCCTCTTG 


ATAATGCACC 


ATTCGAAGCC 


TATCTAGGTC 


AAACACTTGC 


10980 


TTGACCTGTT 


GGGCCAATTC 


CCAAAATGTC 


TGAGGCTGAA 


TATTCCCAAT 


ACGTCCAATT 


11040 


CCACGTTCTG 


GACCTGTTTC 


CTGCAGATAA 


CTCCTCTCCT 


CG ATTCCT AG 


CATCTGACAA 


11100 


AACCAGTCAT 


TGAGCCCATT 


TTCAACGATA 


TCAATATTGG 


TATGGCTGAC 


ATAAACTGCG 


11160 


ATATCATGCT 


TAATCAGCTC 


GATGTAAATC 


TGATTTTGCG 


GACGGCTGGC 


AAGCAACTCC 


11220 


TTGATAGGAC 


GAAAGATAGG 


CGCGTGCTTG 


ACGATAATCA 


ACTCCACACC 


CTTTTCAATG 


11280 


GCCTCTGCCA 


CTGTCTCTTC 


ACGAATATCG 


AGGGCAACCA 


TGACCCTTTG 


GATACCCTTG 


11340 


TCTAAAGTGC 


CAATTTGCAG 


ACCACGGCTG 


TCTCCCTCCA 


TAGAAAATTC 


CTGAGGGCAA 


11400 


AAGGCTTCAT 


AAGCTTGGAT 


CACTTCACTT 


GCTAACATGG 


ACCACCTCCT 


TGATAGCTTC 


11460 


AATCTTATCT 


ACTAGAACTT 


GACGTTCTTC 


CAGATTTTTT 


TCTGGGATTT 


GTCCGAGGGC 


11520 


GAACTCTAGC 


TTCTCACCTT 


CTTTTTGCCA 


TTTTTGGACA 


AATACTGGAC 


TCAcrrcriT 


11580 
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CGACAAGAAG 


GGACCAAACC 


GAACATCACT 


GGCTGATAGC 


TTCATTTGTC 


CTGCTTCCAC 


1104U 


CACCAAAATC 


TCATAAAACT 


TTCCAGCTTC 


TTCTAAGATG 


CTTTCTCCTA 


CAATCTGGAA 


11 / UU 


TCCATGATCC 


TGTAGCCAGA 


TACGCAACTC 


GTCTTC ACG A 


TTATTGGGCT 


GGAGGATCAA 


i 17 oO 


ACGCTCTACA 


TTAGCTAACT 


TCCCCAAACC 


TTCTTCTAAA 


ATCCTAGCAA 


TCAAACGACC 


1 1820 


ACCCATGCCA 


GCAATCCTAA 


TCACAGACAC 


TTGGTCAGTC 


TCTTCAAAAG 


CTGCCAAGCC 


11880 


ATTGGCTAAA 


CGGACTTGGA 


TTTTCTCCTT 


TAGGCCGTGA 


GCCTCAACAT 


TTTTAACCGC 


11940 


AGACTGATAG 


GGACCTTCCA 


CCACCTCACC 


TGCAATAGCG 


CTTTTGATTT 


GGCCTCTCTC 


12000 


AACCAACTCG 


ATAGGCAGAT 


AAGCATGGTC 


ACTTCCCACA 


TCTACTAAAA 


TAGCCCCCTC 


12060 


TGACACAAAG 


GAAGCTACCA 


ATTCTAATCT 


CTTTGAAATC 


ATCTTCTCTC 


ACTTTCCAAA 


12120 


ACTCTATTAC 


CTCTTATTAT 


ACCACATTTC 


AATCTTCAAC 


TTCCCAGTAA 


TATAAGCACC 


12180 


TCTGGCGAAA 


GAAGTTTCAA 


TGTCCTAAAG 


TAATAAGTGA 


ATCCAATTGA 


AAGATTTTAA 


12240 


ACAATTTGCA 


AAAATGTCAA 


AAAATAAAAA 


ATAAACAGTT 


TATTCAGAAA 


ATTCTTGACA 


12300 


TATAAAAACA 


CATGGTAGAA 


TATAATTAGA 


AAGTTAGAAA 


AAATAAAACT 


TTGACTAAAA 


12360 


TTTGTATTTG 


AAGGTGGTGT 


TCAGATAAGA 


AATTTAGTCA 


GACGAACCAC 


GAATTTGCTC 


12420 


TATCCTTTCT 


GGAATTTATC 


ATAACAGGAG 


GATACAGTCA 


TGCAACAAAC 


ATTGTTTGAA 


124 80 


TTAGAACTAC 


TTCCAGAGGA 


AGATATCATT 


CTCACAGGTC 


TCCCTAAGTA 


TTGTTCTTTT 


12540 


ACTTGTTTAA 


TTACAGCTCG 


CTACTTATAT 


TTTATATAAA 


AT AAGT AG CT 


TTACTTACGG 


12600 


AATAGGCTAG 


TGCTGTGTCT 


CTACCCTATT 


TTAATAATTA 


CGAGTTTGTT 


ATGGATTTAT 


12660 


TAGAGAAAGA 


ATGTTTAAAA 


TGTGATAAAA 


ATTTCCAACA 


GGGTGATATT 


TGGAATTACT 


12720 


ATTATTTATC 


AGATAAGATG 


CCTGCACAAG 


GGTGGAAAAT 


ACACATAAGC 


TCCCAAATAA 


12780 


AAGACGCTGT 


AAATATTTTT 


AAGATTGTGT 


ATAAACTATC 


CCAACTAAAT 


AATTGTAGCT 


12840 


TTAAAGTTGT 


TAAAAATTTA 


GAGGAATTAA 


AAAAAATTAA 


TTCCCCTAGG 


CAAATGAGCC 


12900 


CTACTCCTAA 


CAAATTTATA 


ACTCTATATC 


CTAAGTCAGA 


ATCTGAAGCT 


AAGAGTATGA 


12960 


TTTGTAATCT 


TACGAATAGA 


CTGTCAGAAT 


TTAAGGCTCC 


AAAAATACTA 


TCTGACTATC 


13020 


AATGTGGAAT 


CCATTCTCCA 


* GTTCATT AT A 


GATATGGGGC 


TTTTTTAAAA 


AAACAAGCTT 


13080 


ATGATGAAAA 


AAATAAAAAA 


CTCATCTATT 


TATTGCTAGA 


TGAAAAAAGG 


AAGAACTATG 


13140 


TAGAAGATAA 


GAGACAAAAT 


TTCCCTAGTC 


TTCCTAGCTG 


GAAAATGGAT 


TTATTTTCAG 


13200 


AAGAAG 












13206 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



4 



<xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


34: 






CCGGATCCAG 


CGAAAAATAT 


GCTCTTTGAT 


GCTGTAACTG 


GTCAAAAAGA 


TGCTAAAACA 


60 


GCTGCTAACG 


ATGCTGTAAC 


ATTGATCAAA 


GAAACAATCA 


AACAAAAATT 


TGGTGAATAA 


120 


AAAATTTGTT 


CAAGGGGGGT 


GGAAATCAAA 


TCCCCCTTTG 


AATTTATCAA 


TAGAGACACA 


180 


AATAATTTAG 


CTTTCTTATA 


AAAAAGTAGT 


ATCCTATGAA 


AGGAGTTAAT 


ATGGAAAAGC 


240 


AACAACCTAG 


TAAACCAGCC 


CTGCTGTCTA 


TCATTCCTGG 


GTTAGGACAG 


ATTTACAATA 


300 


AACAAAAAGC 


CAAAGGTTTT 


ATCTTCCTTG 


GTGTAACCAT 


CGTATTTGTC 


CTTTACTTCC 


360 


TAGCACTTGC 


AACCCCTGAA 


TTGAGCAACC 


TCATCACTCT 


TGGTGACAAA 


CCAGGTCGTG 


420 


ATAATTCCCT 


CTTTATCCTG 


ATTCGTGGTG 


CCTTCCATCT 


AATCTTTGTA 


ATCGTTTATG 


480 


TACTCTTTTA 


TTTCTCAAAT 


ATCAAAGATG 


CACATACGAT 


TGCAAAACGC 


ATTAACAATC 


540 


GAATTCCAGT 


TCCACGCACA 


CTCAAAGACA 


TGATCAAACC 


GATTTATCAA 


AATCGCTTCC 


600 


CTTACCTCTT 


GATCATTCCA 


TCTTATGTTG 


CCATGACCTT 


CGCGATTATC 


TTCCCAGTTA 


660 


TCGTAACCTT 


GATGATCCCC 


TTTACCAACT 


ACGACTTCCA 


ACACTTGCCA 


CCAAACAAGT 


720 


TGTTGGACTG 


GGTTGGTTTG 


ACCAACTTTA 


CAAACATTTG 


GAGCTTGAGT 


ACCTTCCGTT 


780 


V* X \S\m Li 1 1 WW 


TTCTGTTCTT 


TCTTGGACTA 


1 \mt\ 111 wuuL 


111 VVjivAViW 1 


TCTACTTTAC 


S4U 


AAATCGTAAT 


TGCTATCTTC 


ACAGCTATCA 


TTGCCAACCA 


ACCATTTATC 


AAAGGAAAAC 


900 


GTATCTTTGG 


TGTTATTTTC 




GGGCTGTCCC 


AGCCTTCATC 


ACTATCTTGA 


960 


CATTCTCAAA 


CATGTTTAAC 


GATAGTGTCC 


GTGCTATCAA 


CACTCAAGTA 


TTGCCAATCT 


1020 


TGGCTAAATT 


CCTTCCTTTC 


CTTGATGGAG 


CTCTTATTCC 


TTGCAAAACA 


CACCCAACTT 


1080 


GGACTAAGAT 


TGCCTTGATT 


ATGATGCAAG 


GTTGGCTCGG 


ATTCCCATAC 


ATCTACGTTC 


1140 


TGACCTTGGG 


TATCTTCCAA 


TCTATTCCTA 


ACGACCTTTA 


CGAACCACCT 


TATATTGACG 


1200 


GTGCCAACGC 


TTGGCAAAAA 


TTCCGCAACA 


TCACTTTCCC 


AATGATTTTG 


GCTGTTGCGG 


1260 


CACCTACTTT 


GATTAGCCAA 


TACACCTTCA 


ACTTTAACAA 


CTTCTCTATC 


ATGTACCTCT 


1320 


TCAATGGTGG 


AGGACCTGGT 


AGTGTCGGAG 


GTGGAGCTGG 


TTCAACCGAT 


ATCTTCATCT 


1380 


CATGGATCTA 


CCGTTTGACA 


ACAGCTACAT 


CTCCTCAATA 


CTCAATGGCG 


GCAGCTGTTA 


1440 


CCTTGATTAT 


CTCTATCATT 


GTCATCTCAA 


TCTCTATGAT 


CGCATTCAAG 


AAACTACACG 


1500 
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CATTTGATAT 


GGAGGACGTC 


TAAGATGAAT 


AACTCAATTA 


ft. ft /Wli^ k k k 

AACTCAAACu 


TAGAC1\> 1 


1560 


CAAACCCTTA 


CTTACCTTTA 


CCTGATTGCT 


CTATCAATTG 


ft *W*I « MIX 

TAATTATCTA 




1620 


ATTACCATTA 


TCTCAGCCTT 


TAAACCAGGT 


AACGTCTCAG 


CCTTTAAACT 


AV»A 1AU 1 A 


1680 


ATCGACCTCA 


ATTTTGATAA 


CTTTAAAGGC 


CTCTTCACTG 


AAACCTTGTA 




1740 


TACCTCAACA 


CTTTGATTAT 


CGCCTTAATT 


ACCATGCCTG 


TTCAAACAAG 


TATCATCuTA 


1 ROO 
A ovw 


CTTGCTGGTT 


ATGCTTACAG 


CCGTTACAAC 


TTCTTCGCTC 


GTAAACAAAG 


TTTGGXCTTC 


1 860 


TTCTTGATCA 


TCCAAATGGT 


GCCAACTATG 


GCCGCTTTGA 


CACCCTTCTT 


CU i i ATCCCU 




CTTATGTTGA 


ACGCCCTTAA 


CCACAACTGG 


TTCCTCATCT 


TCCTCTACGT 


TGGTGCTGGT 


170U 


ATCCCCATCA 


ATGCTTGGCT 


CATGAAAGGC 


TACTTCGATA 


CAGTGCCAAT 


GTCTTTAGAC 


204U 


GAATCTGCAA 


AACTAGACGG 


TGCAGGACAC 


TTCCGCCGCT 


TCTGGCAAAT 


TGTTCTACCA 


O 1 Aft 


CTTGTTCGCC 


CAATGGTTGC 


CGTACAAGCT 


CTCTGGGCCT 


TCATGGGACC 


TTTCGGGGAC 


2160 


TACATCCTCT 


CTAGTTTCTT 


GCTTCGTGAG 


AAAGAATACT 


TTACTGTTGC 


CCTAGGTCTC 


2220 


CAAACCTTCG 


TTAACAATGC 


GAAAAACTTG 


AAGATTGCCT 


ACTTCTCAGC 


AGGTGCTATC 


2280 


CTCATCGCCC 


TTCCAATCTG 


TATTCTCTTC 


TTCTTCCTAC 


AAAAGAACTT 


TCTTTCAGGA 


2 34 0 


CTTACAAGTG 


GTGGCGACAA 


GGGATAATTT 


ATCCCCGCCA 


CCCTTTTTCA 


TTTTATACTC 


2400 


TTCGAAAATC 


TCTTCAAACC 


ACGTCAGCTT 


TATCTCCAAC 


CTCAAAGTTG 


TGCTTTGAGC 


2460 


AACCTCTGGC 


TAGTTTGCAC 


TTTGATTTTC 


ATTGATTATT 


AGCAATTGTC 


ACTGTAAATA 


2520 


ATATCCTTGT 


AGCAAGCAAT 


TTTTCTCCTA 


GACTTGAAAT 


AAAGCGCATT 


■ 1J MWl" »W ft M ft. «M ft 

TCTCTATATA 


*> con 


ATAATACTCA 


TATAGAAAAC 


ACCTTTTACA 


AAGATACCTA 


TGCTTCCATA 


TCCA'l 'l HVL 


7 AAA 


TATTTTTCArx 


GTATTTGGGG 


GGTTCGTAAG 


CCCCTGTCCA 


AACGTTTCGA 


GCvCAAtTTtrtf 


* * uu 


TTTCAACTTC 


TCTTTACCAG 


TATCTTCCTT 


ATC AG CTTGT 


CTATGGTACC 


CATTGCTATU 


4 / DU 


CAAAACAGCT 


CCCAGGAGAC 


CTATCCGCTA 


GAAACTTTTA 


TCGATAATGT 


CTATGAACCT 


2820 


CTGACAGATA 


AGGTTGTCCA 


GGATCTCTCT 


GAACATGCTA 


CAATTGTCGA 


TCGCACATTA 


2880 


ACTTATACTG 


GAACAGCTAG 


TCAAGCCCCT 


TCTGTTGTGA 


TTGGTCCAAG 


TCAAATCAAG 


2940 


GAATTACCTA 


AGCACTTGCA 


ACTGCATTTC 


GATACAAATG 


AGCTAGTCAT 


CAGCAAGGAA 


3000 


AGCAAGGAAC 


TGACCCGCAT 


CTCTTACCGA 


GCCATTCAGA 


CTCACAGTTT 


CAAAAGCAAA 


3060 


CACAGCTTGA 


CCCAAGCAAT 


TTCTAAAGAC 


TGGTACCAAC 


AAAATCGTGT 


CTATATCAGC 


3120 


CTCTTCCTAG 


TTCTCGGTGC 


GAGCTTCCTC 


TTTGGTTTGA 


ATTTCTTTAT 


CGTCTCTCTT 


3180 


GGAGCTAGCT 


TTCTCCTTTA 


TATCACCAAA 


AGATCACGCC 


TCTTTTCATT 


TAATACCTTT 


3240 
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AAAGAGTGCT 


ACCATTTTAT 


CTTGAACTGT 


TTAGGATTGC 


CGACTCC'AT 


TACACTTATT 


3300 


TTGGGATTAT 


TTGGCCAAAA 


TATGACAACC 


CTGATTACTG 


T AC AAAAT AT 


TCTTTTTGTT 


3360 


CTGTATCTGG 


TCACTATCTT 


TTATAAAACA 


CATTTCCCTG 


ATCCAAATTA 


CCATAAATAG 


3420 


GAGATTTTTA 


TGCCCCTTAC 


GATTAAAGAC 


CTGGCCAAGC 


CTGCTGGTCT 


TTCGCCTTCA 


3480 


ACCCTAACCC 


GTGTTATTCA 


AAATAAATCA 


ACCATTAGCG 


ACGAAACAAA 


AAAACGTGTT 


3540 


CGCAAACCTA 


TGAAGGAACT 


CAACTACCAC 


CCAAACCTCA 


ACGCTCGTAG 


CTTGGTAAGC 


3600 


AGCTATACTC 


AGGTTATCGG 


ATTAGTTCTT 


CCTGATGACT 


CAGACGCCTT 


CTACCAGAAT 


3660 


CCTTTCTTTC 


CATCGGTTCT 


ACGTGGCATC 


TCTCAAGTCG 


CATCTGAAAA 


CCACTATGCC 


3720 


ATTCAGATAG 


CAACAGGGAA 


AGATGAGAAG 


GACCGTCTCA 


ACGCTATTTC 


ACAAATGGTC 


3780 


TACGGCAAGC 


GTGTAGATGG 


GCTAATTTTT 


CTCTATGCCC 


AAGAAGAAGA 


CCCTCTCGTA 


3840 


AAACTCGTCG 


CAGAAGAACA 


GTTCCCCTTC 


CTTATCTTAG 


GTAAATCTCT 


ATCTCCTTTC 


3 900 


ATCCCACTTG 


TCGACAACGA 


CAATGTTCAA 


GCTGGTTTTC 


ATGCGACTGA 


ATATTTCATC 


3960 


AAAAAAGGCT 


GCAAACGCAT 


TGCCTTTATC 


GGAGGAAGTA 


AAAAGCTCTT 


CGTGACCAAA 


4020 


GACCGTTTAA 


CACCCTATGA 


ACAGGCGCTT 


AAACATTACA 


AACTTACCAC 


TGACAACAAT 


4080 


CCCATCTACT 


TTGCCGACGA 


GTTTCTGCAA 


GAAAAGGGCT 


ATAAATTTAG 


CAAGCGATTA 


4140 


TTCAAGCACG 


ATCCACAAAT 


TGATGCTATC 


ATCACAACCG 


ATACCCTCCT 


AGCTGAAGGT 


4200 


GTTTGTAACT 


ATATTGCCAA 


ACACCAGCTG 


GATCTCCCTG 


TTCTCAGCTT 


TGACTCGGTT 


4260 


AATCCCAAGC 


TCAACTTGGC 


AGCCTATGTC 


GATATCAATA 


CTTTAGAGCT 


TGGTCGTGTT 


4320 


TCCCTTGAAA 


CTATTCTCCA 


GATTATTAAT 


GATAATAAAA 


ACAATAAACA 


AATTTGTTAC 


4380 


CGTCAATTGA 


TCGCCCACAA 


AATTATCGAA 


AAATAAGAGA 


CTGGGCAAAA 


AGTCGTTAAA 


4440 


AGCAAAAACG 


CATACTATCA 


GGTATTGAAA 


AAACTTGATA 


CTATGCGTTT 


TATTGTGGGA 


4S00 


AGATTTACTT 


CCTTTTCTAC 


TGAAATTGAG 


TCTTTTCCCA 


AGATCTTTTT 


ATACTCAATG 


4S60 


AAAATCAAAG 


TGCAAACTAG 


GAAGCTAGCC 


GCAGCTTGCT 


CAAAACACTC 


TTTTGAGGTT 


4620 


GTAGATGAAA 


CTGACGAAGT 


CACTAACCAT 


AC CT ACGGC A 


ACGTGAAGCT 


GACCTGGTTT 


4680 


GAAGAGATTT 


TCGAAGACTA 


TTAATCACTA 


ATTATCTATC 


TCAACAAATC 


TTCCTAGAAT 


4740 


ATGAACATTT 


TCCCAGACAG 


AGACAAAGGA 


GCTTGGATCC 


ACTTGTGTCA 


TAATCTGTTT 


4800 


AAATTCATTA 


AACTCTGCAC 


GTGTAATGAC 


AGTGATTAAA 


ACTGCCTTTC 


TCTCGTGATT 


4860 


ATAGGTTCCT 


TCTGCATCGT 


GGATCATGGT 


TCCTCCGCGG 


TGCAATTTTT 


TATGGATTTT 


4920 


TTCAATTACC 


TTCTCTGGAT 


GATTTGTCAC 


AATCATGCCC 


TGCATACGCT 


TTTGCTTACT 


4980 


AAAGACTGCC 


TCTGTCACAC 


GGCTAGAGAC 


AAAGATCGTA 


ATCATAGAAT 


AAAGACCGTA 


5040 
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TTTCCAACCA 


AAGGTCAAAC 


4 mm0m y ^ MM ft (MM ft M 

CTGCTATCAG 


ft VV* ft mft 

CATGATAGTT 


MM » IMM ft MM JL 


AuAAAuAAAT 


c i f\r\ 
3 100 


ACTACCGACA 


TTCTTACCCG 


TTTTCTTACG 


ft ft tr* ft ft 

AATAGTCAGG 


CTuACOAT AT 




3 1 OO 


ACTGGACATA 


■ri'UTTi'CGAA 


m « ft ft ft 

GAGCAAAACC 


% ft 09*^^0+^ ft It ft 

AATCCCCAAA 


ft M K ft *^ ft ft 

CCCATAACAA 


M «y MMMMM ft ft ft 

WACCtCCAAA 


3220 


AAGGGAATTG 


ATAATGGGAT 


CCTCTGTCAA 


GCTTGCCACA 


& ft ft ft am 

GGGACAAACT 


^m ft «n ft ft ft ^4 ft ft 

GGATAAAGAA 


S280 


GCAACTCATA 


GATACCGTGA 


TAAAGGTAAA 


GACGGTGAAC 


TTATGGCCAA 


»MH»M ft tft ft MM m 

TCTGATACCA* 


5340 


AGCTAAGACC 


ATCAAAGGGA 


AGTTAATGGC 


CTAGAAGCTT 


ft ^+*m*m m ft ft t MU*^* 

AGCGAAATvG 


GAATATGAAA 


5400 


ACCAAACCAG 


TGATTACTCA 


AGGCAGAGAT 


AATCTGTCCC 


AGACCTGTTG 


CACCACTCGA 


5460 


ATACACATGC 


CCTGGTTGGA 


AAAAGAAATT 


AACTCCTACT 


GCTGATAAAA 


AACCATAGAC 


5520 


CAGAGAGGCC 


GAAATCTTCT 


CATCATACTT 


TTCTCGACAG 


ATACTTTGTA 


AGACACGTAA 


5580 


AATTTTTATC 


TGATAAGCAA 


AGCGGCGCAG 


ATAATAGCGC 


CACCGCTTAA 


TTCGTTTTGT 


5640 


TTGTTTCATC 


TTCTTCTACT 


TGTAAGCTGA 


CTTCCTCTAG 


TTGTTTGAGA 


GCGACTGTTG 


5700 


ATGGAGCTTG 


TGTCATTGGG 


TCAGTTGCCT 


TGTTCTTCTT 


AGGAAACGCA 


ATGACTTCAC 


5760 


GGATATTTTC 


TTCTCCAGCA 


AGCAACATGA 


CAAAACGGTC 


AAGCCCGATA 


GCCAAACCAC 


5820 


CGTGTGCTGG 


GAAACCATAG 


TCCATGGCTT 


CAAGAAGGAA 


ACCAAACTGG 


TCATTGGCTT 


5880 


CTTCAGTTGA 


GAAACCAAGA 


GCCTTGAACA 


TGCGTTCTTG 


AAGGTCTTTT 


TGGTTGATAC 


5940 


CAAGGCTACC 


ACCACCAAGC 


TCATAACCGT 


TCAAGACGAT 


ATCCTAAGCA 


ATGGCACGAA 


6000 


CCTTAGCCAA 


ATCACCTTCT 


AATTCATGAG 


CACTCTCTTC 


CTGTGGAAGT 


GTGAAAGGAT 


6060 


GGTGGGCCCT 


CATCTAGCGG 


CCTTCTTCTT 


CAGACCATTC 


AAACATCGGC 


CAGTCAACCA 


6120 


CCCAAAGGAA 


GTTGAACTTA 


TCATTATCAA 


TCAAGCCAAG 


CTCTTTAGCA 


ATACGTCCAC 


6180 


GAAGGGCACC 


CAGTGTTCCA 


TTAGCCACTT 


CAAGCGTATC 


CGCCACAAAG 


AGAACCAAGT 


6240 


CCTTATCTTC 


AAGAACAAGC 


GCTGTTGTCA 


ATTCTTCTTG 


GATACCAGTC 


AAGAACTTGG 


6 300 


CAACTGGTCC 


GTTTAATTCT 


CCATCAACCA 


CCTTGACCCA 


AGCAAGACCT 


TTCGCACCAT 


6360 


ACTGTTTGCC 


TACTTCCGTC 


ATCTTGTCGA 


TGTCTTTACG 


TGAATAGTTG 


TCCGCAGCTC 


6420 


CTGTGACCAC 


AATCGCTTTT 


ACAGCAGGTG 


CTTCTGAAAA 


GACTTTAAAG 


TCTACACCTC 


6480 


GCACCACTTC 


TGTCAACTCC 


TGAAGCAACA 


TGTCAAAACC 


AGTATCTCGC 


TTGTCAGAAC 


6540 


CGTAAAGAGC 


CATAGCATCA 


TCGTATTTCA 


TACGAGGGAA 


TGGTAGCGTT 


ACTTCGATGC 


6600 


CTTTTGTTTC 


CTTCATCACG 


CGCGCCATCA 


AGCTTTCTGT 


AATATCTTCG 


ATTTCTTGCT 


6660 


CAGTAAGGAA 


GGACGTTTCC 


AAGTCGACCT 


GAGTAAATTC 


AGCCTGGCGC 


TCTCCACGCA 


6720 


AGTCCTCGTC 


ACGGAAACAT 


TTAACGATTT 


GGTAGTAACG 


CTCAAAACCA 


GC ATT CATC A 


6780 
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AGAGCTCTTT 


CGTGATTTGT 


GGACTTTGAG 


GAAGAGCGTA 


AAAATGCCCC 


TTATTAACAC 


6840 


GACACGCCAC 


TAAATAATCA 


CGCGCCCCTT 


CAGCCGTTGA 


CTTACAAAGG 


AATGGTGTCT 


6900 


CCACGTCGAT 


AAACTCCAAC 


TCATCCAAGT 


AGTTGCGGAT 


AGAGTGGCTC 


ACCTTGGCAC 


6960 


GAAGTTT AAG 


ATTTTCCAAC 


ATI"!" CTGG AC 


GACGAAGCTC 


AAGGTAACGG 


TAACGCAAAC 


7020 


GTCTATCGTC 


ATTTGCCTCA 


ATGCCATCCT 


TAATCTCAAA 


TGGTGTTGTC 


TTACCTGTCT 


7080 


TAAGCACAAT 


AAGACCTCTC 


ACGTTTAACT 


CAACCGCACC 


ACTTGGCAAC 


TTATCATTGC 


7140 


CTTGTCACGC 


GCAGCGACCT 


GACCAGTCAC 


CTCAATAACA 


AATTCGCTAC 


GAAGGCTTTC 


7200 


AGCTGTTGCC 


ATAACCTCTG 


CAGATACTTT 


TTCAGGGTTG 


ATAACCAACT 


GCATGATTCC 


7260 


TTCACGCTCA 


CGAAGATCCA 


TAAAGATCAA 


ACCACCAAGC 


TCACGACGAC 


GGCCAACCCA 


7320 


TCCTTTCAAG 


GTTATTTCTT 


CTCCGATGTG 


TTCCTCACGA 


ACACGACCAG 


CATACATACT 


7380 


ACGTTTCATT 


ATTTCTCTCC 


TCTTTTATTC 


TGTTACTATT 


TTACCATAAA 


AGCGCAGCTC 


7440 


TTCATGAAAA 


TCATCAGAAA 


AGTTTGCCAG 


TCTTTAAAAG 


TCAGGTGAAA 


GCCCTAAAAA 


7500 


TTAGCCCTAA 


TACTCTT CG A 


AAATCTCTTC 


AAACCACGTC 


ACCGTCCCCT 


TACCGTATGT 


7560 


ATGGTTACTG 


ACTTCGTCAG 


TTTCATCTAC 


AACCTCAAAA 


CCATGTTTTG 


AGCTGACTTC 


7620 


GTCAGTTCTA 


TCCACAACCT 


CAAAACAGTG 


TTTTGAGCAA 


CCTGCGCCTA 


GCTTCCTAGT 


7680 


TTGCTCTTTG 


ATTTTCATTG 


AGTATAATAC 


AAAAATCCGA 


TGAACTTCAC 


CGGACTCTTT 


7740 


TATTTTGAAT 


TTTTGCCTGC 


TTTACGCTTT 


TCAGCGATTT 


CCGCTGCCTT 


TCGAGGCAAG 


7800 


ACAATTTCCG 


TTATGTAACC 


CGTCCCAAAA 


CCCAGTACAC 


CTGCAATACG 


AGCAAAGACA 


7860 


ACTCCTAGAT 


AGTTATAGAA 


GAAATCGCCT 


TTGAAGGCAT 


AACCTAGCGC 


TCCAATGATG 


7920 


AAAAATAGAA 


CGACTGCCTG 


AATCACTGCT 


AATAAAATTA 


CTCGTTTCAT 


GTGACCTCCT 


7980 


GACTCTATTA 


TAGCATGAGA 


ATCATCAAAA 


AGCCGACTAA 


ATTATTCAAA 


GCGTGAAGAG 


8040 


AAATACTGTA 


GACCAGACCT 


TTTCTGCTAA 


TGTAAGCCAA 


ACCCAAACTA 


AAACCAAGGC 


8100 


TAAAATAGAC 


AAAAAATTGT 


TCCACATCAC 


CTGCAAAATG 


AATCAAGGCA 


AATAGAAGAC 


8160 


TAGATACCAC 


AAGAAAAATC 


AGGGTTCGTT 


TACTATTGTC 


CTGCTTAGGA 


AAGAGATAGC 


3220 


GTGCTAACAT 


CCCTCTAAAA 


ACAATCTCTT 


CCCTCAAAGG 


AGCAAAAATA 


ACCACAGCAA 


8280 


AGAATGAGAA 


AAGTGCTTGA 


GACAAGGTCA 


AGTCTGTCGC 


TATTTGCTGA 


TTTACTCAAG 


8340 


GATCATCTCG 


CAAGAAGAAT 


TGAACGACCA 


GAGATAAGAA 


CCAAACCAAG 


ACAGGAAGCC 


8400 


AAATAAATCG 


ATTAAAGCCG 


CTCTTCTCAA 


TATGAACAGG 


AGCCTTCTGA 


TACCATTTGT 


8460 


AAATGCCGTA 


CACATATACT 


CCAGCCAAGG 


CCACATAGAG 


TAGAGTAACA 


GCATAGGGTG 


8520 


AAGCGCCTAA 


AGCAAGCCAC 


GCAGTCGCGA 


GCCCCTGAAT 


AAAGCCATAG 


ATAAA7AAAA 


8580 
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AGGATAGAAG 


GGCTAGAAGA 




uul ill iMu 


TJL&TTTCJITa 
1 n/\ lit \-/* l n 


V* A * /W%^ A V>V» 1 


8640 


TTATTTGAAA 


TAACGTTTTA 


LLATAWjTAA 








8700 


TCCTACAAGC 


AAGAAAGCTA 


GTAACTGAAT 


CTCTCCTGTC 




1 un 1 AA 1 AAVJ 




AAAAATATAT 


AAGGCTGGTA 


AGACATATTG 


OTGTAATTGG 


& »*PA & & ll«T < *TV 


l*AAAAt- IV- I A* 


00«u 


TTCCAAATTA 


GCCTGACGCT 


CCCCTTCATC 


ATAAGAATTT 


ATATACTTCA 


AGAV.ATUCTT 


Qoon 
ooou 


TGGTCTAGCG 


AAAAATTCCA 


AATCAAACTG 


ACGAACAATC 


GCAATGGTTT 


TAAAAAvjAGA 


074U 


TTTTTG AG CG 


ACTAAGAATA 


CCACAAAGAG 


TAAGAAAGAA 


ft Ik « ft 1 H fM^* 

AGGAAAAATG 


TTTGAGGGTT 


9000 


TGTATGCAAT 


ATAATCACCT 


CACTTAATGA 


AATAAAAATA 


GCCAATGGAA 


TCGCTACACC 


9060 


TGTAATATTA 


AAAGCAATGG 


TTCCAAACTC 


AAGATTCCGA 


TACATTTGCA 


CATAATAGGT 


9120 


TTCATTCAGA 


TCGTCATCCA 


TTTCCTCTTG 


ATACAAAGAA 


TGAAATTTTC 


TGCTTTTCTT 


9180 


TAAGAAATTG 


AAAGTCAAAA 


ACATACTAAT 


GAAACCTATC 


AGTAAACAAA 


TAGCTGATAT 


9240 


CCATGGCATC 


AAGGCTTTTA 


CATCTAAAAT 


AATTTCGTGG 


GATTCGACAC 


GTGCCTTAAA 


9300 


CATCCCTACA 


AACATGCCCA 


AGAACCCCCC 


AACACAATAG 


ACATCAAAAA 


TAACAATCTA 


9360 


CGTTTCTTTT 


TCATATTCAT 


TCTCCTTTTT 


CACTTCCTAG 


ATTTTTGGAT 


TTCTTTTCAA 


9420 


TCCATTCAAT 


TACTGGCATC 


AGAGCAAAGT 


AGACCCAAAC 


AAATTGGTCG 


CTTTGATAGG 


9480 


CATTAAACCA 


GCTTAGGTCC 


ATCCCAATCA 


GTAGAAATAC 


GCTGACTAAT 


AAAGCTATGA 


9540 


CCACTACATA 


ATAAATCACT 


TTATACTTGT 


TCATCACTCG 


TCCTCCTCCA 


m * ft ft ft#*%ftv^% 

AACGAAATAC 


9600 


CGATTCGACT 


GTTTCGTTGA 


AAATTTGAGA 


TATTTTCAGG 


GCAATGATAA 


TGGATGGGGT 


9660 


GTACTCATCC 


CGTTCTACTA 


GGCTAATGGT 


CTGTCTGGAA 


ACCCCTGCCA 


GTTTGGCTAG 


9720 


GTCCGTTTCA 


TTGAGACCAT 


CGCGAGCTCG 


AAGCTCTTTT 


AGACGATTTT 


TTAGTTGCAT 




CTTACACACC 


TACTCTCCGT 


CAAATTCAAC 


GGTTTGGATA 


TCCTCAATAC 


CTTGCnAtTT 


704 U 


GAATTTTTCT 


TTTCCCCTAT 


TATCTACACG 


TCGTAGCTTT 


ACCCATTCCT 


CATCAACATC 


9900 


CACAACTTCC 


CAGTTATCTG 


GCCCAATATA 


CACTCCCGTT 


ATAATTCGTT 


CCTTTCCAAT 


9960 


CATTTCTTGT 


AATAATCTCG 


ACATTTCTGC 


GTTTCCTTTC 


TCTTTTCGCT 


CAAGTCTTTT 


10020 


CATTTTATTC 


TCTAGTTTCT 


TCATTTTTTT 


AGAATTATTA 


GAATAAAAGA 


AAATCATAAA 


10080 


TAGTATAAAT 


CCTAGTACCC 


ACATTATAAC 


TCCTTTCTGC 


TTCCTATTTC 


TTAACTTGAA 


10140 


TTCATTGTAA 


CATATCTTTT 


TCTTTTTGAC 


AAGTATAGTT 


GTCAAAAAAA 


TTATGATTTT 


10200 


TGTCATTTTG 


CAAAAGAAAA 


AGGTCAGGAG 


TAGCTTCCTG 


ACCACTTTAT 


CTATCATTAA 


10260 


TACTCTTCTA 


AAATCTCTTC 


AAACCACGTC 


AGCTTCACCT 


TGCCGTAGGT 


ATGGTTACTG 


10320 
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ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG AGCTGACTTC CTCAGTTCTA 10380 

TCCACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTCTATCCAC AACCTCAAAA 10440 

CCATGTTTTG AGCTGACTTC CTCAGT TCT A TCCACAACCT CAAAACAGTG TTTTGAGCAA 10500 

CCTGCGGCTA CCTTCCTAGT TTGCTCTTTG ATTTTTATTG AGTATAAAAT CCTACTTTTT 10560 

CAAAGATTTC TGAGAAGTTT TGGCTGATTG TCTCAACTGA CACTTCCACT TCTTCTCGGG 10620 

TTTGGTTGTT CTTGACCGTC ACTTCTCCGC TTTCGACTTC GCTCTCTCCT AGGGTGATGA 10680 

CGGTCTTAGC CGCAAAGACA TCGGCTGACT TGAACTGAGC TTTTA G TTTA CGGTTGAGGT 10740 

AATCACGCTC TGCTTTGAAA CCTTGTTGGC GAACAGCCTG TACCAATTCC AAGGCCTTGA 10800 

TATTTGCCCC TTCGCCCAAG ACTGCGATAT AGACATCTAG GGCGTTTTCG ATAGGGAGGG 10860 

TCACACCTTG CTTTTCAAGG ATGAGAAGCA GGCGCTCTAC ACCAAGTCCA AAACCAAATC 10920 

CAGCAGTTTC AGGGCCTCCA AAGTAAGCAA CCAAACCATC GT AG CG A CCA CCCGCACAGA 10980 

CGGTCAGGTC ATTGCCCTCA ATCTCTGTGA TAAACTCGAA AATGGTGTGG TTGTAGTAGT 11040 

CCAGACCACG CACCATATTG GTATCGATGA TGTAATCTAC TCCAAGATTT TCCAACATCT 11100 

GACGCACAGC ATCAAAATGA GCTTGGCTTT CTTCATCAAG AAAGTCCAAC ATACACGGCC 11160 

CATTCTCTAC TGCCACCTTG TCTTCTTTTT CCTTAGAGTC CAAGACACGA AGAGGATTTT 11220 

CCTCCAACCG ACCTTGGCTA TCCTTAGACA AGGTCTCCTT CAGCCGTGTC AAATAGTCAA 11280 

TCAAGCCTTG GCGGTAGGCT GCACCGCTCT CAGGATTTCC AAGAGTCTTG AGGTGCAATT 11340 

TGACACCTTG AATACCGATT TCCTTCAAAA AATGGGCTGC CATACCGATT GTTTCCACAT 11400 

CGCTAGCTGG ATTGCTAGAG CCAAAACACT CAACACCAAT CTGGTGGAAT TGGCGCAAGC 11460 

GCCCTGCCTG TGGACGCTCA TAACGGAACA TACCTCCCAT GTAGTAGAAC TT G CTTCGCT 11520 

TTTGCACTTC TGGCGCGAAA AGTTTATTTT CCACATAGCA ACGGACAACG GGTGCAGTTC 11580 

CTTCTCGACG CAGGGTAATA TGACGGTCAC CCTTGTCATA AAAATCCTAC ATTTCCTTGG 11640 

TTACGATATC CGTTGTATCT CCGACAGAGC GACTGATAAC CTCGTAATGC TCAAAAATAG 11700 

GCGTGCGCAC TTCTGCATAG TTGTAGCCTT TGAAAATCTC ACGGGCAAAG CCCTCAACCT 117 60 

ACTGCCACTT AGCAGACTCA GCAGCTAAAA TATCCTCCCT TCCTTTTGGT TTTTGTAATT 11820 

TCATAGGGAA TCCTCTTTAA ACTTAATAGT CTTATTT T AC CATAAATAGA GGGATTAAAA 11880 

CAGTAAGAAA AAAATTAGGA TTTAGATATC ATTTTTGAGA TTAAGAATTG TCAAAAAAAT 11940 

AGCTACCAAG GAAAGACCAA CAAATAGCAT CCAAGTCAAC TGTATATTCC ATACGGCTAC 12000 

TAGTGAAAAA CAAGCTGTTC CCACACGTAT GGATAAGCTA AACAATAGAC CTAAAAAATT 12060 

ACTAGTACGA GCTAGAACCT CTGGAGCTAG ATTTTTCATG AGCATGGCAC TAATCTTTGG 12120 
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TTGAACTTTA CCAGACACAT ACAGAGTAAA GAAGAGAAAT AGCAAACCAA GCACGACTTG 12180 

ATTGAATAAA TTACCCAAAC CAACTAGACT AAGTCCTACG CTCTCCCACA TCATCAATCT 12240 

AGGCAAGGAC TGCTTCCCAA AATAATCATT GCCCGTAAGG CTACTGATGA TGACTGATAC 12300 

TAAAACACAC AATTGATTGA TAAATACTGC CTCTGTATAA CAAAAATTCA AGAGAGAATC 12360 

GCTCAAAAAG AAGATATTAT AAATTCCACC CAAAGCGCCA CCCAAGCAAT TAATAACCAA 12420 

GACAGCAAAG AGCATAAAAC CAAAGTTTTT CTGTCCACTT TTAAGAAAAA CGAGACCTAA 12480 

ATTTCGGTAA ATTGTTAGGA ACTGGTCTTT GATAGAAAGC TTCTCATTTT TTAAGTTTTC 12540 

ACCATCAGCA GATGACATTG ACAGGCTCAA TTTGCTTTTT CCTAAAAAGA GGATAGTGGC 12600 

TGATACTAGG AAAAAGCAGG CATTGATTCC CGCAACGAGA GAAAAATTGT TCACCGATAG 12660 

AGCTAAGAGC CAGACTCCGA AAGCTTGACC ACCAATAGCT CAAATATAGC TGATGAACTG 12720 

TGAAAAACAA TAAGCCTCCA TCAGATCATC TTCAGCTACT TTTTCCTTAA TAAGAGGCAT 12780 

ACGCAGGCCA CCTGCAAAAT CACTGATGAT ATCACTAATG ACATTGATCA AACACAGCCT 12840 

AGAAAAGGCA AAGAGACTAC CTTGCTGAAC AACTAGGGCT GCTAGAAAAA ATAGAACCCC 12900 

CTGAAACAAA CCGCTATAGA CCATCCATTT GACCTTGTCC CTCGTGTAAT CTGCCCGAAT 12 960 

CCCTGCAAAA ACTGTAAAGA GGGTCGGAAG AATCATCACA ATATTCGCCA TAGCAACACC 13 020 

AAAAGATGCT TGTGACAAGG TCGATGCATA GACGATAAAG ACCAGCTTGA AAATCGAAAC 13080 

ACCAAAAGCA TTGAAGAAGC GTGG 13104 

(2) INFORMATION FOR SEQ ID NO: 35: 

<i> SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 19250 base pairs 
(B) TYPE: nucleic acid 
<C) STRAND EONESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCGGGCAAAT ACTTTTGAAC TTTTCATCAT TTTCTCCTTT AAAACTTTCT 
GACTCTTTTC AGAAACTTGT CAACAGAATT TTCAGAATTT TTGAAAATTA 
AACATCTTTG CAAAAAATAT GAATATCGTA AGCGCGTCAT AACAAGGTAT 
TGGAGCTCCT CCTGTATACT ATTACTAAAG TAAATATTGG AGGATATTTT 
CCTATTGTTC CTCTAGAGAT TCCACAATCT CGTCGTTTTG ATTCTAAAAA 
ATTCTrCTTA AAATTCGTAT TGGCAAGCTT GAAGTAAGTT TTTTTCAATC 



CTCCATTATA 
TTTTTCAAAC 
CTATCATTCA 
AATGCCACAA 
GAGAAATGAT 
TCTCAATCTC 



60 
120 
180 
240 
300 
360 
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GAAATGATAG 


AACAGCTTTT 


GGATAAGGTG 


TTGCTCTATG 


ACAATTCATC 


TATCTAGCCT 


420 


AGGGCAGGTC 


TATCTCGTGT 


GTGGGAAAAC 


TGATATGAGA 


CAAGCAATCC 


ATTCACTGGC 


4 80 


TTATCTCGTT 


AAAACCCACT 


TTGAATTGGA 


TCCTTTCTCC 


GGTCAAATCT 


TTCTCTTTTG 


S40 


TGGTCGACGT 


AAAGACCGCT 


TTAAAGTCCT 


TTACTGGGAT 


GCTCAAGGAT 


TTTGGCTACT 


600 


ATATAAACCC 


TTTGAGAACG 


GCAGACTGAC 


TTGGCCCAGT 


ACACAAAAGG 


ATGTCAAAGC 


660 


TCTCGCACCT 


CAACAAGTAG 


ATTGGCTGAT 


GAAAGGCTTT 


TCTATCACTC 


CAAAAATATA 


720 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCTG 


CTTCTAAAAC 


ATTGTTAGAA 


ATCGATTTTA 


780 


CTGTCCTGAT 


CGATTTGTCC 


TGTTATTATT 


TCATTTTACT 


ATAAATCCAT 


CAGAAAGTCG 


840 


TGATTTCTAT 


TGAAATGAGG 


ACTTTCTTTT 


TATACTCATC 


TGCTTTCAAA 


AAGCACTCTA 


900 


GTCCATCTCC 


GATTAACGAT 


GGACTTTATC 


ACCTCCTTCT 


CCAGTCCTTC 


TATAACATCT 


960 


TGAAGTTGAT 


TCATGACATC 


TTCCAAAGTT 


CGAAAGGCTT 


T A TTCTT AAA 


TCCACGTTTA 


1020 


CGAATCTCTT 


TCCACACTTC 


TTCAATGGGG 


TTCATCTCTG 


GTCTGTATGC 


AGGAATAAAT 


1080 


GCAAAGCCAA 


TATTACTCGG 


AATCTTTAAG 


GTACTTGATT 


TATCCCATAT 


AGCATTGTCC 


1140 


ATAACGACTA 


AAAGATAATC 


ATCTGGATAA 


GCTTCTGAAA 


GCTCCTATTC 


CTAAAGCCCC 


1200 


TTTATAACCT 


CTTGCGAGAG 


ACACTATTGA 


CTCAGCCCTT 


ACTTCATGCG 


GATGAAACCT 


1260 


CCTATCGGCT 


TCTAGAGAGT 


GATAGCCATC 


TGACCTACTA 


TTGGACTTTT 


TTGTCAGGTA 


1320 


AAGCAGAGAA 


ACAAGGGATT 


ACCCTTTACC 


ACCATGATCA 


GTCTCGAAGT 


GGTTCAGTAG 


1380 


TACAAGAATT 


CCTAGGAGAT 


TATTCTGGCT 


ATGTTCATTG 


TGATATGTTG 


CGGCAGTAAC 


1440 


TTAGGACTTT 


AGTCCTCTAG 


TTCTCCCTAT 


GCGATAGCAG 


TCCAAGGTTT 


AGGAGTAAGG 


1S00 


CGACGCTAAG 


CTTGGTAAAC 


TGCGAACAGC 


TAGAAGCTTA 


TCGTCAACTG 


GAAGAAGCTG 


1S60 


CACTTGTTGG 


ATGTTGGGCG 


CATGTGAGAA 


GGAAGTTTTT 


TGAAGTCCCC 


CCCAAGCAAG 


1620 


CAGATAAATC 


ATCCTTAGGA 


GCTAAACCTT 


TACCCTATTG 


TGATCAGTTA 


TTTTCCTTGG 


1680 


AAAGAGACTG 


GGAGGCTTTG 


CCAGCTGATG 


AACGGCTACA 


GAAACCTCAA 


GAACATCTCC 


1740 


AACCCCTACT GGAAGACTTC 


TTTGCTTGGT 


GCCGTCGTCA 


GTCAGTTTTA 


TCGGCTTCAA 


1800 


AACTAGGAAG 


GGCAATTGAA 


TACAGCCTCA 


AGTATGAAGA 


AACCTTTAAG 


ACCATTTTAA 


1860 


AAGACGGACA 


TCTGGTCCTT 


TCCAATAATC 


TAGCTGAACG 


CGCCATTAAA 


TCATTGGTTA 


1920 


TGGCACGCAG 


TAAAAGAGTC 


CAGTGGACTC 


TTTTAGCCTA 


AGCTCAGTTT 


AAAAAAACGA 


1980 


GGGTGGTTAT 


TTTTAAAAAA 


GCGAGGGTGG 


TTATTTTCTC 


AAAGTTTTGA 


AGGAGCTAAA 


2040 


GCAACAGCTA 


TTATTATGAG 


TTTGTTGGAA 


ACAGCTAAAC 


GTCATCAATT 


ATAGTGCGTT 


2100 


GAATCTATAA 


CAGTACGCAT 


CGACTGCTAA 


AATATTTCTA 


TAAATCAATT 


TTCCTTTCCT 


2160 
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AATCGATTTC 


TTCATATLTT 


tt nNfi k ft ft *T*^^ 

ATTACAA1*. 


ATT ATAAAT A 

• » •» * *™ftf™w™% * 


GCGAGAAATA 


TCTATCCTAT 


2220 


CTTCTAGAAT 


GTCTTCCAAA 




rTCGTAAACA 


AAGAGGTTTT 


AGAGGCCTAT 


2280 


TTACCGTGGA 


CTAAAGTTGT 


K m m ft ft » ft ^ 

ACAAGAAAAb 


TY^ P AAAT AAG 


AAATCTCCAG 


ATTAGGAACT 


2340 


ATATATGACT 


TCTCTAGTCT 


GGAGATTTTT 


TAATAGACTT 
LAAIAUM** * 


CGTTATTGGG 


CGGTTACTTT 


2400 


CCAAACTTTC 


AAAACTTCAA 


AAAACGGATT 


TTT ATCGCTC 


TGAACATCAA 


AAAAGAAAGG 


2460 


ACCAAATTTC 


TCCTTTCTCA 


AGCTTAGCTT 




fACTACACTT 


GACAAAGAGC 


2S20 


CCTTTATTCT 


ATCAAACATG 


AAGCGCAAAA 


ACAAGCWAAA 


AATCPC5ATAG 


AATGGCTATC 


2580 


CCTCGACTAT 


CAAGTAAGAC 


ATTTCCATCA 


AATACGTTL A 


A i i 1 i «L I ^ * 




2640 


AGAATTAATC 


ATCTCGTTTT 


GATTTATTAA 


AAATATAC.AA 


1 i vAuv 1 1 1 A 


CCTCCAAACT 


2700 


ATTTTATCCA 


CTATCCCTGT 


ATAGCTCTGT 


ATTATCTTAA 


WAAW 111 AV* 1 


ACAGACATTT 


2760 


TCCTCAACAT 


AATCCGGAAC 


CGGTAATCCA 


AAATCCTCAT 


LI lu 1 OH.AA 




2820 


CTTTCAACTG 


CTTGAAGAAG 


AGAATTTTCA 


TCAATGCCTG 


r+r* » ft ft ft ^ iv ft ft 
LUAAAAl AAA 


TTCTCCCTTA 


2880 


TCTAAGGACT 


CAGGACGTTC 


TGTACTTGTA 


_n__rn. h k Jttt ft rf* ft M ft 

CGAATACATA 


CAbWUiuAnA 


AnT.ATAAfCT 
AVA#A l AAWw • 


2940 


TGACTAGTAA 


AGAAACTACT 


TTCTTCCGGT 


AAAGTTCCCG 


ft aTnnTAr 

An i LAVjA 1 tW— 


TACAACAAAT 


3000 


CCATTCATCT 


GTAAACAATT 


ATAGTCATGG 


AATCCTAGTG 


GC^ LA I lAj 


AA I V»/\W/\Ww * 


3060 


TTATCTAGTT 


TAAAACCGCT 


CTCTTGTAGC 


(.1 1 1 1L1 1 iu 


niUl nwViA 1 w 


CC AAGAATAT 


3120 


AAGATTGGCA 


TATTATACTT 


TTCAGCTAAT 


TGAriAAilvj 




AGAAATAAAA 


3X80 


TTTTTATCTG 


TATCAATATT 


TTCCTCACGG 


>TW JA ^^^f* & ft & 

TCAGCTCAAA 


r^T« JVAG AT AT A 


ACCTCCTTTT 


3240 


TTCAATCCCA 


AACGTTCATG 


GATATCTGAA 


GACTCAATAl* 


Lnun * AAA A * 


TTTATGTAAC 


3300 


ACTTCTGCCA 


TAGGACAACC 


ft » ft. wfi ft 

ACTTACATAT 




TACGTAAACC 


AC ACTC ATGT 


3360 


AAATACTTAC 


GTGCATGTTC 


AGAGTATGCT 




CTGAAATAAC 


ATCAACAATC 


3420 


CGACGATTAG 


TCTCTTCCGG 


TAGGCACTCA 


TCTTTACAGC 


GATTGCCAGC 


\_ 1 i A i v*#\ 


3480 


AAAATTGGAA TATGTAAACG 


CTTGGCAGCA 


ATAGCTGATA 


AACAAGAATT 


TGTATCCCCT 


3S40 


AAAATCAATA AACCATCTGG 


TTTAATTTGA 


TTCATCAATT 


TGTATGAAGT 


ATTAATAATA 


3600 


TTCCCTACAG 


TAGCACCAAG 


ATCATCTCCA 


ACAGCATCCA 


TGTATACGTC 


CGGAGTGTCT 


3660 


AACCCTAAAT 


TATCAAAGAA 


AATACCATTT 


AAATTCTAAT 


CATAGTTTTG 


TCCAGTATGT 


3720 


GCCAAAATAA 


CATCAAAATA 


CTTTCGACAT 


TTAGTGATAA 


CACTACTTAG 


ACCTATAATC 


3780 


TCTGGACGTG 


TTCCCACAAT 


AATCAATAAC 


TTAAGTTTGC 


CATTATCTTT 


AAAGTCAATA 


3840 


TCACTATAAT 


CTGTCTTAAT 


TTTCATTTAT 


TTCTCCACTT 


GTTCAAAAAA 


AGTATCTGGA 


3900 
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TGTCTACGAT 


CAAATGACTC 


ATTAGCCCAC 


ATGACAGTAA 


TTAGATTTTC 


TGTATCAGAA 


3960 


ACATTAATAA 


TATTATGTGC 


ATAGCCCGGT 


ATCATATGTA 


TTGCTTCAAT 


CTTATCCCCC 


4020 


GACACTTCAA 


AGTTCAGAAT 


AGGATACTCT 


TGACCGTTTT 


CATCCAGCCC 


TATCCTACGC 


4080 


TCTTGTATTA 


AAGCACGACC 


AGAAACAACC 


ATCAAAAATT 


CCCACTTAGA 


ATGATGCCAA 


4140 


TGTTGCCCTT 


TGGTAATGCC 


AGCTTTAGAA 


ATATTAACAG 


AJWATTGACC 


CGTATTTTCT 


4200 


GTTTTTAATA 


ATTCCGTAAA 


ACTACCTCGT 


TCATCTATAT 


TCATTTTTAG 


AGGAAACTTA 


4260 


AACTTATCTA 


CTGGTAAATA 


AGATAGGTAG 


GTAGAATACA 


ATTTCTTTTT 


AAACGATCCC 


4320 


TGAGGAATTT 


CAGGCATAAC 


TAAACTATCA 


GGCTGTTTTT 


TAAATGTTTC 


TAATAGAGAG 


4380 


ACAATCTCTC 


CTAAGGTTGC 


ACGATGAGTC 


GTTGGTACGT 


AGCAGTAGTT 


TCCTGATGGG 


4440 


CTAGGTAAGA 


TTTGTAATCC 


ATCTAGATTA 


CAACGATGAG 


GATTTCCTTC 


CAATGCAGTT 


4S00 


AGACACTCTT 


CTATCAAATC 


ATCAATATAC 


AGCAACTCCA 


ATTCTACACT 


TGGATCATTT 


4560 


ACTTGAATAG 


GTAAATCGTG 


ACCTAGATTA 


TAACAGAAAG 


TTGCTACAGC 


AGAATTGTAG 


4620 


TTAGGACGCC 


ACCACTTCCC 


ATAAAGATTC 


GGGAAACGGT 


AAACTAAGAC 


AGGTGCTCCC 


4680 


CTTTTCTTTC 


CATATTCAAA 


GAAGAGTTCT 


TCCCCTGCTA 


GCTTAGATTG 


TCCATATATA 


4740 


GAGTTTGAAA 


ATCGGCCTTC 


TAAACTAGCT 


TGACTACAAC 


TTGAGAGTAG 


AACAGGACAA 


4800 


GTGTTTTCAT 


ACTTTTCTAA 


AATCTCCAAT 


AATCTACTTG 


AAAAACCCTA 


ATTTCCCTCC 


4860 


ATG AATT CAT 


CAGGATTCTG 


TGGACGATTG 


ACACCAGCTA 


AATCGAATAC 


CAAATCGGCC 


4920 


TTCTTACAAT 


ATTCATCTAA 


TAAAATCGGA 


TCTCTATCAC 


GATCATACTG 


AAAAATCTCT 


4980 


CCAATCTCTA 


AATT AGGACG 


AGTCCTATCT 


CGTCCATCTT 


TCAAACCTTC 


CAGACTACAG 


5040 


ATAAGATTTT 


TTCCTACAAA 


TCCTTTCGCT 


CCTGTGATTA 


AAATATTTTT 


AATCATGCCC 


5X00 


CCTCCTTATT 


TTATATGCTG 


TTTTAATAGT 


TAACTCTCTC 


GACAATACAT 


GATACATTAT 


5160 


ATATCCTTGA 


TAATTTTAAT 


GTATCTTAAA 


AGATTTTACA 


TCTCTTCGTC 


TGCTACCATA 


5220 


TCACCAATTG 




TTCATCTAAT 


TCTAGCAACT 


TTCTTTTAAC 


TTGCTCTACA 


5280 


TCCATCAAAT 


CGGTATTATT 


ACTATTGAAT 


TCTGTCAACA 


AATTTCTATT 


CGTACTACCA 


5340 


TCTTTGAAAT 


ACTTATCATA 


GTTAAGATTA 


CGATTATCAC 


TAGGAACTCT 


ATAAAAATCA 


5400 


CCCAAATCAA 


TTGCATTTCC 


GCACTCTTCG 


TTAGTTAATA 


GTGTTTCATA 




5460 


CCGTGTCTAA 


TACCTATAAT 


CTTAATATCT 


TCTTCTGAGG 


CAAAAATTTC 


TGATACAGCC 


5520 


TTAGCCAACA 


CTTCAATCGT 


ACATGCTGGT 


GCTTTCTGAA 


CTAGTATATC 


TCCAGATTTC 


5560 


CCTTCTTCAA 


ATGCAAATAA 


AACCAAGTCT 


ACTGCTTCTT 


CCAATGTCAT 


CACAAAACGT 


5640 


GTCATGCTAG 


GTTCAGTAAT 


TGTAAGAGCA 


TTTCCTTGCT 


TAATTTGCTC 


AATCCAAAGA 


5700 
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GGAACGACAG 


ATCCACGGCT 


ACACAGAACA 


TTCCCATAtit- 




1 r\ A ^ ft * ft w * « 


5760 


TGCTCAGGAT 


TTACCGTCCT 


GGACTTAGCA 


ACAGCAATCT 


TTTCCATCAT 


ft.flCf*TTGGAT 


S820 


GTTCC CAT AC 


CATTGACAGG 


ATAAGCCCCC 


TTATCTGTAG 


AAAGACAGAT 


1APTTWTTT 

M/\^ ft ft * ft * 


5880 


ACACCAGCTT 


CGATAGCCGC 


AGTGAGGACA 


TTCTCCCTTC 


CCAAAATGTT 


nu I X 1 1 ft /%^W 


5940 


CCTTCT AC AG 


GGAAAAATTC 


ACAAGAAGGT 


ACTTGTTTAA 


GAGCAGCAGC 


pfrh » » k ft£" & 


6000 


TAATCCACAC 


CATGCATAGC 


ATTTTTT ACC 


GAAGCTAAGT 


CACGCACATC 




6060 


AAACGGATTT 


TCCCAGCCAC 


TTCTGGTACT 


TTTACCTGAA 


ACTCATGACG 




6 120 


TGTTTCTTTT 


CATCTCGCGA 


AAATATACGA 


ATCTCTGAGA 


CATCTGTTTC 




6 180 


TTGAGAACCG 


CATTCCCAAA 


TGAACCTGTC 


CCTCCTGTAA TTAGGAGAGT 


TTTTCCTGTA 




AATTGTGACA 


TATATTACAC 


TTCTCCTTCT 


AGTATGTCTG 


CAA'l"l 1 Iwl 1 


ft ft ft ^*f+f*^**T+ff* 

ACAAGCCGTT 


OJUU 


CCATCTCCAT 


ATGGATTTGA 


AGCTTGACTC 


ATTGCTTGAT 


AAACTuAAlA. 


ATTTTCTAAT 


Q JOU 


AATTCTTTAA 


AATGCCTATA 


AATATTATTT 


ft jw_n ft r+ ft ^ 

TCATCAGCAC 


CTACAAoTTT 


^ ft ft ft ^^fP*^^^^^^#f% 

CAAAGTCCCT 


OVftU 


GCTTCAATTC 


CCTCTGGACG 


TTCAGTTGTA 


TCTCTCATAA 


CCAAAACAGG 


TTTTCCTAAA 


o4ou 


CTTGGAGCCT 


CTTCCTGAAT 


ACCACCACTA 


TCTGTTAAAA 


TTAAATAACT 


TC" I GATAAA 


CCA ft 


AAATTGTGAA 


AATCTAATAC 


TTCTAAAGGT 


TCGATCATCT 


TGATACCTTC 


ACAGCCAvlT 


6600 


AcrrrcTTC CT 


CAGCAATTTG 


GCGAACACGA 


GGATTCATAT 


CGATAGGATA 


AATAGCCTTG 


O DDU 


ACATCTGAAT 


ATTCTTCAAT 


AATCCTTCTA 


ATTGCTCTAA 


ACATATGTCT 


CATtWil lw« 


6720 


CCAAGATTTT 


CACGACGATG 


AGCTGTAATT 


AGAATAAACC 


TGCTTTCTCC 


ini 1 I V. 1 


6780 


AACTCAGGAT 


GCCTATAGTC 


CTCTTGAATT 


GTAGTTTGTA 


AAGCATCAAT 


^wwvul Al ft f\ 


6840 


CCTGTCACAA 


ATATGCTCTC 




crrrcrcrrA. 


AAACATTATC 


i i • .VJrtArtW 1 


6900 


TGTGTTGGTG 


TAAAATGATA 


CTGACCCAAA 


ACCCCAACTG 


CTTGACGATT 


ft ft ftfTPTTf* A 

4\/V\l_ ft W ft ft 


6960 


GGATATGGTG 


AATAGATATC 


GTAACTGCGC 


AAACCACCTT 


CAACATGACC 


AATTGGAATC 


7020 


TCTAAATAAA 


AGGCCGCCAG 


TGAACTAGCG 


AAGGTCGTAC 


TTGTATCCCC 


ATGAACTAAC 


7080 


ACCAAATCAG 


GTTTTTCTGA 


CTCTAAAATA 


GCCTTCATTC 


CTTCCAAAAT 


CCCAATGGTC 


7140 


AC ATCAAAT A 


AAGTTTCTTT 


ATCTTTCATA 


ATAGACAAAT 


CAAAATCCGG 


AATAATCCCA 


7200 


AATGTCTCCA 


AGACCTGATC 


CAACATTTGA 


CGGTGTTGGC 


CCGTAACCCA 


AACTAATCTT 


7260 


TCAATATTCT 


TACGTGTTCT 


TAACTCTTTG 


ACCAAAGGAC 


ACATCTTGAT 


GGCTTCTGGA 


7320 


CGAGTTCCAA 


ATACTACAAC 


TACTTTTTTC 


ATATATTTAC 


TTACTCCTAA 


CAAATAATGA 


7380 


ACGGTTCTTA 


AAATAAATTA 


GATAACGGCT 


AATCCATAAC 


ACCACCTCAG 


ACATACTTGA 


7440 
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ACAAATAGCT 


AATGTTACTA 


AACTAAAATT 


ATCAGACAAG 


ATAAATATTC 


CTAATCCCAA 


7500 


ACTTTGCACA 


ATCGAAGCTA 


ATATACTTGT 


CATTGTAGTT 


TCTTTCACTT 


TATCAATAGC 


7560 


TCCTAACACA 


GGCCATCCGT 


AAATCATAGA 


ATAAAAACTA 


GCAACAAAAG 


CGGGTAATAA 


7620 


GTACTTAAGA 


AAATCTGCTG 


AAACGGTATA 


TTTTTCACCA 


CCAATTATAG 


AAAGAATTTG 


7660 


ATTTGAAAAG 


AATAAAACTA 


TCAAAACTCC 


AAAGATAATA 


GGAATAAACA 


TAATCCGATT 


7740 


AAT ACT CTT A 


ACCGATTGTA 


TATCTTTAGT 


ACGTATCATA 


TGCGGATATA 


AACTATTCGC 


7800 


TATAGGATTA 


TACAATGATT 


TTCCTGCTGA 


AAGCAGTTGC 


ATTCCTATCC 


CCCAAAAGGC 


7860 


TATCTCTTGA 


CTTTGTAAAT 


AAAAACCCGA 


AATGACTGTC 


GTAAAGACGC 


CAAAAATAGT 


7920 


AGTTGCAAAA 


TTCGATAAAA 


AATAAATAGA 


GGATTCCTTT 


AAATCTTTAA 


CCCAAACAGA 


7980 


CAGATAAGAA 


AATGATAATT 


TAATTCCATA 


ATAATGAAGG 


AATCTATAAG 


AAACTACTCC 


8040 


AGCAACTAAA 


TTCCCAATTC 


CTTCCAATAT 


AGCAATCCAT 


AAAATAGAAG 


AATCATCTTT 


8100 


TACTACAATA 


AATGTCAAAA 


TTGTAATGAT 


AGTTTTAGAA 


ATAATATAAC 


GAATTGCAAC 


8160 


TGCATGCATC 


TTTTCAATTC 


CACGAAATAA 


AAAGTCAAAG 


ATAAAAATAT 


TGGTCACTGT 


8220 


AGCTAACAAA 


TAAAAAACTG 


AAAAAAGAAT 


ATTCTCTCTC 


ATTATTGGGA 


TTTGCCACAT 


8280 


CAATATGGTG 


TAAATTAGAA 


TCGAAATGAT 


AG AT AAAAAT 


ATTTTTTCAA 


CTACAGTATC 


8340 


TCCAACTATC 


CTTCCAATCT 


TTGAGGGACT 


ACT AC AAGC A 


TTTACAATAT 


TTTTTCTAGC 


8400 


TCATATCATG 


AAACCAAAAT 


CAATCACCAG 


TTGAACATAA 


GCTATTAACG 


CTTTAACATA 


8460 


AATAACCATT 


CCATACGCGT 


CTAGCGAAAG 


CACCCTTGTC 


AAATACGGGA 


GTCTTAATAA 


8520 


AGCAAATAGT 


AATTTAACAA 


TATTCAGAAT 


ATAGAGAGAA 


CTTGTATTTT 


TTATAAATGA 


8580 


AATTCTATCA 


ACTTTCACGA 


ACTAGTCCTT 


CCAAAAAAAG 


ATCTAAATAG 


TCCAAACTAC 


8640 


TTCTCGCTTT 


CAACACCAAT 


TCTGAAGGTA 


TTGTTATCCG 


TTTTAGATGA 


AAAGTTTCAA 


8700 


GTTTCTTTAC 


AATACTATTA 


ACACTTGAAT 


CAAATAAACA 


TTCACAACGT 


TGTAACTCTC 


8760 


CAATTGCTCC 


ATAATAACGT 


GCTGTTTTTT 


CTGCATGGCA TGCAATGGCA 


ATC AC AGAT T 


8820 


TATTAAAACA 


TGTTGCCACT 


ACCCCAACAT 


GTAATTTACA 


AGTTAAAACC 


ACATCTACCA 


4880 


TTTTCAACAA 


TCATCTCATT 


TCTGCAGGAG 


AATGATACTT 


GAATTGAAAA 


CAATCCTCAG 


8940 


TTCTAACTAA 


TTTTCTAAAT 


TCCTGATAAT 


AAGC ATCTT C 


ATAAGGTAGA 


ATGGAATCCG 


9000 


AAGTTACTAC 


AACATAATAG 


TTAGGATTGT 


TTTCTAGAAA 


AAGACTAATT 


GATTCCGCAA 


9060 


ATTTTTCAAG 


AGCTTTTTTG 


GAATGATTAT 


AGTGAACAAG 


AATTATCTTC 


TTATCTTTAG 


9120 


CTTCTCTTTT 


CAATTGACAC 


AGCTGCTCTG 


TTTTTTCTTC 


TCTTAATTTA 


CTTGAAATAA 


9180 


TTAAATCAAA 


GGTTTCATGC 


ACTGGAGCCG 


AAGGCGACAA 


ATGCTTCAAA 


GAATCAAATG 


9240 
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ATTCTCGATC 


ACGAACTGTA 


ATAAATTCAG 


CATGATTAAT AATTCTCTTT 


ATALUi 1 #U% 1 


9300 


TCATCAAAGA 


ATCCTTATTA 


GGCCCTGCAC 


CAATACCTAA TACTCCTATA 


GGCTTTTTAA 


9360 


AATATGAAGC 


CCAAATTCCC 


AAAGGTAAAA 


ATCGTTTAAA TTGGATTAAA 


TTA 1 CnUinn 


9420 


AACCTGCATT 


ATGCCCTTCC 


CCAAAATATC 


CTCCCGGGAT ATACAAAATA 


CCATvi\jV.iT 




GTTTTTTAGT 


AAAACTTTGT 


TTTTGCCGAT 


ATTCTTTCAA GTACATTTGA 


AAGAAATVlu 


7J*U 


ATGGATTATA 


AAAAGAAACT 


T CAT ATCCTT 


TACATTCTAA 


TAAATCATAG 


ACAATCTCAC 


7 ouu 


CGTAAAGATA 


ATCACCGTAA 


TTACTTGAAC 


CATAATCCGT TGCACCATGT 


AACATAATTT 


7000 


TTTTCACCAC 


TATTITTTCA 


ACCTCCTAAA 


AATAAATATC 


ATAATCAAAC 


TATACATAAT 




AGGACGATAA 


ACATCTATTG 


AACTACTTCT 


CACTAAAAGC 


AATAGTTGAG 


AAATTACCGA 


OT a A 

97SO 


AAAATAAATA 


ACTTTTGAGA TTTTACTTCT 


TTGAAAAGCT 


CTGAAATTTA 


ATCGCCATCC 


9840 


ACTAAATATT 


CCCAAAACAA 


AACTCCAAAA 


AACACCACCA 


TAGTAACCAA 


ACTTCCAAAA 


9900 


TAATTCTTCC 


ACAAAAGAAG 


AGCCTACAGG 


TAACCCCAAA 


AATTTATTAA 


TAACAACCGT 


9960 


CGCTGATGCT 


TTATCAAAAA 


AATCACCAAC 


TAACCATCCA 


ATAGGAAAAA 


TTG AT AGGAT 


10020 


A GTGCCT AG A 


AATGTCATCC 


CATATTCATA 


TGGAATGCTA 


CTAGCCACAA 


CAGTTACAGC 


1 0080 


AGAAGCTACT 


CTTAGGCTGC 


TCAGTCCCGA 


CTCTGAAAAT 


ACTTCCCCTA 


CTATATTCTT 


10140 


TACAAAATCT 


AATGAAGAAA 


AGGAATCAAA 


TAAGTATATA 


CCTATAGTAT 


TCAAGTCGAA 


10200 


ACGGTGCCCC 


CTAATAACAA 


CTAATACATT 


TAATAGAAAT 


ACACTTACTA 


TTAAAAATAC 


10260 


AAGTACTCTT 


TTCTTCGAAA 


AAGTAATCCC 


TAAAGATTGT 


GTGTATACTA 


AAACCAACGC 


10320 


CAAGATTGAA 


AACACCTGGA 


TTTTACGACT 


TCCTGTTAGG 


ATCATTATCA 


AAA TT AGO I A 




AAACAACATT 


ACCCAAAAAA 


TAGTACGCTT 


TATAACTCGG 


GACAGCTTAT 


CTGAATAAAA 




CAAGGAGAAC 


ACACCAGGAA 


GCATAAGTAC 


TCCTAAATCA 


TCTATTATTC 


CTGAACTAGv. 


i ne aa 


TCCCTCTGAA 


TATGCTGAAT 


AGCTATTCGC 


CGCTCTAACT 


GCTAGTACTC 


TTTTAGAATC 


10S60 


AGTTATTACC 


CTAGAAATAA 


AGCCCACTCC 


TGTTAAAATC 


CTACCCGCAT 


TGTACAAAAT 


10620 


TTTCTCTTCA 


TTTTCCTGAT 


AATTTTGTAC 


TTCTGAATGA 


TAATGTACCT 


TTCCATCACT 


10680 


ATAAAAAAAT 


AAATAGCCTA 


CAGAATAACA 


AAACAAAATC 


CAAATTATAA 


AAATATATGA 


10740 


ATGAAATAAT 


TCTTCATTAT 


TATAGAAGTT 


ACTAGGGCTC 


CACAGCAGAG 


TTGTTTGAAA 


10BOO 


CCCCATATAC 


TCATTGAAAA 


TTAATCCAAA 


CATAAAAAAA 


TAAGATAAAA 


TCAGATACCA 


10860 


TACAGAAAAA 


TCATATATAC 


TAACTTTTTG 


TAAAATAAAA 


CCAGTAATTT 


GAAAAATAAT 


10920 


TAGAAAGCAA 


ACCCATATAA 


ATATAGACGG 


AACATAATTA 


GATATAAGAA 


AACCATTATT 


10980 
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CCAATTATCG 


AGAGTCCAGA 


ACAAGTAACA 


GAAAGCAAAT 


ATAAAACTTA 


ATGTCACTAG 


11040 


TGTCACTCTA 


CAAATATACT 


TTGTCTGCAT 


CTATATCTCC 


TTTATTACAC 


ACATTTCTTG 


11100 


ATAACGATTC 


AATAATTTAC 


TAGCTTGATA 


ACAAATATCA 


TAGAGTCCAT 


CTGTCATACT 


11160 


GTTATTTATT 


TCAAAACGAT 


TGCATTCCTC 


ACATGTTAAA GACAGTACTT TATCTTTCCA 


11220 


TAGCAACACA 


GACTCTTCGT 


TGATAGGTAA 


GTAACTAATG 


TTTTTGGTCA 


CATCTACTTC 


11280 


TTGCGTCACT 


GTATCTGACG 


ATAAAATTTG 


TAATCCCGAT 


GCCTGAGCCT 


CTACTACAGA 


11340 


AACAGGCAAC 


CCCTCATATT 


TAGACGGAAG 


CAAAAAAACA 


TCCATCCCAG 


ATAATAAATC 


11400 


AGAAATATCA 


GTCCTTCTCC 


CTAAAAATAG 


CACATATGGG 


GTCAGATTTA 


GTTCTAAAGC 


11460 


TTTCTGTTTT 


AATTTCTGCT 


CATCCTCACC 


ATTACCAACT 


AGGAGTAAAA 


TAACATTTGG 


11520 


TTTGATTAAA 


ATGAGTTCTT 


TTAAAACGTT 


AAATAAATAA 


CTTTGGTTTT 


TTTGATCTGA 


11580 


TAGCCGAGCT 


ATATTTCCTA 


ATACCAACTT 


ATTTGACACA 


TCTAATTCTC 


TACGACATTT 


11640 


TTCTCTAACA 


TCTGACAAAA 


ATTCATACTT 


TTTCAAATCA 


ATTGCATTAA 


AAATAATTTC 


11700 


AATTTTTCCG 


TCTTTATACG 


CTTTCTCTCC 


ATATAACCAC 


TTACCCGAAT 


CTTCCCCACA 


11760 


TGCAAACCAA 


TGAGTTGCTA 


AGATTTTTAC 


CAAAATTGTT 


ACTAATTTAC 


GCAATACTTT 


11820 


TTGAAAACTG 


TTTTCTGTTA 


CATAAGCCAT 


ATGACTATGA 


ATAATTCTAA 


TTTTACAACC 


11880 


AATTATTTTA 


GATAAGATCA 


GACCAATTGC 


AG ATTT AT AG 


CCATGGCAAT 


CAACTATATC 


11940 


ATAATCTCCT 


TTCTTTATTA 


TTCTAGCAAG 


AGAGAGAAAC 


TGATGTAGAG 


GCTTTTTCCT 


12000 


TAATACAGGC 


ACATGATAAA 


CCTTTGCACC 


CAATTCTTTC 


ATTTTATCCT 


CTAAAAATCC 


12060 


TTGTTCTTTT 


CCAGGCACAA 


TAAAATCAAA 


TTGAATTTTT 


TTTCTATCAA 


TGTGAGAATA 


12120 


ATAGTTGAAT 


AGAAAACTTT 


CTACTCCACC 


ACTATCTAGT 


CTTGTAAATA 


GATGTAATAC 


12180 


TTTAATCAT7 


CTTCTTCCTT 


AAGCTTAAGA 


TTCGCTTCTC 


TAATTCTATT 


TCTGTTTTTT 


12240 


GTTTTTCTAA 


ACTAATTCTG 


TCCATGAAGT 


TATCACAATT 


CTTAATTAGC 


TGTTTCCTGT 


12300 


CAAGGTTTTG 


AATATACAAA 


GCCAAACAAT 


CTTTTTCCGA 


TTCATCCTTC 


ATAGGTAAAA 


12360 


CGAAACCAAA 


ACCATTCTCT 


ATTGACACTT 


TTTCCATATA 


AGTATCTTCA 


CAAACTAAAA 


12420 


TAGGTTTATA 


CAACAATGCA 


GCAAAGTAGA 


GTTTATTAGA 


CAAAGCATAG 


TCTAGTAAGG 


12480 


GAGTGTGATT 


CCCGTATAAA 


TTCAAAACAA 


CATCTCTATT 


CTTATAAAAA 


GACATGGTAT 


12540 


CTTTAGGCTG 


GAATGTGTCC 


ACCAAGTTAA 


CATTGCTGAT 


ATTTTTTTCT 


TGACAAAATT 


12600 


CCCTTAATTC 


TCCTGCATTA 


GTACCTATAA 


AATTCAACTG 


AAATCGACTG 


TCATTTGCAA 


12660 


AAAAATCGAT 


TATTTTTTTA 


TTTTGTTCTT 


GAAAACGAAT 


TAAACCAATG 


TAGGAAAGTT 


12720 


CAATTGGAAA 


CGTACTATTA 


TTTTTTAACT 


GCTTTACCTC 


GTTTAATTCT 


ATCATATTGG 


12780 
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A A ^Pf* 
U 1 MUO 4 1 A 1 \J 


r:/"iT AfTT A AAA 
uw A Aw A /win 


TACTCTCCCA 

A AV> 4 ft. 4 V»W Wl 


TTGGTAAAAA 


AAATTTATAG 


CCGTCTGAAG 


12840 


AAAOOA 1A11 


fATTAAAflAA 
OA A lA/wvinn 


TT*TT*Pf* Ai^t 1 A 


ATTGTTTCTC 

w\ 4 4 w 4 4 4 V. 4 ft» 


AACCAAACGA 


TAAACCAAAA 


12900 


ATTTTTCATA 


AtluTAA rwA 


OOAA 1 A 1 vft 1 


AAATATATCT 
#vw% A A A a 1 v. 1 


ATTTTTAAAT 

r\ 4 4 4 4 4 rvft#ft 4 


CAAAACAGAA 

ftftff%^^ftr%ftr^\ftV^^ftftVrftftr\ 


12960 






VinLAUvl 1 /\V» 


TATCTAACGG 

4 A ft v A finWVV 


CAATATCATA 


TCATAATCAT 


13020 


TTTCTTTTAG 


V. A i O I I I 1 1 A 


ATTTCTTTTC 


Tf.AJ.TTTT AC 

A V#AA A A 1 A AS— 


ATAACCTAAT 


ATCPTACTT A 

■ \ 4 W ft 4 r^ft. 4 4 #^ 


13080 


ATTTTCCTTT 


m ft ^ It ft ». ft ». 

ACuAGAAAAA 


uAAA 1 ALun I 


AO 1 AO 111 Ivi 


TTT^TAATA 

A A 1 « V* A AAA A 


AAOAOOA AAA 


1 3 1 AO 


TA11L i 1 ATC 


^Vft ft m m ft nft m*L 

ooAATATATA 


AWAl Ww 1 AnL 


i AA i AvMLnVj 


^PT^^^*T^P^* A A^* 
1 1 1 IVnn A 


A A^P^Pi^^^^T A^P 
IV\ A A V. A A A #ft A 


13200 


ft i « > ■ fWTVO ft. ft 

AAAAATTGAA 


^Tftft ft ft 1 MfM|ft 

GTAAGGAGTT 


ft ^ ft qi ft, #»• A *^ A*^ 

AGATATATAT 


1 Al «.AIjA 1 Au 


TA*T*A AAPArtT 
AAA AAA^nU i 


AO 1 O A OA AAA 


1 JAOU 


AATTATTCTT 


TCTTACTTTC 


CCTCTCTAAA 


CAiuiLlVWA 


UTTvuAwv.A 1 


AAAoTGCToT 


a JJaU 


TTTGAAAAGT 


GATTTTCATA 


GTAACAACuA 


GCTTTCTTTC. 


v. A AAV. IV- lw 1 


TTGXCPCTTA 




ATAGATAACA 


TACTAAATTT 


ft ^ ft -ft ft ft fffc#IW|ft 

AoAAATATTT 


TTTGCCAATT 


GTTTTACATC 


1 OO A A ftUUUA 




«_ fk M ft Hfe ft «*%^ft 

CTAACATATC 


CACAATTTGC 


«»ftftft> i*WMM*lW ft ft 

TTCTTCTACA 


ATTATTTTAG 


CATCTCCTGA 


ft » 'v^r'.r— ft. 
AATTGtACCT 


1J3UU 


ATAATTGCTT 


TGCCTGCCGC 


^ft ft JVt ft fft ft ft ^ ft l_ 

CATATAAGAK 


TGTACCTTCC 


^ A /"^ / ft «*• ft 

CAGGTATAGT 


A<^* A^ A A A^*I* 

AoG AG AAAO I 


1 1 CCA 
1J30V 


ATCGAGTCTC 


Afft ■_ fUk ft ft ft ft 

CTATTAAAGA 


ft f_ j wii ft ft ft m ft 


GCATCTGATT 


<|X| mix; i ft /T" ft ft » 

I I I 1 A 1 AGAA 


uuniVAiLAI 1 


i icon 

1 JOAVf 


TCCTCCAAAG 


AACGTCTTCC 


ft ft ^ ft ft K 1* 

AT AG AAGG AA 


ATATTCTTTA 


Ai. 1 tLAA lift. 


A 1 O AOO 1 AA 1 


l "i Ann 


CCTTTCATGC 


fM ia -m ^ftfti ft MWUft 

TTAACAATTC 


CGTACCATCT 


jft*^ ft ft /"ft » ft ft ft. m 

CoAAoaAAAT 


GAAAATGAAT 


TTTCX'rGGGX 


i 


AAATTGGTAT 


TCTTCTCTAT 


oAAAoTGGOA 


Gull ItAAAA 


T A fTTTrr A A 
I AO All I.*. AA 


A A 1 A A O lUft A 


1 J ouu 


TTCCoAATAT 


#*ft H A^ft ft ^ft^ft ft ft ft 

TACCAGCAAA 


ft ^ftffftfM ft *»^ft*fft A ft 

AGTTAGGToA 


ACACTTTCTT 


1 All AAO 1 A I 


AOA 1 A OA A OA 


1 3860 


fW* A^* A & & A & 


GATCTTCTGC 


A I A i IAj 1 OOO 


A A AT1TCTA A 
AAA 1 A 1 V* A AA 


TCTTTTCTTC 


CT.ATATCTCA 

UVJ/I 4 n 4 ft* 4 ft. A\ 


13920 


AATTCCITCA 


UVMA 4AAI I 


* • • AAA A OA - 


V»>J » 1 k*A 


C VAJV*^A* r AT A 

V**>nVft • • A 1 n 


ATC AC* ACCT 


13980 


/V*pTiAftP*Pn' 
OOO 1 AAAW 1 1 


TTTTTGAGAT 


nnn ill /vvtU 


AriTTTr! AAAA 


TCAACCCATC 


*T*Tft^T*I**t*ftf*A^T 

4 4 4 4 4 V-#^\* 4 


14040 


CCACCTACGG 


TTAAACTATC 


TGGCCAAACA 


TCCATACAAT 


ATAGAAACAT 


CGGTTTCTTA 


14100 


TATTTTTTTT 


TATAAGCCAT 


ACCAGCCCAT 


CCCATCATAA 


CTGGAGACAA 


TTGGTTAACC 


14160 


AATACACAGT 


CAAAATTCGA 


TCCATCTTTC 


GTTTTATACC 


TCCCCAATAA 


AACTCCTAAA 


14220 


GTAGAACTAA 


TTGCAAAGCT 


AAAATAATTC 


AACAATCGAA 


ATACAACACT 


TTTTTTTCTA 


14280 


GGGATTGTAT 


AAGAACGATA 


TATCCTAACA 


CCTTCTATAA 


TCTCACGTCT 


TTTriTATTA 


14340 


TGACGATAAT 


CTGCATATAT 


CTTCCCTTCA 


GGGTAATTAG 


GAATCCCACC 


CAAAACAGAG 


14400 


ACTTCATGCC 


CTTTTCGAAC 


TAAATCTTCA 


CAAATATCTG 


ACAACCTGAA 


TGGTTCTGGC 


14460 


TTATAATGTT 


GGCAAACAAA 


TAGTATTTTC 


ATTGTCCAAT 


TTAACTTTCT 


TTCTTACCAC ' 


14520 



onefwtn- _,ia/o 
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TACCCTCTAC 


AATACCTTTT 


CGTTTCAGTA 


CGTAAGGTAT 


TGTCTTAACT 


ATACATCTAA 


14560 


TATCCATTAT 


CAAAGACAGA 


TGTTTAACAT 


AGTAGCCATC 


TAACTCCGTC 


TTCATCTCAA 


14640 


CAGACAAAGT 


ATCACCCCCG 


TTAATTTGTG 


CCCATCCAGT 


TAACCCTGGC 


AAGATATCAT 


14700 


TTGCTCCATA 


CTTATCTCTC 


TCTGCAATCA 


AATCTAGTTC 


ATTTATACCC 


GCTGGTCTAG 


14760 


GACCTACAAT 


ACTCATATTA 


CCAACAAGAA 


TATTAAACAA 


TTGTGGTAGT 


TCATCCAAAG 


14820 


ATGTTTTTCG 


CAAGAAAGCC 


CCTACTTTTG 


TAATCyATTG 


CTCTGGATTA 


TATAAGTTTC 


14880 


GAGGCGCCAC 


ATTTTTAGGT 


GCATCTATTT 


TCATAGACCT 


AAATTTCAAA 


ATATAGAAGT 


14940 


ATTCTTTATG 


AATACCAAAG 


CGTTTTTGCT 


TAAATATAAC 


CGGACCTTCT 


GAATCAAGTT 


15000 


TAATCGCAAT 


TGCAATTATC 


ATAAAAACCG 


GACACAATAT 


TATTATCCCT 


ATT AAAGATA 


15060 


ATAATATATC 


ACCTAATCGT 


TTTATTATAC 


CGTACATAAA 


CAACCTCCAA 


CTATAAATTC 


15120 


TATTTCCATT 


TTTCATTCTA 


TTTCCATTTG 


ACAAATTAAA 


TCACGCAGTA 


CATGCAACTA 


15180 


CAGAAACTCA 


ATATATATTT 


GCTCACTCAA 


TGATTTTCAG 


AAATATAATT 


CTTTTATCCT 


15240 


CTACGTCAGA 


TAAAACTTTT 


CTCCATCTAA 


ACAAAATTTA 


TTTGTTTCAC 


TAATATATGA 


15300 


CTTCTCAATA 


ATGAATTACA 


AGGTCCAGTT 


CAATTATTCT 


TCCAAATAGA 


CCCAATATTA 


15360 


TTTGAAGACA 


TATCGGTTTC 


TCAAATTGCA 


ATCAGTACAT 


AAGCTAATAA 


ACTCATAAGT 


15420 


ATG CTCTGT A 


AGAATGCCAG 


AGTTATATTG 


TACTCCCCTT 


CCATACTATA 


TTCATTTTAT 


15480 


TTTTTACCAT 


AATTTCCATA 


GGAACCGTAA 


ACTCCATACT 


TATTAACCGA 


GATATCCAAT 


15540 


TTATTTAAAA 


CAACTCCTAG 


GAACAGTTTC 


CCTGTTTGTT 


TTAATTCTTG 


TTTCCCTTTT 


15600 


TGGATATCAC 


GTTTATTCCC 


CTCACCTGTT 


CCTGTTACCA 


AGATGGACGC 


ATC ACACTTT 


15660 


TGAGTGATAA 


TTCCCGCATC 


AATAACAATT 


CCAATACGCG 


GTGTATCAAT 


AATGATATAA 


15720 


TCAAAATATT 


TACGCAATGT 


TTCAATCATA 


TCATTAAAAT 


TTTTACTTTG 


TAACAAGGCT 


15780 


GTAGCGTTTG 


GTCATACAGA 


TCCCGATTGA 


ACTACAAATA 


AATTTTCAAT 


ATTTGTATCA 


15840 


CATAAACCGT 


GAGATAAATC 


AGCTGTCCCA 


GATAAAAATT 


CTGTTACCCC 


TGTAATn rr 


15900 


TCACGAGATT 


TAAAAACTCC 


TAACATAACT 


CAATTTCGAG 


TATCGCCATC 


GATCAAAAGA 


.5960 


CTTTTATAGC 


CTGCACGCGC 


AAACGACCAT 


GCTATATTTA 


TGGAAGTAGT 


TGTTTTTCCT 


16020 


TCCCCAGGGT 


TAACAGAAGT 


AACGGAAATT 


ACTTTTACTT 


TATCTCCGCT 


CAACTGTATA 


16080 


TTTGTACACA 


AGGCATTGTA 


ATATTCTTCT 


GCCTTCTTAA 


TGAACTCCAG 


TTTTTTTTCT 


16140 


GCTATTTCTA 


ATGTCGGCAT 


CCTTCTCTCC 


TATTTCAACT 


TACCCAAGTT 


TGGCACAACT 


16200 


CCCAAAAGTG 


TCATCTGCAA 


TGTATTTTCG 


ATATCTTCCG 


GACGTTTCAC 


ACGAGTATCC 


16260 


AAAAGTTCAA 


GATGAAGAAC 


TATAACACTA 


GTTCCAATCA 


CCCCTGCCAA 


AAAACCAATT 


16320 
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ACTCTATTGC CTTTAATATT TGGCGAAGAC GGGGATATCG CCGGCCTTCC CTCCTCCACT 
GTTGTCACGT CAGAAACACG ACTAATACTG ATAATTTTTT GAGCAGCTAC TTCTCTCAAA 
GAGTTAGCGA TACGGCTTGC CTCTTCAGGA ACTCGATCAT TAACTGAAAT AGAGACAATA 
CGGGTATCAA CTGGTACTGT CACTTTAATT TTATTAGCCA AACCTTTTGG CCTCAAATCT 
AGTTTCAAAT. CAGAAACAAC TTCCTCCAAA ACATCCTGCG AAAGG AT AAT CTCACGCTAG 
TCTTTTACCA GATAAGTTCC TGCCTGCAAA TCCTGATTTG TCAACCCCCG CTTGTCTCCT 
TGATTGC GAT TCACTACGTA AATTCGCCTG GTACTCGTAT ATTCTGCCTT AACAATAAAA 
GTGCTATATG CAAAAGCCCC CGCACCTGTC ACAACTGCCA CTATTAAAAT CATTAGCTTC 
CGTTTCCACA AGCTTTTAAC TAATTGAAAT ACATCGATTT CTATCGTATT TTGTTCTTTC 
ATCATTTCTC CTAAATTAGT TG AT CC ATT A CAATTTTTCG AGGATTGTCT ATAAAAAGTT 
CCTGAGCCTT CCCTTCTCCG TATTTTTGGG TAACAAGGTC ATATGCTTCT GCCATATGAC 
CACCTCTACC GTCTAGATTG TGCATATCAC TTGCAATGAC ATGAACCAAA TCCTGCTCTA 
AAAAAT ACT G - AGCTCTTTTT TTCATGAATT TATAACGTTC GCCAAAAACT TTCCGTTTGA 
GGACATGTGA ACTATTTACT TGCGTGTAAC AGCCCATATC GATCAGTTCT CGAACGCGTT 
TTTCATTATT TTCAAGAGCA TCATAGCGCT CAATGTGGGC AATGACTCGA CTAATTCCCA 
ACATCAAGAT CTTGCTCAAG GCGCTATGAA TATCGCGATA AGGACTGTTC AT ACT AAACT 
CTATCAAGGC ATAACGACTA TCATTGAGGG TCGGAATCCC CTTTTTTTCC ACCTTATCCA 
GAACATCTGG TGTGTAATAA ATTTCACCCC CGTAAGCAAT CACCAAGTCA CTCGCCACTT 
CCTTAGCTAT TTCCCGAACC TGAAGAAACT TTTCTGCTAT CTTCTCTTCC GGAGTTTCAA 
ACATGCCCTT GCGACGGTGA G AGGT AG AAA CAATGGTTCG CACCCCCTCT CTGTAGGATT 
CTCCCAAGAG AGCCTTGCTT TCCTCTCTTG ACTTGGGACC CTCATCTACA TCAAAAACCA 
TATCCGAATG GATGTCTATC ATTTCATCTA CCCTCCATCA CATCCTGTAT AGCTGCTTTA 
ACTACAGCTA AACTACTATC ATCTATTTCC ATCACATAGA GGTTACTGTC TGGCATTGCA 
TAAGAAGGAA GATCCATCCG ACCTGTCCCT TTTAAATCTT GAGAATTTAC TTTATAATTC 
CCTCCACTTT CTAACTGAGC ATTGACCAAA TTTATCATGC TCTCAAGTGC CATATTTGTT 
TGGATAGAAT CTTGCAAGCT ATTAATGATC GTACTATAAT TTTTCAGCAC TTCGGTTGAC 
GTTAATTTTT GAAGGATAGC CACAATCACC TTTTGTTGAT GGCGCCCGCG GTCACCATCG 
CCATCTGCTA GGGAGTAGCG CTCACGAACA AAACCGAGAG CCTGTTCTGA ATCAAGATGA 
ACATTGCCTG CAGGGTAATA CTTTCCATTC GTATGGGCAC TAAATTCTTG AT C ATT AT AA 
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16440 

16500 

16560 

16620 

16680 
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16860 
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17040 

17100 
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ACATCAATTC CACCCAACAA ATCAATCAAT TTCAAAAACG AAGTGAACTT CAATCGCACA 
TAGTAATTGA TATCCACTCC ATAGAGATTT TCTAAGGTGT GAATGGACGA ATCAACTCCA 
TAAATGCCCG CATGAGTCAA TTTATCTTTT TGATTATTTC CACCATCTGC GATTGGTACA 
TAGGCATCAC GTGGCGTTGT GCTCAAGAGG ATTTTCTTGG TATCTCGATT GACAGTCATC 
AGGATCTTGA CATCTGATCG CGACACCGAA CTAATAGGAC CATAGGTCTC AATTCCACTA 
ACATAGATAT TGAAAGACTG ACTCTTAGAC GTCTTAGGAG CTTCTACTTT TTTAGTGAAT 
CCCTTAGTAT AAATCTTTTT TATCTTCGAT GCGTAGTCTC GATACTCTGA CTCGATGATG 
TTTTCAAAGA CACTATTTAC GACAATGGCC TTAGTCTCCC CTGCAATCAA ACTCTTGTAA 
GCTGCCAAGT AAGACGAACT CTGGTTGACC GTCAAATCGG TATTCTGACT TGACTTGATA 
TCAGCTAGTA ATTTCTCAAT ATTTTCATTA TTAGTCCCAG TCGGTGCTGT CACACTCGTC 
AGTTGCGTAA CATTTTCGAT CTCACTATCT GCTAAAACAG CGACACTGAT TCAATATTCT 
GAGTAATTAG AAGTCGCATT TAAACGATTG CTCAGTCCAA CAAACTGCTG TACTGCAAAG 
AGCGACACAG AGCTGACAAG GATAGAGAAC ACCAACAGAA AAATACTAAA CTTTTCAGCT 
lit- i 1 AT AG A TAATCAAGAG TAGCCCTACC AAGGCAACTA GTAGGACTAA CCCAGTTACC 
ACT AG ATT AA GATATCTAAA AGCAAGGATA TTCTACTTAA AGATTAAGAA CAATAAAAAA 
CAAACTAACA ATAAATAAAT AGTCAGCAAA ACT AT ATT AA CACTTCGCTT CACTTTCTGT 
GAACGTGATT TTTTAAAACC TCTACTCATG ATTAATACCT ATACATTGAA C ATT AT ACGA 
TTATATCACT TTTTTACGGT AATGTCTACA CCTTTATTTT TACTATCTGC ATCTTTAAGT 
ATCTTAGTAG ACTTCCCGCC AAACAAAAAT ATAGTAAAAT CAAATAAGAA CAGAACAAAT 
CGTTCAGGAC AGTCAAATCG ATTTCTAACA ATGTTTTAGA AGCAGAGGTG 
<2) INFORMATION FOR SEQ ID NO: 36: 

fi> SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 21706 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EON ESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AAAGTTGAAA CACTGCTAGC TGTTTTTGAT ACCAATCGTT TCCAACTACA GACCAAACAG 60 

TATACAAAGT TTCTTTTTGG ATGTAAGCTT CTTGATGCAC AATTCCAAGA AAATCAACAA 120 

ATTGCTGACC TTCAATTTTT TGCCATTGAC CAACTGCCGA ACTTATCTCA AAAACGCATT 180 

ACCAAGGAGC AAATAGAGCT T CTTTGGC AG GTTTATCAAG GTCATAGGGG GCAATATCTT 240 
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GACTAAGAAl» 


A L\Jf\ 1 lAl WW 


T A'PTTC T AAA 


TCCATTTTTA 


ACAACTAGCA 


TGGTATAATA 


300 


ATATGCAWA*A 


AAA 111 1 


TTATGAGGAA 


GACTAGATGA 


ATTTATGGGA 


TATTTTCTTT 


360 


ACGACTCAW* 


LAAL \» l» AfcA.. V> 


CCCCAAATTT 


GACCTTTTTT 


GCTATGTTAC 


CCTATTTACG 


420 


CTCTTACCCT 


TAACCTTTTA 


TACAGCCCAT 


CGCTATCCTG 


AAAAGAAGGT 


TTACCAACGA 


480 


TTTTTCCAAA 




TVITTf* ACTT A 


ATCCTTCTTT 


ATGCTTGGTA 


CTGGGTCAAT 


540 


CATATGCCAC 


TCTCAGAAAO 


LW InULL 1 1 * 


TACCATTGCC 


GTATGGCTAT 


GTTTGTGGTA 


600 


CTCTTGCTTC 


CTGGTCAATC 


L. AAA 1 A 1 A«/* 


CAATACTTTG 


CATTATTGGG 


AACATTTGGG 


660 


ACATTAGCAG 


CCTTTGTTTA 




GATGCTTACC 


CTTTTCCACA 


TATCACCATT 


720 


CTATCCTTTA 


TCTTTGGTCA 


I J 1 nuU\\. 1 1> 


TTGCGGAACT 


CTCTAGTTTA 


TCTATTGAGA 


780 


CAGTATAATG 


CGCGATTGCT 


GGATGTGAAO 


\J^»f\J\ 1 & * • • W 


TCATGACCTT 


TGCCCTAAAT 


840 


GCCTTGATTT 


MUWAMM ft ft 

TTGTGGTCAA 


TTTvjuTbnLn 




ACGGATTTTT 


GACAAAACCG 


900 


CCATTGGTTG 


GGGATCACGG 


TCTAGTAvSCT 


AA 1 J A 1 1 1 AV. 




TGTGCTGGTA 


960 


GCTACTATCA 


GTTTGACTAA 


GAAAATCTTA 


GAATTCTTTT 


TAGCTCAAGA 


AGCACAAAAA 


1020 


ATGATTGCAA 


AGGAAGCTTA 




TT/TTTTTTfl 


CTCTTAGAGA 


GTTTTTACAA 


1080 


GCAGCTTATA 


AAATAAGAAT 




iriiiprrwi 

nUrt/vnv. 1 V-AA 


AAAATGCCTG 


GGAAATTTAG 


1140 


GAAAAAAGCA 


AG CACG ATT A 


AATTTTTTGT 


Ul lAl /W A * 


TTTGTGAATA 


GCTATGCCTA 


1200 


TGTTTAGCTA 


TCCAATAATA 




AACTTCGAAG 


ATAGAGAGGA 


AGCGATGTAA 


1260 


TGCCTAGAGA 


AGGCi^i i ill 


AwAwta IL1 A\J 


ATATTGGAAC 


AAGCTCTGTC 


AAGGTCCTTG 


1320 


TGGCCGAGCA 


GAGAAATGGT 


r~ft tl"P&&&Tf! 
uAA i 1 AAA lw 


TAATTGGCGT 


GAGTAATGCC 


AAAACTAAAG 


1360 


GTGTAAAGtiA 




CTTGATATTG 


ATGCAGCACC 


AACTGCTATC 


AAGTCAGCCA 


1440 


TTTCCCAAGC 




flC AfiGCATTT 

W Wnn*\»#% • • * 


CGATTAAATC 


AGTGAATGTC 


GCCTTGCCTC 


1500 


GTAATCTTTT 


GCAGGTAGAA 


CCAACTCAGG 


GGATCATTCC 


* i"'^ & n ^ h M*I^T 


wA 1 rW* i 


1560 


AAATTACGGA 


TCAAGATGTT 


GAAAATGTTG 


TCAAATCAGC 


TTTGACAAAG 


AGTATGACAC 


1620 


CTGACCGTGA 


AGTC ATTACC 


TTTATTCCTG 


AAGAATTTAT 


TGTGGATGGT 


TTCCAAGGGA 


1680 


TTCGTGACCC 


ACGTGGCATG 


ATGGGGGTTC 


GCCTTCAAAT 


GCCTGGTTTG 


CTTTATACAG 


1740 


GACCTCGTAC 


TATCTTGCAC 


AATTTGCGTA 


AGACGGTTGA 


GCCTGCAGGT 


CTTCAGGTTC 


1800 


AAAATGTTAT 


CATTTCACCA 


CTAGCAATGG 


TTCAGTCTGT 


TTTGAACGAA 


GGGGAACGTG 


I860 


AATTTGGTGC 


TACAGTGATT 


GATATGGGGG 


CAGGTCAAAC 


GACTGTCGCT 


ACAATCCGTA 


1920 


ATCAAGAACT 


CCAGTTCACA 


CATATTCTCC 


AAGAAGGTGG 


AGATTATGTA 


ACTAAAGATA 


1980 
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TCTCCAAGCT 


TTTGAAAACC 


TCTCGCAAAT 


TACCGGAAGG 


CTTGAAACTG 


AATTACGCGG 


2040 


AAGCCTATCC 


GCCTCTTGCA 


AGCAAAGAAA 


CCTTCCAACT 


AGAGGTTATT 


GGAGAAGTAC 


2100 


AAGCAGTCGA 


AGTGACGGAA 


GCCTACTTGT 


CAGAAATTAT 


TTCTGCACGA 


ATCAAGCACA 


2160 


TCCTTGAACA 


AATCAAGCAA 


GAATTAGATA 


GAAGGCGTCT 


ATTCGACCTC 


CCTGGTGGTA 


2220 


TTGTCTTAAT 


CGGTGGGAAT 


GCCATTTTAC 


CAGCTATCGT 


TGAGCTTGCT 


CAGGAAGTCT 


2280 


TTGCCGTCCC 


TGTCAAGCTT 


TATGTTCCAA 


ATCAAGTTGG 


TATCCGTAAT 


CCAGCCTTTG 


2340 


CCCATCTGAT 


TAGTTTATCA 


GAATTTGCGG 


CTCAATTAAC 


AGAAGTTAAT 


CTTTTGGCTC 


2400 


ACGGAGCGAT 


AAAAGGTGAG 


AATGACTTAA 


GTCATCAGCC 


AATTACTTTT 


GGTGGGATGC 


2460 


TGCAAAAAAC 


AGCTCAGTTT 


GTACAATCAA 


CGCCTGTTCA 


ACCAGCTCCT 


GCTCCAGAAG 


2520 


TAGAGCCGGT 


GGCGCCTACA 


GAACCAATGG 


CGGATTTCCA 


ACAAGCTTCA 


CAAAATAAAC 


2580 


CGAAATTAGC 


AGATCGTTTC 


CGTGGATTCA 


TCGGAAGCAT 


GTTTGACGAA 


TAAAGAGGAA 


2640 


AAATAAATTA 


TGACATTTTC 


ATTTGATACA 


GCTCCTGCTC 


AAGGGGCAGT 


GATTAAACTA 


2700 


ATTGGTGTCC 


GTGGACGTCG 


TGGCAATGCC 


ATCAACCGTA 


TGGTCGACGA 


AGGTGTTACA 


2760 


GGCGTAGAAT 


TTATCGCAGC 


AAACACAGAT 


GTACAAGCAT 


TGAGTAGTAC 


AAAAGCTGAG 


2820 


ACTGTTATTC 


AGTTGGGACC 


TAAATTGACT 


CGTGGTTTGG 


GTGCAGGAGC 


TCAACCTGAG 


2880 


CTTGGTCGTA 


AAGCCGCTGA 


AGAAAGCGAA 


G AAAC ACT G A 


CGGAAGCTAT 


TAGTGGTGCC 


2940 


GATATGGTCT 


TCATCACTGC 


TGGTATGGGA 


GCAGGCTCTG 


CAACTGGAGC 


TGCTCCTGTT 


3000 


ATTCCTCGTA 


TCGCCAAAGA 


TTTAGGTGCG 


CTTACAGTTG 


GTGTTGTAAC 


ACCTCCCTTT 


3060 


GCTTTTGAAG 


GAAGTAAGCG 


TGGACAATTT 


GCTGTAGAAG 


CAATCAATCA 


ACTTCGTGAG 


3120 


CATGTAGACA 


CTCTATTGAT 


TATCTCAAAC 


AACAATTTGC 


TTGAAATTGT 


TGATAAGAAA 


3180 


ACACCGCTTT 


TGGAGGCTCT 


TAGCGAAGCG 


GATAACGTTC 


TTCGTCAACG 


TGTTCAAGGG 


3240 


ATTACCGATT 


TGATTACCAA 


TCCAGGATTG 


ATTAACCTTG 


ACTTTGCCCA 


TGTGAAAACG 


3300 


GTAATGGCAA 


ACAAAGGGAA 


TGCTCTTATG 


GGTATTGGTA 


TCGCTACTGG 


AGAAGAACCT 


3360 


GTGGTAGAAG 


CGGCACCTAA 


GGCAATCTAT 


TCACCACTTC 


TTGAAACAAC 


TATTGACGGT 


3420 


GCTGAGGATC 


TTATCCTCAA 


CGTTACTGGT 


GGTCTTGACT 


TAACCTTGAT 


TCAGCCAGAA 


3480 


GAGGCTTCAC 


AAATTGTGAA 


CCAGGCAGCA 


GGTCAAGGAG 


TGAACATCTG 


GCTCGCTACT 


354Q 


TCAATTCATG 


AAAGTATGCG 


TGATGAAATT 


CGTGTAACAG 


TTGTTGCAAC 


GGGTGTTCCT 


3600 


CAAGACCGCG 


TACAAAAGGT 


TGTCGCTCCA 


CAAGCTAGAT 


CTGCTACTAA 


CTACCGTGAG 


3660 


ACAGTGAAAC 


CAGCTCATTC 


ACATGGCTTT 


GATCGTCATT 


TTGATATGGC 


AGAAACAGTT 


3720 


GAATTGCCAA 


AACAAAATCC 


ACGTCGTTTG 


GAACCAACTC 


AGGCATCTGC 


TTTTGGTGAT 


3780 
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TGGCATCTTC 


GCCGTGAATC 


GATTGTTCGT 


ALnnLnun 1 1 


CAGTCGTTTC 




1 A Aft 


CGCTTTGAAG 


CCCCAATTTC 


ACAAGAl\»AA 






ATTTTTCAAA 


tonn 


AATCGTTAAG 


TAAATGAATG 


TAAAAGAAAA 


TACAGAACTT 


GTTTTTCGAG 




t <j <; ft 


GGCTACTCTG 


AGTGCTCATC 


GAGAGAGTGG 


TTCGGTCTCT 


\j 1 \_M 1 r\>^Av> 






TCTAGATGTA 


CCGACAGCGG 


AAGCCTTCCT 


TCCGCTAGGT 




•prw^^Yt & & & & 

l I vnnnA 


4 nan 


TCGTGTAGAT 


AAGTTTCTGG 


AAAAATATGA 


AGCTTTAAAA 




TGAvTIvGCA 


4 140 


TTTGATTGCT 


ACCTTGCAAA 


GACGTAAGGT 


GAAAGATGTC 


ATTCAATACG 


TTGATTATTT 


4200 


CCATGCATTG 


GACTCAGTAA 


AGCTAGCAGG 


CCAAATTCAA 


AAAAGAAGTG 


ACCGAGTCAT 


4260 


CAAGTGTTTC 


CTTCAACTAA 


ATATTTCTAA 


AGAAGAAAGC 


AAACACGGTT 


TTTCGAGAGA 


4 320 


GGAACTCCTG 


GAAATCTTGC CAGAGTTAGC C AG ACT AG AT 


AAGATTGAAT 


ATGTTGGTTT 


4380 


AATGACGATG 


UwU( W A * A » W 


AGGCTAGCAG 


TCAGCAGTTG 


AAAGAGATTT 


TCAAGGCGGC 


4440 


CCAACATTTA 


CAAAGAGAAA 


TTCAAGAGAA 


ACAAATTCCA 


AATATGCCTA 


TGACCGAGTT 


4500 


AAGTATGGGA 


ATGAGTCGTG 


ATTATAAAGA 


AGCGATTCAA 


TTCGCTTCCA 


CTTTTCTTCG 


4560 


TATACGTACA 


TCATTTTTTA 


AGTAGGAGAG 


AACCATGTCT 


TTAAAAGATA 


GATTCGATAG 


4 620 


ATTTATAGAT 


TATTTTACGG 


AGGATGAGGA 


TTCAACTCTC 


CCTTATGAAA 


AAAGAGATGA 


4680 


GCCTGTGTTT 


ACTTCAGTAA 


ATTCTTCACA 


GGAACCGGCT 


CTCCCAATGA 


ATCAACCTTC 


4740 


ACACTCCCCT 


GGCACAAAAG 


AGAACAATAT 


CACCAGACTT 


CATGCAAGAC 


AACAGGAATT 


4800 


CGCAAATCAG 


AGTCAGCCTG 


CAACGGATAA 


GOT C ATT AT A 


GATGTTCCTT 


ATCCTAGAAA 


4 860 


ATATGAGGAT 


GCAACAGAAA 


TTGTTGATTT 


ATTGGCAGGA 


AACGAAAGTA 


TCTTGATTGA 


4920 


TTTTCAGTAT 


ATCACACAGG 


TCCACCCTCC 


TCGTTGTTTC 


GACTATTTGG 


ATCCACCTTv* 


a a an 


TCATGTTTTA 


GCTGGAAATT 


TGAAAAAGGT 


AGCTTCTACC 


ATGTATTTGT 




JV1V 


GAACCTTATT 


GTAAATGTTG 


AAGATATCCG 


TTTACCACAT 


GAAGATCAAC 


AGGGTGAGTT 


5100 


CGGTTTTGAT 


ATGAAGCGAA 


ATAGAGTACG 


ATAATGATTT 


TTTTAATTCG 


TATGATTTAT 


5160 


AATGCAGTCG 


ATATTTACTC 


CCTGATTTTG 


GTAGCCTTCG 


CTGTCATGTC 


TTGGTTTCCA 


5220 


GGTGCCTACG 


AATCCAGTTT 


AGGTCGTTGG 


ATTGTAGCCT 


TGGTCAAACC 


AGTGCTTGCT 


5280 


CCCTTGCAAC 


GCCTGCCTTT 


ACAGATAGCG 


GGTCTTGATT 


TATCTGTTTG 


GGTTGCGATT 


5340 


GTTTTCCTTC 


GATTTTTAGC 


AGAAAACCTA 


GTGCGTTTTC 


TGG CGATG AT 


AGGATGAATA 


5400 


AAGGGATTTA 


TCAGCATTTC 


TCCATAGAAG 


ATCGTCCATT 


TCTTGACAAG 


GGAATGGAAT 


5460 


GGATAAAGAA 


GGTAGAAGAT 


AGCTATGCTC 


CTTTTTTAAC 


TCCTTTTATC 


AATCCTCATC 


5520 
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AGCAGAAGCT 


ATTAAAGATT 


TTGGCCAAAA 


CCTATGGTCT 


TGCTTGTAGC 


AGTAGTGGGG 


5580 


AATTCGTCTC 


CAGTGAGTAT 


CTTCGAGTTT 


TATTATACCC 


AGATTATTTC 


CAACCAGAGT 


5640 


TTTCAGATTT 


TGAAATATCT 


CTCCAGGAAA 


TTGTGTATTC 


CAATAAATTT 


GAACATTTAA 


5700 


CGCATCCTAA 


GATTTTAGGG 


ACAGTCATCA ATCAATTAGG 


GATTGAACGG 


AAACTTTTTG 


5760 


GAGATATCCT 


AGTAGATGAA 


GAACGGGCGC 


AGATTATGAT 


tjottcagcag 


TTTCTTCTTC 


5820 


TCTTTCAAGA 


TGGACTAAAG 


AAAATTGGTC 


GTATACCTCT 


TTCGCTGGAG 


GAACGTCCTT 


5880 


TCACCGAGAA 


AATAGATAAG 


CTAGAACACT ATCGAGAACT 


GGATTTATCT 


GTGTCTAGTT 


5940 


TTCGATTAGA 


TGTTCTTTTA 


TCAAATGTTT 


TGAAACTATC 


TAGGAATCAA 


GCAAACCAGT 


6000 


TGATTGAAAA 


GAAACTTGTC 


CAAGTAAATT 


ATCATGTGGT 


AGACAAATCA 


CATTACACTG 


6060 


TTCAAGTTGG 


AGACTTGATT 


AGTGTGACAA 


AATTTGGTCG 


CTTGAGATTA 


CTTCAAGATA 


6120 


AGGGACAAAC 


GAAAAAAGAG 


AAGAAAAAAA 


TAACCCTCCA 


GTTATTATTA 


AGTAAGTGAG 


6180 


CAATACAATC 


CCAATTACAT 


CATTAGAAAT 


AAAGGACAAG 


ACTTTTGGAA 


CTCGATTCAG 


6240 


AGCTTTTGAT 


CCAGAAGAAC 


TCGATGAATT 


TTTAGATATT 


CTGCTTCCTG 


ATTACGAAGA 


6300 


TCTTGTGCGT 


GCGAATCATG 


ATAAAAATTT 


GCCTATTAAG 


AGTTTAGAAG 


AGCGTTTGTC 


6360 


TTACTTTGAT 


GAAATAAAAG 


ATTCATTGAG 


CCAGTCTGTA 


TTCATTCCTC 


AGGATACAGC 


6420 


TGAGAGAGTG 


AAACAGGCGG 


CGCATGAACC 


TTCAAACAAT 


ATCATTCATC 


AAGCAGAGCA 


6480 


ACATGCGCAA 


CGCTTGTTGG 


AAGAAGCTAA 


ATATAAGGCA 


AACCAGATTC 


TTCGTCAAGC 


6540 


AACTGATAAT 


GCTAAGAAAG 


TCGCTGTTGA 


AACAGAAGAA 


TTGAAGAACA 


AGAGCCGTGT 


6600 


CTTCCACCAA 


CCTCTCAAAT 


CTACAATTGA 


CAGTCAGTTG 


GCTATTCTTG 


AATCTTCAGA 


6660 


TTGGGAAGAT 


ATTCTCCGTC 


CAACAGCTAC 


TTATCTTCAA 


ACCACTGATG 


AAGCCTTTAA 


6720 


AGAAGTGCTT 


AGCGAAGTAC 


TTGGAGAACC 


GATTCCAGCT 


CCAATTCAAG 


AAGAACCAAT 


6780 


TGATATGACA 


CGTCAGTTCT 


CTCAAGCAGA 


AATGGCAGAA 


TTACAAGCTC 


GTATTGAGGT 


6840 


AGCCGATAAA 


CAATTGTCTG 


AATTTGAAGC 


TCAGATTAAA 


CAGCAAGTGG 


AAGCTCCAAC 


6900 


TCCTGTAGTG 


AGTCCTCAAG 


TTGAAGAAGA 


GCCTCTGCTC 


ATCCAGTTGG 


CCCAATGTAT 


6960 


GAAGAACCAG 


AACTAGCTCC 


AATGCATCCG 


ATAGCTCCAA 


CACCAGCTAC 


AGAAACTGTT 


7020 


GATTCAATAC 


CGGGATTTGA 


AGCACCGCAA 


GAATCTGTTA 


CAATTTTATA 


AGAAATATTC 


7080 


TGAGAACAAT 


ATCTTATCCT 


TATATTTCCA 


GCGACCAGGA 


GATGGTGTGA 


GTCCTGTAAT 


7140 


CCCTATTGAT 


AAGATTATCC 


TCTCAAAAAC 


TCAACTCTCA 


ACCTAGTAAG 


ATTTGACCTT 


7200 


TCCCACGTTA 


CGGGATAAGA 


GGGAGAAAGA 


CTAAATCTTT 


TTCCGAATAA 


AGGTGGTACC 


7260 


ACGATTTTCG 


TCCTTTTTGG 


AAGTCGTGGT 


TTTTAATTTG 


TTATTATTTA 


TAAAGGAGAT 


7320 
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ACCATGAAAC 


TCAAAGACAC 


CCTTAATCTT GGGAAAACTG 


AJVTTCCCAAT 


CCGTGCAGGC 


7380 


CTTCCTACCA 


AAGAGCCAGT 


TTGGCAAAAG 


GAATGGGAAG 


ATGPAAAACT 


TTATCAACGT 


7440 


CCTCAAGAAT 


TGAACCAAGG 


AAAACCTCAT 


TTCACCTTGC 


A TCATCGCCC 


TCCATACGCT 


7500 


AACGCAAATA 


TCCACGTTGG 


ACATGCTATG 


AACAAGATTT 


WAAMuA 1 A 1 


CATTGTTCGT 


7S60 


TCTAAGTCTA 


TGTCAGG ATT 


TTACGCACCA 


TTTATTCCTG 




1 1 VJU X V* *\M 


7620 


CCAATCCAGC 


AAGTCTTGTC 


AAAACAAGGT 


GTCAAACGTA 






7680 


TACTTGAAAC 


TTTGCCGTGA 




TCTCAAGTAG 


ATAAAwAACu 


iwfw\OA 1111 


7740 


AAACGTTTCG 


GTGTTTCTGG 


TflAPTeefiAA 


AATCCATATG 


TGACCTTGAW 


i I— »w iunL inl 


7800 


GAACCAGCTC 


AAATTCGTGT 


& *rf*t*fvyvfi Aft 

All 1 Vj*\V* 


ATCGCTAATA 


AGGGTTATAT 




/ SOU 


GCTAAGCCAG 


TTTACTCGTC 


f\ l \J\J 1 Itl 


GACTCAGCAC 


TTGCTGAAGC 


ft *"* ft ^" X iwiy* & A 

ACAGATTGAA 




TACCATGACT 


TGGTTTCAAC 


TTCCCTTTAC 




AGGTAAAAGA 


TGGCAAAGtiA 


/ 70U 


GTTCTACATA 


CAGATACTTA 




TCCACAACGA 


CTCCATTTAC 


CATCACAGCT 


8040 


TCTCGTGCTT 


TGACGGTTGC 


Tf-T' If". AT ATT 

X AV##\ • r\i l 


CATTACGTTT 


TGGTTCAACC 


TGCTGC* 1VAA 


o i nn 
o XUU 


GCTCGTAAGT 


TTCTCGTTGC 




TTGACTACCT 


TGTCTGAGAA 


A 1 riwVitl VjO 


01OU 


GCTGATGTTC 


AAGTTTTGGA 




GGCCAAGAAC 


TCAACCACAT 


CCTAACACjAA 


a •? 1 1\ 


CACCCATCGG 


ATACAGCTGT 




GTAATTCTTC 


GTGACCACGT 






TCTCGTACAG 


GTATTGTCCA 


Tar* Ai"t/T*pr**P 

1 /\LMlA«WwV. 1 


GGTTTTCCTG 


AGGACGATTA 


LAAIVj 1 1\Aj 1 


D J *■ U 


ATTGCTAATA 


ATCTTCAACT 




GTTGATGAAC 


GTGGTATCAT 




8400 


CCTGGTCCTG 


AATTTGAAGG 


TCAATTCTAT 


GAAAAGGTAG 


TTCCAACTGT 


■Pft-TtYLAAAAA 


8460 


CTTCCTAACC 


TCCTTCTTCC 


CCAAGAAGAA 


ATCTCTCACT 


V. AU A I ^LA • I 


t> r*. ft r^^v^fTT 


8520 


ACTAAGAAAC 


CAATCATCTG 


GCCTGCAGTT 


CCACAATGGT 


11 tj(.t> 1 WAO 1 


•p*T»f*T* & A ATTr* 
1 1 v_ 1 t\*\f\ I 1 V» 


8580 


CGTCAAGAAA 


TCTTGGACGA 


AATTGAAAAA 


GTGAAATTCC 


ACTCAGAATG 


GGGTAAAGTC 


8640 


CGTCTTTACA 


ATATGATCCG 


TGACCGTGGT 


GACTGGGTTA 


TCTCTCGTCA 


ACGTGCTTGG 


8700 


GGTCTTCCAC 


TTCCTATCTT 


CTACGCTGAA 


G ATGGT AC AG 


CTATCATGGT 


AGCTGAAACT 


8760 


ATTGAACACG 


TAGCTCAACT 


TTTTGAAGAA 


TATGGTTCAA 


GCATTTGGTG 


GGAACCTGAT 


8820 


GCCAAAGACC 


TCTTGCCAGA 


AGGATTTACT 


CATCCAGGTT 


CACCAAACGG 


CGAGTTCAAA 


8880 


AAACAAACTG 


ATATCATGGA 


CGTTTGGTTT 


GACTCAGGTT 


CATCATGGAA 


TGGAGTGGTG 


8940 


GTAAACCCTC 


CTGAATTCAC 


TTACCCACCC 


CACCTTTACC 


TAGAAGGTTC 


TGACCAATAC 


9000 


CGTGGTTGGT 


TTAACTCATC 


ACTTATCACA 


TCTGTTCCCA 


ACCATGGCGT 


AGCACCTTAC 


9060 
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AAACAAATCT 


TGTCACAAGG 


TTTTGCC CTT 


GATGCTAAAG 


GTGAGAAGAT GTCTAAATCT 


9120 


CTTGGAAATA 


CTATTCCTCC 


AACCGATGTT 


GAAAAACAAT 


TCGGTGCTGA 


AATCTTGCGT 


9180 


CTCTGGCTAA 


CAAGTGTTGA 


CTCAAGCAAT 


GACCTGCGTA 


TCTCTATGGA 


TATCTTGAGC 


9240 


CAAGTTTCTG 


AAACTTACCG 


TAAGATTCGT 


AACACTCTTC 


GTTTCTTGAT 


TGCCAATACA 


9300 


TCTGACTTTA 


ACCCAGCTCA 


AGATACAGTC 


GCTTACGATG 


AGCTTCGTTC 


AGTTGATAAG 


9360 


TACATGACGA 


TTCGC'TTTAA 


CCAGCTTGTC 


AAGACCATTC 


CTGATGCCTA 


TGCAGACTTT 


9420 


GAATTCTTGA 


CGATCTACAA 


GGCCTTGGTG 


AACTTTATCA 


ACGTTGACTT 


GTCAGCCTTC 


9480 


TACCTTGATT 


TTGCCAAAGA 


TGTTGTTTAC 


ATTGAAGCTG 


CCAAATCACT 


GGAACGCCGT 


9540 


CAAATGCAGA 


CTGTCTTCTA 


TGACATTCTT 


GTCAAAATCA 


CCAAACTCTT 


GACACCAATC 


9600 


CTTCCTCACA 


CTGCGGAAGA 


AATCTGGTCA 


TATCTTGAGT 


TTGAAACAGA 


AGACTTCCTC 


9660 


CAATTGTCAG 


AATTACCAGA 


AGTTCAAACT 


TTTGCTAACC 


AAGAAGAAAT 


CTTGGATACA 


9720 


TGGGCAGCCT 


TCATGGACTT 


TCCTGGACAA 


GCACAAAAAC 


CCTTGGAACA 


AGCTCGTAAT 


9780 


CCAAAAGTTA 


TCGGTAAATC 


ACTTGAAGCA 


CACTTGACAG 


TTTATCCAAA 


TGAAGTTGTG 


9840 


AAAACTCTAC 


TCGAAGCAGT 


AAACAGCAAT 


GTAGCACAAC 


TTTTGATCGT 


GTCTGAGTTG 


9900 


ACCATCCCAG 


AAGGACCAGC 


TCCGGAAGCT 


GCCCTT AGCT 


TCGAAGATCT 


AGCCTTCACA 


9960 


GTTGAACGTG 


CTACTGCTGA 


AGTATGT G AC 


CGTTGCCGTC 


CTATCGACCC 


AACAACACCA 


10020 


GAACGCAGCT 


ACCAGCCAGT 


TATCTGTGAC 


CACTGTGCAA 


GCATCGTAGA 


AGAAAACTTT 


10080 


GCGGAAGCAG 


TCGCAGAAGG 


ATTTGAAGAG 


AAATAAGATT 


GAAAAGTCTA 


GCCAAAATTC 


10140 


AATTTGACAA 


GAAAAGACAA 


CTAATTTTAT 


AGTCTATTAA 


ACGCATTCTA 


TCACCTTTTT 


10200 


GAATACCTGA 


TATGATGCGT 


TTTTTATTTA 


TTTTAAAAAT 


TTGCGAGGTA 


TGACTTTTTA 


10260 


TACTCAACAA 


GAATCAAAGA 


GAAACTTAGC 


AAGCTAACAG 


TAGTAACATA 


AAATAGGAAT 


10320 


TTC AT ATT AG 


GGATAAGATT 


GGTAAATAGT 


GTAATATTTT 


TACAACAATA 


AATTTATATA 


10380 


GTTATTTCTG 


GTTTCTGAAA 


AGTATTATAT 


TTTATTTCAT 


ATTATACAAA 


TTTTTATTTT 


10440 


ATAATATCAG 


AACATACTTT 


TTTTAAAAGC 


AAATATGATA 


CAATTTTATT 


TGAAAAAAAT 


10500 


AAAAAAGGAG 


ATTTTATTAT 


AAAATTAAAA 


AGACTTGCTT 


TAATTAGTGG 


TATCGTCGGT 


10560 


CTTCTGGGAG 


GAATTTTACT 


TCTTATTGGT 


CCTTTTGTCT 


TGTTGGGAAT 


AGCGGTAAAC 


10620 


ACAGCTGCTA 


CAACTCTTAA 


TGGAGGAGCT 


ACTGCAGGGG 


CTTTTTCAGG 


TGTAGCCTTA 


10680 


CTCTTGAATG 


CCTTGAACAT 


TGCAAATCTT 


CTTCTTGGTA 


TCATTGCTAT 


TGTTTACTAT 


10740 


AAAGGAGATA 


AGCCTGTAGG 


TGCAGCTCCG 


TCTGTACTAA 


TGATTGTTTC 


TGGTGGAGTT 


10800 


AGTCTCATTC 


TATTCCGTTC 


TTAGGATGGG 


TTGGGGGGAT 


TTTTGCTATT 


ATCGGACGAT 


10860 
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CTCTATTCCT 


TTCAACATTG 


AACAAATTCA 


AATCAGAAGA 


ATAAAAGGTA 


TTTTAGCATC 


10920 


AAAAGAACAA 


AAAAGTTTAT 


CGGTATAGGA 


GTAGCTCTAT 


TATCTCTTTC 


TCTTCTAGTT 


10980 


CCATGTGCAA 


CATAAAGTTC 


AAAGAATACT 


TCAACAAGTA 


ATCATGACAA GACAGTAGCA 


11040 


ACATCCAATA 


GTTCAAAAGA 


AACAATCACT 


TTCGATACAC 


l»ViVi 1 i\i 1 nAL 




11100 


ATTGAATCAA 


TACGCACTTA 


TGCAGATTAT 


ATACATCTTT 


ATAAAAATAT 


ill lAjrf\ 1 \*/\ 1 


11160 


TATTTTACTA 


AAGCTGAGGA 


AGGTTTCAAA 


GGCATAGCTA 


TGGAAAATAA 


TGACTCGXTX 


11220 


ACTAAACTAA 


AAGAGTCAAC 


TCAAAAATTA 


TTCGATGCGC 


AGAAAAAAAG 


GTTAAATAAT 


11280 


GAAGATACAA 


TAGAAACAAC 


CAAAAACAAT 


GTGATTGCCA 


AACATTGTCA 


AACAu IXX l 1 


11340 


TCCTTTTTGG 


TTTTGACTAG 


CTTTTTTGTG 


AAAAATTGTG 


TAAAATAGAA 


TAGATAAACVj 


11400 


AGGGGAAACC 


TCGGAAAATT 


TAAAGGAGAA 


TCCATCTAAT 


GGTAAAATTG 


U"1"TTTTmCTC 


11460 


GCCACGCTGA 


GTCTGAATGG 


AACAAAGCTA 


ACCTTTTCAC 


TGGTTGCGCT 


GATGTTGATT 


11520 


TGTCTCAAAA 


ACGTACACAA 


CAAGCGATTG 


ACGCTGGTAA 


ATTGATCAAA 


GAAGCTGGTA 


11580 


TCGAATTTCA 


CCAAGCTTAC 


ACTTCAGTAT 


TGAAACGTGC 


TATCAAAACA 


ACTAACTTGG 


11640 


CTCTTGAACC 


TTCTGACCAA 


TTGTGCGTTC 


CAGTTGAAAA 


ATCATGGCGC 


TTGAACGAAC 


11700 


CTCACTACGG 


TGGTTTGACT 


GGT AAAAAC A 


AAGCTCAAGC 


TGCTGAACAA 


TTTCCTCATG 


11760 


AGCAAGTTCA 


CATCTGGCGT 


CGTTCATACG 


ATGTATTGCC 


TCCAAACATG 


GAWWGTCiATvj 


11820 


ATCAGCACTC 


AGCTCACACA 


GACCGTCGTT 


ACGCTTCACT 


TGACGACTCA 


Uli A 


11880 


ATGCTGAAAA 


CTTGAAAGTG 


ACTTTGGAAC 


GTGCTCTTCC 




w/t 1 /vwv\ 1 ^.v» 


11940 


CTCCAGCTCT 


TAAAGATGGT 


AAAAACCTAT 


TCGTAGGAGC 


TCACGGTAAC 


TCAATCCGTG 


12000 


CCCTTGTAAA 


ACACATCAAA 


CGTTTCTCAG 


ATCACGAGAT 


CATSGACCTG 


GAAATCCCTA 


12060 


ACTTCCCACC 


ATTGGTATTC 


GAATTCGACG 


AAAAATTGAA 


CGTCGTTTCT 


GAATACTACC 


12120 


TTGGAAAATA AAAAATTGTA AGTCTAGAAT 


TGATTTCTAG 


GCTTTTTATG 


TTAGTATGGA 


12180 


AGTATGATAA 


GGAATAAAAA 


ACAAGATTAT 


GTACTGGCCT 


ACAAGCAACC 


AGCTTCAACC 


12240 


ACTTACATGG 


GTTGGGAAGA 


AGAAGCTTTA 


CCGATAGGCA 


ATGCTTCTTT 


AGGAGCAAAA 


12300 


GTATTTGGCC 


TTATAGGGGC 


TGAACGGATT 


CAATTTAATC 


AAAAAAGTCT 


CTGCTCTGGA 


12360 


GGTCC ACTTC 


CTGATAGTTC 


AGATTATCAG 


GGTGGAAATC 


TTCAGGATCA 


GTATGTTTTT 


12420 


TTAGCTGAGA 


TTCGGCAGGC 


TTTGGAGAAG 


AGAGATTACA 


ATCTGGCTAA 


GCAACTGGCT 


12480 


GAGCAGCACC 


TAATTCGGCC 


AAAAACGAGT 


CAATATGGGA 


CCTATCTCTC 


TTTTGGGGAT 


12540 


ATTCACATTG 


ACTTCAGCCA 


GCAAGGTACG 


ACTTTGTCTC 


AGGTGACGGA 


CTATCAGAGA 


12600 



no« or»o iao i _ 
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CAGCTGAATA 


TTACTAAGGC 


ACTTGCGACG 


ACTTCTTATG 


TCTATAAGGG 


AACGCGATTT 


12660 


GAACGTAAAG 


CTTTTGCCAG 


TTTTCCAGAT 


GATCTCTTGG 


TTCAATGTTT 


TACTAAGGAA 


12720 


GGGTTGGAAA 


CTCTAGATTT 


TACTATAGAA 


CTATCCTTGA 


CCTGTGATTT 


GCCTTCTGAT 


12780 


GGAAAGTATG 


AGCAGGAAAA 


ATCTGATTAC 


AAGGAGTGTA 


AGTTGGATAT 


TACTGATTCT 


12840 


CATATCTTGA 


TGAAGGGAAG 


AGTTAAGGAT 


AATGATCTGC 


GGTTTGCTAG 


TTATCTAGCT 


12900 


TGGGAAACCG 


ATGGAGATAT 


TAGAGTTTGG 


TCAGATAGGG 


TTCAGATATC 


AGGAGCCAGT 


12960 


TATGCCAATC 


TCTTCTTGGC 


CGCTAAGACG 


GATTTTCCCC 


AAAATCCTGC 


TACCAATTAT 


13020 


CGCAAGAAAC 


TAGATTTAGA 


GCAACAGGTG 


ATAGACTTGG 


TGGACACAGC 


TAAAGAAAAG 


13080 


GGCTATACCC 


AATTGAAATC 


AAGGCATATC 


GAGGACTACC 


AAGCCTTATT 


CCAGCGTGTT 


13140 


CAATTGGATT 


TGGAAGCTGA 


TGTTGACGCA 


TCCACTACAG 


ATGATTTGTT 


AAAAAATTAT 


13200 


AAGCCACAAG 


AAGGGCAGGC 


TTTGGAGGAG 


CTGTTCTTCC 


AGTATGGACG 


GTATTTATTG 


13260 


ATTAGTTCGT 


CCAGAGACTG 


CCCAGATGCT 


CTACCAGCTA 


ACCTACACGG 


AGTCTGGAAT 


13320 


GCGGTCGACA 


ATCCTCCTTG 


GAATTCCGAC 


TATCACTTAA 


ATGTCAATCT 


CCAGCTGAAT 


13380 


T ATTGGCC AG 


CCTATGTTAC 


CAATCTCCTA 


GAGACGGTCT 


TTCCAGTCAT 


CAACTATGTA 


13440 


GATGATTTGC 


CTCTCTATGC 


TCGTCTAGCC 


GCTGTAAAGT 


ATGCAGGAAT 


CGTCTCTCAG 


13500 


AAAGGTGAGG 


AGAATGCTTG 


GTTGG TTC AT 


ACTCAAGCGA 


CTCCCTTTGC 


TTGGACGGCA 


13560 


CCTGCTTGGG 


ATTACTATTG 


GGGTTGGTCA 


CCAGCTGCCA 


ATGCGTCGAT 


GATGCAAACC 


13620 


GTTTATGAAG 


CCTATTTATT 


TTATAGGGAC 


CAAGACTATC 


TCAGGGAGAA 


AATTTATCCC 


1368C 


ATGTTGAGGG 


AAACGGTTCC 


TTTTTGGAAT 


GCCTTTTTAC 


ATAAGGATCA 


GCAGCCGCAG 


13740 


CGTTGGGTGT 


CTTCTCCGTC 


TTATTCCCCA 


GAACATGGGC 


CGATTTCGAT 


TGGCAATACC 


13800 


TATGACCAAT 


CTCTGATTTC 


GCAGTTATTT 


CATGATTTTA 


TTCACGCTGC 


TCAGCAATTC 


13860 


CGACTGGATG 


ACGACTTGTT 


GACTGAGGTT 


AAGGAGAAGT 


CTGATTTACT 


AAATCCTTTG 


13920 


CAAATCACTC 


AATCTGGTCG 


AATCACCGAG 


TGGTATGAGG 


AGGAAGAGCA 


GTATTTTCAA 


13980 


AATGAGAAAG 


TGGAGGCCCA 


GCATCGGCAC 


CCTTCCCATC 


TAGTGGGACT 


CTATCCTGGC 


14040 


AATCTCTTTA 


GCTACAAGGG 


ACAAGAGTAT 


ATTGAAGCGG 


CGCGTGCTAG 


CCTCAATGAT 


14100 


CGTGCAGATG 


GCGGCACAGG 


CTGCTCCAAG 


CCTAATAAGA 


TCAATCTCTG 


GGCGCGTTTG 


14160 


GGAGATCGCA 


ATCCAGCCCA 


TAAATTATTG 


GCAGAGCAGT 


TAAAGACATC 


CACCTTCCAA 


14220 


AATCTTTGGT 


GTAGCCATCC 


TCCTTTTCAG 


ATAGATGGTA 


ATTTTGGTGC 


TACTAGTGGC 


14280 


ATGGCAGAAA 


T GTT ACTCCA 


GTCTCATGCA 


GCTTATCTGG 


TACCTCTAGC 


TGCCCTACCT 


14340 


GATGCTTGGT 


CAACAGCTTC 


TCTTTCAGCC 


TTAATGGCAC 


GTGGACATTT 


TGAAGTGAGC 


14400 
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ATGAGCTCGG 


AAGATAAAAA 


ACTCTTACAG 


TTGACCATTT 


TATCAAGGAG 


TGGAGGAGAT 


14460 


TTGCGAGTTT 


CTTATCCAGA 


TATTGAGAAG 


AGTGTCATTA 


AAATGAATCA 


AGAAAAAATA 


14S20 


AJUXGCGAAAT 


GCATGGGGAA 


AGATTGTATT 


TCGGTGGCAA 


CACCAGAAGG 


TGATCTTGTT 


14580 


f* A. A VTTTATV 


TTTAAGAAGA 


TGTTATAAGG 


CAGTAATTTG 


AAACTGCCTT 


TTAATAAGGA 


14640 


TTTAAGAATA 


TAAGCAGTTT 


TCAACTAGTT 


GAAAAAACGT 


TATAATGATA 


ATAGGAAGTA 


14700 




AAAATCAAAG 


AGCACAAACT 


AGGAAGCTAG 


CCGCAGGTTG 


CTCAAAACAG 


14760 


I \J I l i lA*#*V»lf 


TTGCAGATGG 


AAGCTGACGT 


GCTTTGAACA 


GAGATTTTCG 


AGGAGTATAA 


14B20 


iTllii i 1 Vj/\ 1 




TCTGATGGCT 


TATATTGAGA 


TGAAACACTG 


TTACAAGCGT 


14880 


TATUKiu 1 i 


CCCAPAPCGA 


GATTCTGGCC 


AATTGTGATG 


TGAATTTTGA 


CATTCAAAAG 


14940 




1 Inl i 1 * 




GGTCCAGGCA 


AGTCAACAGT 


TCTTAACCTT 


1S000 






TGATCAAGGG 


GAAATCTGGA 


TTGATGGTGT 


TAATATTGCG 


15060 


G ATT AT AG 11 






CGTAGAAATG 


ATGTGGGGTT 


TGTTTTTCAG 


15120 


mwmi m ill ft m 

TTTTATAATC 


I Ml 1T1 1 V. 1 *v* 




AAGGAAAATG 


TGGAACTGGC 


TTCTGAAATT 


15180 




rrTT<*:AATrr 


TG ATC AGGC C 


TTGACAGATG 


TAGGTCTGGC 


TCATCGTCTC 


15240 


AATAACTTTt. 




TTCTGGAGGG 


GACCAACAGC 


GAGTCTCCAT 


TGCACGCGCG 


15300 








GATGAACCGA 


CTGGAGCCTT 


GCATTATCAG 


15360 




Mvrv» I i i 1 


AATTCTCCAA 


CACATGTCTC 


CTCAAAAGGG 


AGCCACGGTG 


15420 




CTCATAATGG 


AGCTTTGGCG 


CCCATTGCTG 


ATCGCGTGAT 


TCAAATGCAC 


1S480 




TCAAGGATGT 


GGTCCTCAAC 


CAGCATCCTC 


AGGATATTGA 


CAGTTTGGAG 


15540 




ATCAAGCGAA 


AAACTTATTG 


GAAGGACTTA 


GTTCAGTCCT 


TCACAGGCTC 


15600 


CAAGGCGCGT 


TTTTTATCCA 


TCTTGATCCT 


GATGATGTTG 


CGATCTCTAG 


CCTTAGTAGG 


15660 


CCTCAAACTA 


ACCAGTCCCA 


ACATvK*ACUW 


rirsfirrAAT 

UALnwL 1 Ha 1 


fiCTT ATTT AA 

VW 1 J 4 A a 


CAACTCCTCA 


15720 


AACCTTGGAT 


TTGGCAGTCA 


TCTCTAACTA 


TGGCTTGGAT 


CAAGCAGACC 


AAGAAGAACT 


15780 


AAAACAGACG 


GAGGGCGCAG 


AGGTCGAGTT 


TGGCTATTTG 


ACAGATGTGA 


CTATGGATAA 


15840 


TGGGCAGGAT 


GCCATTCGGC 


TGTACTCCAA 


ACCAGAGCGA 


ATTTCAACCT 


TTCAGCTAAC 


15900 


AAAGGGACGA 


CTTCCTCAGT 


CAGACAAGGA 


AATCGCTTTG 


GCCACTCATT 


TCCAAGGCCA 


15960 


ATACAGCGTG 


GGACAGGAGA 


TTAGTTTTAA 


AGAAAAAGAA 


CAGGGTCATT 


CCTCTTTAAA 


16020 


AGACCATACT 


TATACCATTA 


CTGGTTTTGT 


GGATTCGGCT 


GAAATCCTCT 


CCCAGCGAGA 


16080 


TATGGGCTAC 


GCAGGAAGTG 


GAAGTGGGAC 


TCTGACAGCC 


TATGGGGTGA 


TTTTACCTAG 


16140 
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TCAATTTCAT 


CAGAAACTCT 


ACAATATAGC 


TCGTTTGAAA 


TATCAAGATT 




16200 


AAATGCCTTT 


TCATCAGCTT 


ATGAAGAAAA 


ATCCAACCAA 


CATwVAoAAG 




16260 


AATTTTATCA 


GATAATGCCA 


AGGTACGTCT 


GCAACTTTTG 


AAAAAAGAAG 


CACAAGAtjJu 


\ fit 20 


TCTAGACAAG 


GGGCAAGAGA 


CCCTTGACAA 


GGCTCAGACT 


AATTTGCAGG 


AAGGCAAGCG 


lOJOU 


TCGTTTAGCA 


GCTGCTCAAG 


CTCGTATACA 


GGCTCAAGAA 


AGTCAACTAG 


C CTTG ^. 


1 044U 


TCAAGTTCAG 


AGAGAGCAGG 


CTACTGCTCA 


ACTTACCCAA 


GCCAAGCAGG 


AATTGGGCAA 




GGAAGAGGAC 


AAACTAAACC 


AAGCTGAACA 


AAATCTAGCC 


CAAGAAAAGG 


AAAAATTAGA 


1 £C£A 
1 03DU 


AAAACATCAG 


CAAGTCTTGG 


ATGATTTGGC 


GGAGCCAAGG 


TATCAGGTTT 


ATAATCGTCA 


1 £ £ O A 


G AC CAT G CCA 


GCTGGTCAGG 


GCTATCTTAT 


GTATAGCAAT 


GCTTCATCCA 


GTATTCGAGC 


16680 


AGTGGGCAAT 


ATCTTTCCTG 


TGGTACTTTA 


TCCCGTAGCA 


GCCATGCTGA 


CCTTTACGAC 


16740 


CATGACTCCC 


TTTCTAGACG 


AAGAGCGAAC 


TCATGCAGGG 


ATTTTTAAGG 


CCTTGGGTTA 


16800 


TCGTAGTAAG 


CATATTATCC 


CCAAGTTTCT 


CCTTTATGGA 


CTAGTAGCTG 


GGACTGTCGG 


16860 


AACGGCTCTA 


GGTAGTATAC 


TTGGTCATTA 


TTTGCTAGCC 


AGTGTAATTT 


CAACTCTCAT 


16920 


TACAAAAGGC 


ATGGTGGTGG 


GAGAAACTCA 


GATTCACTTC 


TATTGCACCT 


ATAGCTTACT 


16980 


AGCTTTTGTC 


TTGAGCTTGT 


TGGCGAGTGT 


GTTACCAGCC 


TATCTGGTCC 


CTTGGAGGCA 


17040 


ACTTCATGAC 


GAAGCAGCCC 


AGCTTCTACT 


TCCTAAACCT 


CCTCTC AAAG 


GAGCTAAAAT 


17100 


CTTATTGGAG 


CGTATCCGTT 


TTATCTGGCC 


TCGTCTCAGT 


TTTACTCATA 


AGGTAACAGC 


17160 


CCGCAACATC 


TTTCGTTATA 


AGCAGAGAAT 


CTTGATGACA 


ATCTTTGGTG 


TGGCAGGTTC 


17220 


TCTAGCTCTG 


CTCTTTGCAG 


GTTTGGGAAT 


CCAATCTTCT 


CT ACCAGG AG 


TTCCGTCTAA 


17280 


ACAGTTTCAA 


CAAATCCAAC 


AGTATCAGAT 


GCTTGTCTCT 


GAAAATCCTA 


GTGCGACCAA 


17340 


TCAGGACAAG 


CTAGACCTAG 


CAGAAGTGTT 


GAAAGGGCAG 


GAGATACTAG 


CCTACCAGAA 


17400 


AATCTATTCT 


AAAGCGCTAT 


ACAAGGATTT 


CAAAGGCAAA 


GCTGGTCTTC 


AAAACATTAC 


1 T A £ A 


TCTTATGATG 


ATAGAGAAGG 


AACATTTGAC 


TCCCTTTATC 


CATCTTCAAC 


ATC ATCAGC A 


1 ' J«U 


GCAGCTGACA 


TTAAAACATG 


GCATCGTTAT 


TACAGCTAAA 


CTCGCCCAGC 


TGGCAGGTCT 


17580 


CAAGGTTGGG 


CAGACTTTAG 


AAATTGAAGG 


TAAGGAACTA 


AAGGTCGTTG 


CTATTACTGA 


17640 


GAACTACGTT 


GGTCACTTTA 


TTTATATCAG 


TCAGGCTAGC 


TATGAGCAAC 


TTTACGGACA 


17700 


GCTACCCCAA 


GCCAACACTT 


ATCTGGTCTC 


ATTAACGGAT 


ACCACTGCAA 


CTAGTATCGA 


17760 


AACTCAGGCG 


GGCTTGCTTA 


TGAATCAATC 


TGCGGTGTCC 


AGCCTTGTCC 


AAAATGCTTC 


17820 


AGCCATTCGA 


CTCTTCGACT 


CTATCGCTAG 


CTCACTCAAT 


CAGACCATGA 


CCATCTTGGT 


17880 


CATCGTATCG 


GTTCTATTAG 


CTATTGTCAT 


CCTTTACAAT 


CTCACCAATA 


TCAACGTAGC 


17940 
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TGAGAGAATC 


CGTGAACTCT 


CCftLi ATCAn 


Vjv> 1 J W * * ww * 


TTTCATAATA 


ATGAAGTCAC 


18000 


CCTCTACATT 


TACCGTGAGA 


CGATTGTGCT 




GGAATCGTAC 


TTGGTCTGAT 


16060 


AGCTGCTTTC 


TATTTACACC 


AArri i i\jAT 




TCGCCTGCGA 


CTATTCTCTT 


18120 


TTATCCGCAG 


GTAGGCTGGG 




AA 1 L^^AV* t w 


GCAGCAGTAA 


GCATCATTTT 


18180 


GACCTTGCTT 


GGTTTCTTCG 


TCAAT i A I I A 


TfTftA.flAAAG 


GTTGATATGT 


TAGAAGCCCT 


16240 


CAAATCTCTA 


GAGTAAGGTA 


GTTATTTTTA 


l un x i v*ma 


CTTCT A'ITT A 


CTAATATTCA 


16300 


AAAATCCTCC 


GTTTCAAAGA 


GCAGGGAACT 


CTTTGTGACA 


w AwV»A A * X 1 * 


TCTATAGGGC 


18360 


TTTAGCAGCT 


GCAATTGCGG 


CTTCGAAGTT 


Tu\i(. i LnuAA 


TTT.ATkTTAT 


CCACGTATTC 


16420 


AACGTAGCGA 


ATCCTATTGT 


CAGTATCGAG 


GACAAnUAL 1 


nmctrmcT a 


ATAGGTGCCA 


18460 


TTCGTTGATC 


AAGAGGGCAT 


AATCGCGCCC 


GAAAGAATGG 


TUAAAu i AO 1 


r*TCAAAGCAT 


18540 


AATGGCATTG 


TCAAGGCCTT 


CAGCACCGCA 


CCAACGTTTT 




CTAGGTCCAT 


18600 


TGAAACAGTC 


AATACGACCG 


TGTTGTCCAG 


TCCAGCCAAT 


TCTTCATTAA 


aapc ACGTGT 


18660 


TTGAGTTGAC 


CACATCCCTG 


TATCGATAGA 


AGGAACGACA 


WTCAAOAtTI 


1"!TTf TTGCC 


18720 


ATCAAAATCA 


GCCAGAGATT 


TTTT AG AAAG 


ATCTCTTGTA 






18780 


CTTGTCGCCG 


ACTTGTAGTT 


GTTTACCTGT 


AAAGCTCAQ.A 


wOA 14 1 ^^.VJ/* 


GAAAACTTAC 


18840 


CATAGGATAC 


TCCAATCTTT 


TTTCTTCCAT 






TTTTCCAATG 

4 4 4 4 W V* 4 


18900 


ATTTGACCGG 


AAATATGGGC 


ATAGAAAAAA 


CbCCAvj CTC A 




ACGTTTTTCA 


18960 


TAGGTTTATT 


TTGCCAATCC 


TTCAGCAATC 


TTGTCAAGGT 


1 w 1 A 1 1 1 +—r\ i 


CATC CTGTAG 


19020 


TAGCTCTCGC 


CTTCTTTACC 


TTGTTCTGCG 


ATAGAGTCAG 


TAAACATTTC 


ACCGTAGATT 


19080 


GGGATGTTTG 


TGTCTTGAGA 


AACACTTTTC 


ATTGGACGCT 


CATCCACACT 


TGATTCTACA 


19140 


AAGAGTGATG 


GAACTTTTCT 


TTGGCGAAGT 


TTTTCAACCA 


AGGTCTTGAT 


TTGTTCAGGA 


19200 


GTTCCTTCTT 


CTTCAGTATT 


GATTTCCCAG 


ATGTAAGCAC 


TTGGGACACC 


ATAGCCTTTA 




GAGAAGTATT 


TGAATGCTCC 


TTCGCTGGTT 


ACAATGAGTT 


TCTTTTCAGC 


ACGGATCTTA 


19320 


TTAAATTTAT 


CCTTACTTTC 


TTTATCAAGT 


TTGTCTAACT 


TATCACTATA 


TTCTTTCAGA 


19380 


TTTTTITCAT 


AGAATTCTTT ATTGTTAGGG TCTTTGCCGC 


TCAATTGTXT 


GCCGATATTT 


19440 


TTAGCAAAAA 


TAATACCGTT 


TTCAAGCTTA 


AGCCAAGCGT 


GTGGGTCTTC 


TTTTCCTTTT 


19500 


TCATTTTGAC 


CTTCAAGGTA 


GATAACATCA 


ACGCCGTCGC 


TGACTGCGAA 


GTAGTCTTTG 


19560 


TTTTCAGTTT 


TCTTGGCATT 


TTCTACCAAT 


TTTGTAAACC 


AACCATTGCC 


ACCTGTTTCA 


19620 


AGGTTGATAC 


CGTTATAGAA 


AATCAAATTA 


GCCTCAGAAG 


TTTTCTTAAC 


GTCTTCAGGA 


19680 
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ACTGGTTCGT 


ATTCGTGTGG 


CTCTTGCCCA 


ATCGGAACGA 


TACTATGAAC 


GTCAATTTTG 


19740 


TCACCAGCAA 


TATTTTTACT 


AATATCAGCG 


ATGATTGAGT 


TTCTAGCAAC 


AACTTTTAGT 


19800 


TTTTGACCAC 


AAGTTGTATC 


TTTTTTTCCG 


CTAGCACATG 


CTACAAGAAT 


GATTGCAGAA 


19860 


AGAAAGAGAA 


CGAGTAATGT 


ACCTAATTTT 


TTCATTAGAT 


CCTCCAATTT 


ATTACGGCTT 


19920 


TGCCCCTTAT 


TTTAACAAAT 


GTTTATTTTT 


CAGTTTCAAA 


TATCGTTGTT 


TGGGAGCGAT 


19980 


AAAGAAGCTA 


ATGAGAAAGA 


AACTAGCAGC 


TGTAAGCACG 


ATACTAGAAC 


CTCCCGCAAC 


20040 


ATTAAAACTA 


TAGCCAATAA 


ACAGTCCCAA 


AACTGAAGCA 


GTAGCTCCGA 


AGGTTGAGGA 


20100 


AAGGAAAATC 


ATACTTTTCA 


GACTATTAGC 


ATACAGATAA 


CCAGTTGCAG 


CTGGGGTAAT 


20160 


CAGCATGGCT 


ACAATCAGGA 


TAG1TCCGAC 


ACTTTGCATG 


GCTGTCACAG 


ACACGAGAGT 


20220 


CAGGAGTACC 


ATGAGAAGGT 


AGTGATAGAA 


ATTGACACGC 


ATTCCCATGG 


CTTTAGCCAA 


20280 


GAGTTCATCA 


AAGGAAGTTA 


TCAAGAGTTG 


CTTGAAGAAA 


ATCCAGATTA 


ACAAGAGGAT 


20340 


AGCTGCCCCC 


ACACCCATAG 


TAATAAACAT 


ATCCGTATCT 


TGGACGGCCA 


GGATATTACC 


20400 


AAAAAGGATA 


TGGAAAACGT 


CAGTTGAACT 


TTTAGCGACA 


CCAATCAAGA 


TGATACCGAC 


204 60 


GGCTAAGAAA 


GAAGAAAAGG 


TAATGCCGAT 


GGCGGTATCG 


CTTTTGATAA 


TCGAGTTTCC 


20S20 


TTTGATGTAG 


GTAATGATGA 


TGCCACCTAG 


C AATCC AAAG 


ACAATGGCTC 


CGATAAAGAA 


20560 


GTCAAGGCCC 


AAGATGAACG 


ATAGGGCTAC 


ACCTGCTAAG 


ACAGCATGTG 


AAATGGCATC 


20640 


TCCCATGAGT 


CACATCCCCC 


GTAGAATAAT 


GAAACATCCC 


ACAGCTCCAG 


CTACAATCCC 


20700 


GACGACAATA 


GCTGTTATCA 


AGGCATTTTG 


TAGGAAATGG 


AATTTTTGCA 


ATCCATCGAT 


20760 


AAATTCTGCA 


ATCATAGCTC 


ACCTCCATTG 


AAAAAGAGTT 


CATTACCGTA 


AGCTTCTTTT 


20820 


AGATTGGTTT 


CGGTAAAAGT 


TTCTTTTGTT 


CGACCAAAGG 


CAATCACTTC 


TCGATTGACA 


20880 


AGTAAGACTT 


GATCGAAGTA 


GTGGGGAATC 


TTGCTGAGGT 


CGTGGTGAAC 


GATGAGAACC 


20940 


CTCTTCCCAC 


CTTTTTTCAA 


ATCTCTCAGC 


GTATTCATGA 


TGATTTCCTC 


ACTGACAGAG 


21000 


TCAATCCCAG 


CAAAGGGTTC 


ATCCAAGAGG 


ATATAGTCCG 


CTTCCTGCAC 


CAAACATCTG 


21060 


GCAATCAAGA 


CCCGCTGGAA 


TTGACCTCCA 


GACAGTTGAC 


TAATTTGACG 


TTCAGCCTAG 


21120 


TCAGCTAGGC 


CGACGATTTC 


AAGGGCCTCT 


TGCACTTTCT 


TCCAATGTTT 


AGCCTTTAAA 


21180 


CTTCGAAAGA 


GAGGAATAGA 


GGGAAATACT 


CCTAACGAGA 


CGCATTCCTT 


GACCVFGATG 


21240 


GGAAAGTTGT 


AGTCGATATT 


GATTTTTTGT 


TCGACATAGG 


CAATTCGGTG 


TAAGGATTTT 


21300 


TTAACTTCCT 


TGTCATCGAG 


AAATGCCTGA 


CCTTGATGTG 


GGATAATTCC 


CAACATACCT 


21360 


TTTAATAGTG 


TTGATTTCCC 


AGCGCCGTTT 


GGACCAATCA 


TGCCCGTAAT 


TGTTGGTCCA 


21420 


TGGAGCACTA 


GTGAAATATC 


CTTAAGTGCC 


AACGTTTCTT 


TGTACGAGAC 


ACTGAGGTTT 


21480 
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TCGATACCTA TCATAAACTT CTATTCCTCC TGTCTCTTAA TATACATTAA AAAAAAAATT 21540 

AACTCAAGTT AATTTTTGAA AAAATTAAAA TAATAACTGA AAAATAGATT CTAAACATAA 21600 

CTTTCAGGAT AAATTTCTAA ATTATAAAAC GCATAGTATC AAGTGTAAAA AACTTGGAAT 21660 

TATCCCTTTT ATCATGGAAA GATTTTTTAT AATAGCTAAA AAATAA 21706 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GATCCCCAGG AAAAACCGAG GTTTTCCCAA TCAATCGTTA CTGTCATATT CCACTCCTTA 60 

TTCTAAAAAC CTATTTCTTA TATTCTACAC TATTTTTCTA AAATAGCAAG TATATTTTGT 120 

AATTTTCAGA AAATTTCTCC AATAAAAACC AACTCTTAGA ACTGATTCTT CATTTCACTT 180 

ATTTATCTTC ACT AACT A CT TCCTGAACAT AAGCCTCAAA AACTTCTTCA TCTGAAATCC 240 

TGTCACAAAT GAAGCTTCCA TTGCTAGTGC GTTCTGACAA CTTCAAGTCT TGCAATCGCC 300 

TTTCATAGAT TGTTCCTTTA TTGGATTGGA CAAGCAGACT TTGGTCGTTC ACATCCACTT 360 

CCGTACTGAA GAAATCCCCA ACAAATCCTT GCTCTGCAAC TGCTCCTGCC AAGAAGACAC 4 20 

GATCCGGTTT GTTTTTCAAC TCACGCAAGA CTTGTAATCC TCGTTTGGCA CGGCTGGTTG 480 

CTAGAATTTC CTCAATCGAA ACACGTTTCA AGCTTCCACG CTGGGTCAAG AGGTACAAGG 540 

ACGAAGTATT ACAGATAAAG CCAGATTGGA CCACATCATC TTCTTTCAAA TTCATAGCCT 600 

TGACACCTGC TGCCTTAGCA CCGACAACCG GAACCTCTTC GATATTGAAA CGCAGGGCAT 660 

AACCATTTTG ACTAACCAAG ACAACATCAT CTAGTTTAAT CGGAGCCACT GCTACAATCT 720 

GATCTGTATC GTCTTTGAGC TTAGCATACT TGACAGACTT AGATCTATAG GTCCGCCATG 780 

GAGTGAATTC TTTTCGCTCT ACCCCTTTGA TTTGACCAAG GCGAGTCACT CCAAAGTAGG 840 

TTGTCGCATC GTCAAACTCA TCCAGTACTT CCACATAAAG GATTTCTTCA TTCGTTTCAA 900 

AGTTT G TGAT GGTTTGGCTC AGATGCTCTC CGATGTCCTT CCAACGAATA TCTGCCAACT 960 

CATGGATTGG TCTGTAGATG ACATTTCCAA GACTTCTGAA CATCAAGAGG TGCTGGGTTG 1020 

TCTTGGCAGA TTGAACAAAA ATCAAACGGT CATCATCACG CTTGCCAATT TCTTCCAAGG 1080 

TGGAAGCCGC AAAGGAACGT GGACTGGTAC GCTTGATGTA ACCTGCCTTG CTCACGCTGA 1140 
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CGTAGGTATC 


TTCCTCAGCG 


ATAAGACTAG 


CTGTATCAAT 


CTCAATTGCT 


TTCGCAGTGT 


1200 


CTTCTAAAGA 


ACTCAAACGA 


GGAGTTCCAA 


ATTTCTTCTT 


CACCTCACGA 


ACTTCTTTCT 


1260 


TCATGAGATT 


GTACATACTC 


CTTTCATCAC 


CGATAATACC 


CGCCAGCATA 


GCAATCTTCT 


1320 


CACCAACCTC 


TGCTTCTTCT 


TCCTGCAAGA 


CAACCACATC 


GGTATTGGTC 


AAACGGTACA 


1380 


GTTGCAAACT 


TACCATAGCC 


TCAGCCTGTT 


CTTCCCTAAA 


ATCATAGCTA 


ACTTTGAGGT 


1440 


TTTCCTTGGC 


GTCCGCCTTA 


TTCTCAGAAG 


CACGGATAAG 


AGCAATGACT 


TCATCCAAAA 


1500 


TCGAAATCAC 


ACGAATCAAA 


CCTTCGACCA 


TATGGAGACC 


TTTCTCAGCC 


TTTTCTTTGT 


1560 


CAAAGCGTGA 


ACGCGCCAAA 


ATCACTTCTC 


GACGGTGAGC 


GATATAGCTA 


GACAGGATTG 


1620 


GAACAATCCC 


AACCTGACGA 


CGTGTGAAAT 


TGTCAATCGC 


CACCATATTA 


AAGTTGTAGT 


1680 


TGATTTGTAG 


GTCGGTGTAC 


TTAAATAACT 


AGTTGAGAAC 


AACCTCAGTA 


TTAGCGTCTT 


1740 


TCTTAAGTTC 


GATAGCGATA 


CGAACACCAT 


CACCGTCAGA 


CTCATCACGA 


ACCTCAGCAA 


1800 


TCCCACCTAC 


CTTGTTATTA 


ACACGAACAT 


CATCCATTTT 


CTTGACTAGA 


TTGGCCTTAT 


1860 


TCATTTCATA 


AGGAATCTCA 


ATAATAACGA 


TTTGTTCCTT 


ACCACCTTTT 


AGCTTTTCAA 


1920 


TTTCAGTCTT 


GGAACGAACA 


ACCACGCGCC 


CTTTCCCAGT 


CTCATAAGCT 


TTCTTGATTT 


1980 


CATCACGACC 


CTGAATAATA 


GCCCCTGTAG 


GGAAGTCTGG 


TCCAGGCAAG 


AATTCCATGA 


2040 


GTTTATCAAT 


CTTTGCAGTT 


GGGTGGTCAA 


TCATGTAAAC 


TGCAGCATCT 


ATGACCTCAG 


2100 


CTAAATTATC 


GGGAGGAATG 


TCTGTGGCAT 


AACCAGCCGA 


AATCCCAGTC 


GAACCATTGA 


2160 


CCAAGAGCTT 


TGGAAAGGCT 


GCTGGCAAGA 


CCGTTGGTTC 


TTTCTCCGTA 


TCGTCAAAGT 


2220 


TC C ATGC AAA 


AGGAACTGTC 


TTTTTCTCGA 


TATCCTGAAG 


AAGGTAGCCT 


GCAATTTCAG 


2280 


ACAAACCTGC 


CTCAGTATAA 


CGCATAGCCC 


CAGGAGGATC 


TCCGTCCATA 


GAACCGTTAT 


2340 


TACCGTGCAT 


TTCAACTAGA 


ATCTCACCAT 


7TTTCCACTT 


CTGTGACATA 


CGAACCATGG 


2400 


CATCATAGAT 


AGAAGAATCC 


CCGTGTGGGT 


GGAAATTCCC 


CATGATGTTC 


CCGACTGACT 


2460 


TGGCCGACTT 


ACGGTAGCTC 


TTGTCAAAAG 


TATTGCTATC 


CTTATTCATA 


GAATAAAGAA 


2520 


TACCGCGCTG 


AACCGGCTTC 


AACCCATCAC 


GAATATCTGG 


CAAAGCCCGG 


TCTTGAATAA 


2580 


TGTACTTGGA 


GTAGCGACCA 


AAGCGCTCTC 


CCATGATGTC 


CTCCAGGGAC 


ATGTTTTGAA 


2640 


TGTTAGACAT 


AAGATACAAA 


GCCCATAAAA 


TACCAACTGA 


AAATAGAAAA 


TTCTTGAAGT 


2700 


AAGCAAACTC 


ACAAGAGAAT 


TTATCTTTTT 


CACACAGTAT 


CTAGGGCGTG 


TTCAACTCCT 


2760 


TTCAAAGAAT 


GTAGAGTAGG 


TTTTTATGCA 


GTAAAAGATA 


TTTTACGGGA 


ATTCCTCCCG 


2820 


TGTTCAGTTA 


CGATAAGTAA 


CCAAACTATC 


CTGTTTGTAT 


TTTTCAATAT 


GAAAATCTGG 


2880 


TTTTCCAAAA 


TTAGTCTTAG 


TTTGTGTCTT 


AGCCGCTCCC 


TTAAGCGCCT 


CTTTGAGATA 


2940 
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ACCACTCATA GCAGATTCTT CATTAATAAT CCTGCAATTT TTTCAAACCA AGATTTTCAA 3000 

ACTCCTTTTT CACATAGTCA TTCACATCCC ACTCTAATTT CCACTTTACT AACATATTAT 3060 

TTTCTTTCAT TAAAACACTC TCGTTTCTTC TAGCGTAAAC TTGACATTAT CTTCAATCCA 3120 

TTTACGGCCT GGTTCTACCT TATCTCCCAT GAGAACATTC ACGCGCCGTT CGGCGCGCCC 3180 

TAAATCTTCA ATTGTGACAC GGATGAGGGT ACCTGTTTCT GGGTTCATGG TTGTTTCCCA 3240 

CAGCTGGTCC GCATTCATCT CACCAAGTCC TTTGTATCGT TGGAGGGTAG CGCCTTTACC 3300 

GAACTGTTTA CGGAGTTCTT CTA G TT CT CC GTCCGTCCAA GCCTAGGCCA CTTCTTCTTT 3360 

CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA CATGACCTGC 3420 

CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT GGATATGGGC 3480 

ACCGTCGGTA TCCGCATCGC TCATGATAAT GATCTTATCA TACTTGGCAT CTTCAATAGA 3540 

CAACTCTGCT CCAACACCCG CACCAATGGT ATAAATCATG GTATTCATCT CTTCATTTTT 3600 

GAGGATATCC GCCATCTTGG CCTTGGCTGT ATTGACAACC TTACCACGAA CAGGTACAAT 3660 

AGCCTGCAAC TTGCGGTCAC GACCTTGTTT GCCAGAACCA CCGGCAGAGT CCCCCTCAAC 3720 

TAGATAGACT TCATTCTTAG CAGGATTCTT ACATTGGGCT GGGGTCAATT TCCCAGACAA 3780 

CAAGCCCTTA TCTTTCTTGT TTTTCTTCCC ATTTCGGCTC TCATCACGCC CCTTACGTGC 3 840 

TGCTTCACGA GCATCACGGG CCTTGATAGC CTTGCGCATG AGGTTAGAAG CTAATTCCCC 3900 

ATTTTCCATA AGGAAAAAGG TCAACTTATC ACCCACTATT CCATCCACAA CTGGGCGAGC 3 960 

TAGGGGGCTT CCTAGTTTAT CCTTGGTCTG TCCTTCAAAC TGCAAGTCTT CTTCAGGAAC 4020 

TAAGATAGAA ACAACGCCCG CTAGTCCCTC ACGATAGTCT GAACCTTCAA GCTTTTTATC 4080 

TTTTTCCTTG AGAAGACCTG TTTTACGTGC ATAGTCATTC ATGACCTTGG TAATGCCAGA 4140 

CTTGAGTCCT GTCTCGTGCG TTCCACCGTC CTTCGTCCGA ACCTTATTGA CAAAACATAG 4200 

AATGTTATCT GAGAATCCCT CATTGTACTG GAGGCCTACT TCCACTTGAA AACCATTGTC 4260 

TTCCCCTTCA AAGTAAAGAA CTGGCGTCAA GATTTCCTTA TCTTCGTTGA GATAAGAAAC 4320 

AAAAT CT T GT ACTCCATTCT CATACTGGAA CTCAATCGCT TCATTTGTTC GCTTGTCCGT 4 380 

TAAAGACAAG GTCACATTTT T CAAG AG AAA GGCTGATTCA TTAAGGCGCT CTCAAATGGT 4440 

ATTCTACTTG AAATCTGTCG TAGAAAATAT AGTCGCGTCA GGCATAAAAG TAACTTTGGT 4 500 

GCCTGTTTTA GACTTGGGTG CTGTACCGAT TTTCTTCAAA GTCGTGACAG GTTTTCCACC 4 560 

ATTTTCCAAA CGTTGCTTGT AAACTGCGCC ATCACGGGTA ATTTCAACTT CTAACCAGCT 4 620 

AGAAAGGGCG TTAACAACGG AAGAACCCAC TCCCTCAAGT CCACCTGATG TCTTATAGCC 4 680 
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ACCTTGACCG 


AATTTCCCTC 


CGGCATGAAG 


AATGGTAAAG 


ATAACCTCAA 


CAGTTGGAAT 


4740 


TCCCATAGCG 


TGCATACcTG 


TCGGCATCCC 


ACGTCCATGG 


TCTTGAACCG 


TTAGACTACC 


4800 


CTCTTTATTG 


ATAGTTACAT 


CAATACGATC 


ACCAAACCCA 


GACAAGGCTT 


CATCGACTGC 


4860 


ATTATCAACG 


ATTTCCCAAA 


CTAGGTGATG 


AAGACCAGCG 


CCATCGGTCG 


ATCCAATATA 


4920 


CATCCCTGGA 


CGTTTTCGGA 


CCGCATCCAA 


CCCTTCTAGC 


ACCTGAATAG 


CATCATCATT 


4980 


ATAATTGTTA 


ATATTGATTT 


CCTTTTTTGA 


CACAAGGAAC 


CTCCTATTCG 


TTCATCTTTA 


5040 


CTATTCTACA 


GGTTTTCCAA 


GGATTTTGCA 


AAATTTTTCT 


TTCTCCGATC 


TCACAATTTC 


5100 


AGCAGACATT 


CTCTGCTTTT 


CTTTCCCAAT 


TCATGATATA 


ATAGGAGTAT 


GATTACAATA 


5160 


GTTTTATTAA 


TCCTAGCCTA 


TCTGCTGGGT 


TCGATTCCAT 


CTGGTCTCTG 


GATTGGACAA 


5220 


GTATTCTTTC 


AAATCAATCT 


ACGCGAGCAT 


GGTTCTGGTA 


ACACTGGAAC 


GACCAACACC 


5280 


TTCCGCATTT 


TAGGTAACAA 


AGCTGGTATC 


GCAACCTTTG 


TGATTGACTT 


TTTCAAAGGA 


5340 


ACCCTAGCAA 


CCCTGCTTCC 


GATTATTTTT 


CATCTACAAG 


GCGTTTCTCC 


TCTCATCTTT 


5400 


GGACTTTTGG 


CTGTTATCGG 


CCATACCTTC 


CCTATCTTTG 


CAGGATTTAA 


AGCTGGTAAG 


5460 


GCTGTCCCAA 


CCAGTGCTGG 


AGTGATTTTC 


GGATTTGCCC 


CTATCTTCTG 


TCTCTACCTT 


5520 


GCGATTATCT 


TCTTTCGAGC 


TCTCTATCTT 


GCCACTATGA 


TTTCACTGTC 


TAGTGTCACA 


5580 


GCATCCATTG 


CGGCTGTTAT 


CGGGGTTCTG 


CTCTTTCCAC 


TTTITGGTTT 


TATCCTGAGT 


5640 


AACTATGACT 


CTCTCTTCAT 


CGCTATTATC 


TTAGCACTTG 


CTAGTTTGAT 


TATCATTCGT 


5700 


CATAAGGACA 


ATATAGCTCG 


TATCAAAAAT 


AAAACTGAAA 


ATTTGGTCCC 


TTGGGGATTG 


5760 


AACCTAACCC 


ATCAAGATCC 


TAAAAAATAA 


AATGCCAGTT 


CTCTACTGCC 


CCCAAACAGT 


5820 


TAGACAAATA 


ATTTATCCAA 


AGGATTTAGT 


TCTGTACTGC 


ACAGGACTAA 


GTCCTTTTAG 


5880 


TTTTACCTTA 


ATTCGTTTGT 


TGTTGTAGTA 


ATCAATATAC 


TCTATAATGG 


CTTGTTCCAA 


5940 


TTGATTAAGT 


GATTTAAATG 


TTTTCTCATA 


GCCATAAAAC 


ATTTCCGATT 


TTAAAATGCC 


6000 


AAAGAAAGAT 


TCCATCCTAC 


CCTTGTCTTG 


GCTGTTGCCC 


TTACCTGACA 


TGGATGCTTG 


6060 


AATTCCCTTA 


CTCTCTAGGA 


ACCGATGATA 


AGAATCGTGT 


TGCTATTCCC 


ACCCTTGGTC 


6120 


ACTATGGAGA 


ATCCTATTCT 


CGTAGTGCTT 


CTCTGTGAAT 


GCCTGTTCCA 


A 


6171 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18475 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(Dl TOPOLOGY: linear 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



TATTAUW* I 


AAAAAAACGG 


AGGAGTGCTT 


TATCAAAGCC 


TATACTTATG 


TTAAACCAGG 


60 


ACTTGCTTCT 


1 1 4 w 1 A 1 w 


TAGACAAACC 


AGTTATTCGC 


AACCCAACAG 


ACCCTATTGT 


120 


GCGTATTGTA 




TTTGTGGAAC 


AGACCTCCAT 


ATTATCAAAG 


GGGATGTTCC 


180 


TACTTGCL-AA 




TTCTTGGCCA 


CGAAGGGATT 


GGGATTGTTG 


AAGAAGTTGG 


240 


GGAACGAO l I 




AAAAAGGTGA 


CAAGGTCTTG 


ATTTCTTGCG 


TCTGTGCCTG 


300 


TGGTAAATGC 




AAAAACSCAAT 


TTATGCTCAC 


TGTGAAGACG 


AAGGGGGCTG 


360 


GATTTTCGGT 


CACTTGATTG 


A 4 A 4 


fifiCTGAATAT 


CTACGTGTCC 


CTCATGCAGA 


420 


TAATACTCTT 


TACCATACTC 


v_AL»AAV*Al. 1 1 


CTCACATGAA 


GCTTTGGTTA 


TGCTGTCAGA 


480 


CATTCTGCCT 


ACTGGATATG 


AAA 1 I 1\# 4 


PTTAAAAGGC 


AAACTAGAAC 


CTGCTTGCAG 


540 


CGTAGCCATT 


ATTGGTTCAG 


/TV**" h.fYTTClCl 

\3 i V_ V_/*^J A 1 WW 


ATTGCCTGCT 


CTTTTAACAG 


CCCAATTCTA 


600 


TTCACCAGCT 


AAATTGATTA 




ACACGATAAC 


CGCTTGGAAA 


CTCCCCTATC 


660 


ATTCGGTGCG 


ACTCATAAGG 


TTtA'iTrT'Pr 

11 1 1L 1 IV. 


AHA CCCTG AA 


AAAGCCATTA 


AAGAAATTTA 


720 


TGATTTGACA 


GATGGTCGTG 




CCCTATCGAA 


GCTGTTGGT A 


TTCCTGCAAC 


780 


ATTTGATTTC 


TGTCAAAAGA 




AnACCCAACC 


CTTCCCAACT 


GTGCTGTGCA 


840 


TGGTAAACCA 


GTTGAATTCG 


A 1 1 1 Awft A A** 


A. f rTTPCGAITF 
4 4 4 w*» * * 


CGCAACATCA 


ATGTAACAAC 


900 


TGGTTTCGTA 




rCACTCCACA 


ATTGTTGAAA 


GCACTTGAAA 


GTCATAAGAT 


960 


TGAACCGGAA 


AAA 4 1 uu 1 At 




CAAACTCAGT 


GAAATTGAAA 


AAGCCTACCA 


1020 


AGTCTTCAGT 


AAuwLAWi. A\J 


Aff ACCATGC 


CATTAAGGTC 


ATTATCGAAA 


ACGATATCTC 


1080 


AG AAG C CTAA 


1* I Aw 1 AAAA/% 


A • » 4 A 4 * • *» * •* 


CATAAGTAAA 


TAGAAATTCA 


CTCATCCATC 


1140 


AGATCCCTGC 


ATITTTTATC 


AAAAAATTAA 


GAAATGAGCA 


TA'l 1 inw 


PTTflTPTCGC 

W 4 4 w * V» 4 V*\*W 


1200 


GGAATTGGTT 


ATAATATACG 


CTACAAAGGA 


ATGAATGAAT 


ATGTATCCTC 


TTATACAAAT 


1260 


GTACGGAGAT 


TTTGAACCGT 


GGTGGTTCTT 


AGAAGGTTGG 


GAAGAAGATA 


TTGTAGCAAG 


1320 


TAGAAAATTT 


GACCAGTATT 


ATGATG CTCT 


CAAATACTAC 


AAAACTTGCT 


GCTTTAGATT 


1380 


GGAACAAGAA 


TCGCCTCTTT 


ATAAAAGTAG 


AAGCGACTTG 


ATGACCATTT 


TTTGGGACCC 


1440 


GGAAGACCAA 


CGCTGGTGTG 


ATGAATGTGA 


TGAGTATTTA 


CAACAATACC 


ATTCTTTGGC 


1500 


TCTTTTGCAG 


GATGAGCAGG 


TTATCCCAGA 


CGAAAAACTA 


CGCTCAGGCT 


ATGAAAAACA 


1560 


AACCAGTCAG 


GAAAGGAATC 


CTTCTC'GCCG 


TATGAAATTA 


AAATAGAGAA 


AAGTAACTTT 


1620 


TTTGGAGTTG 


CTTTTTTTAT 


TTTTCTAACT 


CTTTGCGAAT 


AGTATAGGTG 


AGGAGGTAAG 


1680 
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TATGGTTCAA 


GAAATTGCAC 


AAGAAATCAT 


TCGTTCAGCT 


CGGAAAAAAG 


GCACGCAGGA 


1740 


TATCTATTTT 


CTCCCTAAGT 


TAGACGCCTA 


TGAGCTTCAT 


ATGACGCTAG 


GAGACGAGCG 


1800 


CTGTAAAATT 


GGTAGCTATG 


ATTTTGAAAA 


GTTTGCAGCC 


GTTATCAGTC 


ACTTTAAGTT 


1860 


TGTGCCGGGT 


ATGAATGTGG 


GAGAAAAAAG 


ACGTAGTCAA 


CTGGGTTCCT 


GTGATTATGC 


1920 


CTATGACCAT 


AAGATAGCGT 


CTCTACGTTT 


ATCTACTCTA 


GGCCATTATC 


GGGGGCATGA 


1980 


CAGTTTGCTT 


ATCCGTTTCT 


TGCACGATGA 


GGAGCAGGAC 


CTGCATTTTT 


CGTTTCAGGA 


2040 


TATTGAAGAA 


TTAGCCAAGC 


AGTACAGGCA 


ACGGGGACTC 


TATCTTTTTG 


CTGCTCCGCT 


2100 


TGGGAGTGGT 


AAGACGACCT 


TGATGCATGA 


ATTCTCCAAG 


TCACTCTTTA 


AAGGACAGCA 


2160 


AGTTATGTCC 


ATCGAAGATC 


CTGTCGAAAT 


CAAGCAGGAC 


GACATGCTTC 


AGTTGCAGTT 


2220 


GAACGAAGCA 


ATCGGCCTAA 


CCTATGAAAA 


TCTAATCAAA 


CTTTCCTTGC 


GTCATCGACC 


2280 


AGATCT CTTG 


ATTATCGGAG 


AAATTCGTGA 


CAGCGAGACG 


GCGCCTGCAC 


TGGTCACAGC 


2340 


TAGTTTGACA 


GGTGCGACAG 


TCTTTTCAAC 


CATTCACGCC 


AAGAGTATCC 


GAGGTGTTTA 


2400 


TGACCGTCTG 


CTGGAGTTGG 


CTCTGAGTGA 


AGAAGAATTC 


CCACTTCTTC 


TGCAAGCACT 


2460 


CTGCTACCAG 


AGATTAATCG 


GGGCAGGAGG 


AATCGTTGAC 


TTTGCAACCA 


GACATTATCA 


2520 


AGAACACCAA 


GCAGCCAAGT 


GGAATGAGCA 


AATTGACCAG 


CTTCTTAAAG 


ATGGACATAT 


2580 


CACAACTCTT 


C AGGCTG AG A 


CGGAAAAAAT 


TAGCTACAGC 


TAAGCAAAAA 


AATATCATCA 


2640 


CCCTATTTAA 


CAATCTCTTT 


TCTAGCGGTT 


TTCATCTGGT 


GGAGACTATC 


TC CTTTTT AG 


2700 


ATAGGAGTGC 


TTTCTTGGAC 


AAGCAGTGTG 


TGACCCAGAT 


GCGTGTCCCC 


TTGTCTCAGC 


2760 


CGAAATCATT 


CTCAGAAATC 


ATGGAAACTT 


TGGGATGTTC 


AAGTGCTATT 


GTCACTCAGT 


2820 


TATCCCTAGC 


TGAAGTTCAT 


GGCAATCTCC 


ACCTGAGTTT 


GCGAAAGATA 


GAAGAATATC 


2880 


TGGACAATCT 


GGCTAAGCTC 


AAGAAAAAAT 


TGATTGAACT 


AGCGACCTAT 


CCCTTGATTT 


2940 


TGCTGGGTTT 


TCTTCTCTTA 


ATTATGCTGG 


GGCTACGGAA 


TTACCTGCTC 


CCACAACTGG 


3000 


ATAGTAGCAA 


TATTGCCACC 


CAAATTATCG 


GTAATCTCCC 


CCAAATTTTT 


CTAGGCATGC 


3060 


TACGGCTTGT 


TTCCCTGCTT 


GCCCTTTTAG 


CACTCACTTT 


TTATAAAAGA 


AGTTCTAAGA 


3120 


TGAGTGTCTT 


TTCTATCTTA 


GCACGCCTTC 


CCTTTATTGG 


AATCTTTGTG 


CAGACCTACT 


3180 


TGACAGCCTA 


TTATGCACGT 


GAATGGGGGA 


ATATOATTTC 


ACAGGGAAYG 


GAGTTGACGC 


324C 


AGATTTTTCA 


AATGATGCAG 


GAACAAGGTT 


CCCAGCTCTT 


TAAAGAAGTC 


GGTCAAGATC 


3300 


TGGCTCAAAC 


CCTGAAAAAT 


GGCCGTGAAT 


TTTCTCAGAC 


GATAGGAACC 


TATCCTTTCT 


3360 


TTAGGAAGGA 


ATTGACTCTC 


AT CAT AG ACT 


ATGGGGAAGT 


TAAGTCCAAG 


CTGGCTAGTG 


3420 


AGTTGGAAAT 


CTATGCTGAA 


AAAACTTGGG 


AACCCTTTTT 


TACCCGAGTC 


AACCGCACCA 


3480 
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TGAATTTGGT 


GCAGCCACTG 


GTTTTTATCT 


TTGTGCCACT 


n at*t A***c fyrr 


I 1 /\*» 1 1 i #* * 


3540 


CGCCAATCCT 


CATGCCCATG 


TATCAAAATA 


TGGAGGTAAA 


TTTTTAAAA*P 


flAAAA.AJfcvATG 


3600 


ATCACATTCT 


TGAAAAAAGC 


TAAGGTTAAA 


GCTTTTACAT 


luvi luunuA 1 




3660 


TTGCTGATTA 


TCAGCGTGCT 


TTTCTTGCTC 


TTTGTACCTA 


AiW luAL^AA 


UWvVWWUIIA 


3720 


GCAGTCAATG 


ACAAAGGAAA 


AGCAGCTGTT 


GTTAAGGTGG 


IwuAAAuLLA 




3780 


TATACCTTAG 


AAAAGAATGA 


AGATGCTAGC 


CTAAGAAAGT 


1 ALAAuLAuA 




3840 


ACGGAAGAAC 


AGGCTAAAGC 


TTATAAAGAA TACAATCATA" 


AAAATGGAGG 


AGCAAATU\jT 


J JUw 


AAAGTCAATG 


ATTAAGGCCT 


TTACCA1\#CT 




TTGGTTTTGG 




j y o w 


TATCCTTGCC 


TTGGGCTTGT 


CCGGCTCTviT 


LLAul LLAW 1 


TTTTCAGCGG 


TAGAGUAACA 




GATTTTCTTT 


ATGGAGTTTG 


AAGAALlVlA 




CAAAAACGCA 


GTGTAGCCAG 


4 uou 


TCAGCAAAAG 


ACTAGTCTGA 


ACTTWiATWi 




AGCAATGGCA 


GTCAAAAGTT 


4 14U 


GCCACTCCCT 


AAACGAATTC 






ATTACATTTG 


ACCGAGCTGG 


42UU 


GGGCAATTCG 


TCCCTGGCTA 






AAAGGAGCGA 


TTCGCTATCA 


4 1 OU 


ATTATATCTA 


GGAAATGGAA 






ACAAAAAATT 


AGGGCAGTGA 


4 J^U 


TTTTACTGGA 


ACCACTAGTC 


GCTCTAGCTA 


TCTTTGCCAG 


CATTGCGACC 


CTCCTTTTGC 




GACAAATTCA 


AAAAAATAGG 


CAAGAGGAAG 


CAAAAATCTT 


GCAAAAGGAA 


GAAGTCTTuA 


4440 


GGGTAGCTAA 


GATGGCCCTG 


CAGACGGGGC 


AAAATCAGGT 


AAGCATCAAC 


CGAtjilliAbA 


4 3UU 


TTCAGCTATT 


TTCTAGTGAA 


AAAGGATTGG 


AGGTCTACCA 


TGG I lCAGAA 


CAuHVliVAi 


H JOU 


CAATCAAAGA 


GCCATAAGGT 


CAAGGCTTTT 


ACCTTGTT AG 


AATCCCTGCT 


TGCCCTCATT 


4620 


GTCATCAGTG 


GGGCATTACT 


CCTTTTTCAA 


GCTATGAGTC 


AGtTCtTLAT 


ft & ft.^^*?^T 


4680 


CGCTACCAGC 


AACAAAGCGA 


GCAAAAGGAG 


TGGCTCTTGT 


I i N-j luuAwwn 


#\t 1 tUAVAl •> *% 


4740 


GAATTAGACC 


GTTCGCAGTT 


CGAAAAAGTA 


GAAGGCAATC 


CCCTATACAT 


GAAGCAAGAT 


4800 


GGCAAGGACA 


TCGCCATCGG 


TAAGTCAAAG 


TCAGATGATT 


TCCGTAAAAC 


GAATGCTCGT 


4860 


GGTCGAGGTT 


ATCAGCCTAT 


GGTTTATGGA 


CTCAAATCTG 


TACGGATTAC 


AGAGGACAAT 


4920 


CAACTGGTTC 


GCTTTCATTT 


CC AGTTCCAA 


AAAGGCTTAG 


AAACGGAGTT 


CATCTATCGT 


4980 


GTGGAAAAAG 


AAAAAAGTTA 


AGGCAGGTGT 


TCTCCTCTAC 


GCAGTCACCA 


TAGCAGCCAT 


5040 


CTTTACTCTT 


TTGTTGCAAT 


TTTATTTGAA 


CCGACAAGTC 


GCCCACTATC 


AAGACTATGC 


5100 


TTTGAATAAA 


GAAAAATTGG 


TTGCTTTTGC 


T ATGG CT AAA 


CCAACCAAAG 


ATAAGGTTGA 


5160 


GCAAGAAAGT 


GGGGAACAGT 


TTTTTAATCT 


AGGTCAGGTA 


AGCT AT C AAA 


ACAAGAAAAC 


5220 
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TGGCTTAGTG ACCAGGGTTC GTACGGATAA GAGCCAATAT GAGTTTCTGT TTCCTTCAGT 
CAAAATCAAA GAAGAGAAAA CAGATAAAAA GGAAGAGGTA GCGACCGATT CAAGCGAAAA 
AGTGGAGAAG AAAAAATCAG AAGAGAAGCC TGAAAAGAAA GAGAATTCAT AGTCAATTCA 
ACTATAATGC GTTGAATCCA GAATACTCCA CTGTACTTTC TAGAAAATTG CTGGAAATGG 
ATGTTAAGCT CCAATTCATT TGTTTATATC TTATTTCAGT TTACTATACT TTGTGCTAAA 
TTAAAGATAT GAAACATCAT TTTAACCACA AACCAGAAAC TTTCGATTCC CCTAAAAATA 
TCTTCCTCGC AAACTTGGTA TGTCAAGCAG CCGAGAAACA GATTGATCTT CTATCAGACA 
AAGAAATTTT AGATTTCGGT GGTGGCACGG GTCTATTAGC CTTGCCCCTA ACCCCTACCC 
AAGCAGGCTA AGTCAGTCAC TCTTGT AG AC ATTTCTGAGA AAATGTTGGA GCAAGCTCGT 
TTGAAACTGG AGCAGCAACC AATCAAGAAT ATCCAGTTTT TGGAGCAAGA TTTACCGAAA 
AATCCCTTGG AGAAAGAGTT TGATTGCCTT GCTGTTACTC GGGTTCTTCA TCATATGCCT 
GATTTGGATG CGGCTCTCTC ACT GTTTC AT CAACATTTGA AGGAAGATGG G AAACTC ATC 
ATTGCTGATT TTACCAAGAC AGAAGCTAAT CATCATCCAT TTGATTTACC TGAACTGGAA 
AACAAGCTAA TTGAGCATGG TTTTTCATCT CTGCATAGTC AGATTCTCTA TAGTCCTGAA 
GACCTGTTTC AAGGAAATCA CTCAGAATTC TTTTTAATAG TACCCCAAAA ATCACTCGCC 
TAGTCAGGGA GTGATTTTTC TATAAGGATG GAAAAAAGAA GGCAAATTTG CTAAGATAGG 
AATATGGATT TTGAAAAAAT TGAACAACCT TAT AC CT ATT TACT AG AG AA TGTCCAAGTC 
ATCCAAAGTG ATTTGGCGAC CAACTTTTAT GACCCCTTGG TGGAGCAAAA TAGCATCTAT 
CTGGATGGTG AAACTGAGCT AAACCAGCTC AAGGAGAACA ATCAAACCCT TAAGCGTTTA 
GCACTACGCA AAGAAGAATG GCTCAAGACC TACCAGTTTC TCTTGATGAA GGCTGGGCAA 
ACAGAACCCT TGCAGCCCAA TCACCAGTTT ACACCGGATG CTATTGCTTT GCTTTTGGTG 
TTTATTGTGG AAGAGTTGTT TAAAGAGGAG GAAATTACTA TCCTCGAAAT GGGTTCTGGG 
ATGGGAATTC TAGGCGCTAT TTTCTTGACC TCGCTTACTA AAAAGGTGCA TTACTTGGGA 
ATGGAAGTGG ATGATTTGCT GATTGATCTC GCAGCTAGCA TGGCAGATGT AATTGGTTTG 
CAGGCTGGCT TTGTCCAAGG AGATGCCGTT CCCCCACAAA TGCTCAAAGA AAGCGATGTG 
GTCATCAGTG ACTTGCCTGT CGGCTATTAT CCTGATGATG CCGTTGCGTC GCGCCATCAA 
GTTGCTTCTA GCCAAGAACA TACTTACGCC CATCACTTGC TCATCCAACA AGGCCTTAAG 
TACCTCAAGT CAGACGGATA CGCTATTTTT CTAGCTCCGA GTGATTTGTT GACCAGTCCT 
CAAAGTGATT TGTTAAAAGA AT GG C TG AAA GAAGAGGCGA GTCTGGTTGC TATGATTAGT 
CTGCCTGAAA ATCTCTTTGC TAATGCCAAA CAATCTAAGA CTATTTTTAT CTTACAGAAG 
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AAAAATGAAA 


TAGCACTAGA 


GCCTTTTCTT 


TATCCACTTG 


CTAGCTTGCA 


ACATGCAACT 


7080 


GTTTTAATGA 


AATTTAAAGA 


AAATTTTCAA 


AAATGGACTC 


AAGGTACTCA 


AATATAAAAT 


7X40 


AGATTTTGTT 


ATAATAGTTG 


AAAACGCTTA 


AAAAGGGGTA 


TCATGTTATG 


ACAAAAACAA 


7200 


TTGCAATCAA 


TGCAGGAAGT 


TCAAGTTTGA 


AATGGCAATT 


ATACTTAATG 


CCAGAAGAAA 


7260 


AACTATTGGC 


GAAAGGTTTG 


ATTGAACCTA 


TCGGTTTGAA 


AGATTCAATT 


TCAACTGTAA 


7320 


AATTTGACGG 


CCCTTCTGAA 


CAACAAATTT 


TGGATATTGA 


AAATCATATA 


CAAGCCGTTA 


7380 


AAATTTTATT 


GGATGACTTG 


ATTCGTTTCG 


ATATTATCAA 


GGCTTATGAC 


GAGATTACAG 


7440 


GTGTTGGACA 


TCGTGTTGTT 


GCTGGTGGAG 


AATATTTCAA 


AGAATCAACA 


CTTGTTGAGG 


7500 


GAGATCTTTT 


AGAAAAAGTT 


GAAGAGTTGA 


GTTTGTTCGC 


TCCTCTACAC 


AACCCGGCCA 


7S60 


ATGCAGCAGG 


TGTTCGTGCC 


TTCAAGGAAT 


TGTTGCCAGA 


CATTACCAGT 


GTAGTTGTTT 


7620 


TTGATACTTC 


CTTCCACACA 


ACTATGCCAG 


AGAAAGCTTA 


TCGCTACCCT 


CTACCAACAA 


7680 


AATATTACAC 


AGAAAACAAG 


GTTCGTAAAT 


ACGGTCCTCA 


TCGTACAAGT 


CACCAGTTTC 


7740 


TAGCAGGAGA 


AGCTGCAAAA 


CTCTTGGGAC 


GTCCATTAGA 


AGACTTGAAC 


TTAATTACCT 


7800 


GTCATATTGG 


TAACGGAGGC 


TC AATT AC AG 


CTCTGAAAGC 


CGGCAAATCT 


GTAGACACTT 


7B60 


CTATGCGGTT 


CACTCCTCTT 


GCTGCTATTA 


TGATGGGAAC 


GCGTACAGGG 


GATATTGATC 


7920 


CAGCTATCAT 


TCCTTATTTA 


ATGCAATATA 


CAGAGGATTT 


TAACACACCA 


GAAGATATCA 


7980 


CTCGTCTTCT 


TAACCGTCAA 


TCAGGTCTTT 


TGGGAGTTTC 


TGCTAATTCT 


AGCCATATGC 


8040 


GCGATATAGA 


AGCACCTGTA 


GCAGAAGGGA 


ATCACCAGGC 


TAGCTTGGCT 


TATGAAATCT 


8100 


ATGTTGACCG 


TATCCAAAAA 


CATATCGCTC 


ACTACCTTGC 


AGTGCTAAAT 


GGAGCAGATG 


8160 


CCATTGTTTT 


CACAGCAGGT 


GTCGGTGAAA 


ATGCACACAG 


TTTCCGTCCT 


CATGTAATCT 


8220 


CACCGATTTC 


GT GGTTTGGT 


TGTGATGTTG 


ATGATGAAAA 


GAATGTCTTT 


GGCGTTACAG 


8280 


CACACATCTC 


AACACAGGCA 


GCTAAAATCC 


GTGTCTTGGT 


TATTCCAACA 


CATCAAGAAT 


8340 


TAGTCATTGC 


CCCTGACGTT 


GAACGCTTGA 


AAAAATAAGT 


GAAACTAAAA 


AAATATTCAA 


8400 


TACAAGGAGT 


TGGGAAAGTT 


ATTTTTCCAG 


CTTCTTTTTC 


TGATGAAATT 


GTCCAAAACC 


8460 


TTCCTATGAT 


TGGCmTIT 


GAAAAATATG 


GTATAATAGT 


AGTAATTTAA 


TAGATGGAGT 


8S20 


TGAGTTTTGA 


AGAAAAACTT 


TCGTGTAAAA 


AGAGACAAAG 


ATTTTAAGGC 


GATTTTCAAG 


8580 


GAGGGGACAA 


GTTTTGCTAA 


TCGCAAATTT 


GTGGTCTACC 


AATTAGAAAA 


C C AG AAAAAC 


6640 


CCTTTTCGAG 


TACGTCTATC 


AGTTAGCAAA 


AAACTGCGGA 


ATGCCGTCAC 


TAGAAATCAA 


6700 


ATTAAGCGAC 


GGATTCGGCA 


TATTATCCAC 


AATGCAAAAG 


GGAGTCTGGT 


AGAAGATCTC 


8760 
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CACTTTGTTC TCATTGCTCG AAAAGCAGTC GAAACCTTGG GAWVGC.7ACA GATGGAGAAA 
AATCTACTCC ATGTATTAAA ATTATCAAAG ATTTACCGGG AAGGAAATGG GAGTGAAAAA 
GAAACTAAAG TTGACTAGTT TGCTAGGACT GTCTCTGTTA ATCATGACAG CCTGTGCGAC 
TAATGGGGTA ACTAGCGATA TTACAGCCGA ATCGGCTGAT TTTTGGAGTA AATTGGTTTA 
CTTCTTTGCG GAAATCATTC CCTTTTTATC CTTTGATATT AGTATCCGAG TGGCGATTAT 
TCTCTTTACG GTCTTGATTC GTACAGTCCT CTTGCCAGTC TTTCAGGTCC AAATGGTGGC 
TTCTAGGAAA ATGCAGGAAG CTCAGCCACG CATTAAGGtG CTTCCAGAAC AATATCCAGG 
TCGAGATATG GAAAGCAGAA CCAAACTAGA GCAGGAAATG CGTAAAGTAT TTAAAGAAAT 
GGGTGTCAGA CAGTCAGACT CTCTTTGGCC CATTTTGATT CAGATGCCGG TTATTTTGGC 
CCTGTTCCAA GCCCTATCAA GAGTTGACTT TTTAAAGACA GGTCATTTCT TATGCATTAA 
CCTTGCTAGT GTGGATACAA CCCTTGTTCT TCCGATTTTA GCAGCAGTAT TCACCTTTTT 
AAGTACTTGG TTGTCCAACA AAGCTTTGTC TGAGCGAAAT GGCGCTACGA CTGCGATGAT 
CTATGGGATT CCAGTCTTGA TTTTTATCTT TGCAGTTTAT CCCCCACGTC CAGTCGCCCT 
ATACTGGACA GTGTCTAATG CTTATCAAGT CTTGCAAACC TATTTCTTCA ATAATCCATT 
CAACATTATC GCAGAGCGCG AGGCCGTACT ACAGGCACAA AAAGATTTGG AAAATAGAAA 
AAGAAAAGCC AACAAAAAGG CTCAGAAAAC GAAATAAATA AGGAGGAATC TCCTACTCGT 
AGTATTTACA GGTTCAACTG TTGAAGAAGC AATCCAGAAA GGATTGAAAG AATTAGATAT 
TCCAAGAATG AAGGCTCATA TCAAAGTCAT TTCTACCCAG AAAAAAGCCT TTCTTGGTCT 
ATTTGGTAAA AAACCAGCCC AACTGGATAT TCAAGCGATT AGTGAAACGA CTGTTGTCAA 
AGCAAATCAA CAGGTAGTAA AAGGCGTTCC GAAAAAAATC AATGATTTGA ACGACCCTGT 
GAAGACGCTT AGTCAAGAAA CCGTTGACCT TGGTCATCTG GTTGATGCTA TTAAAAAAAT 
AGAGGAAGAA GGTCAAGGTA TTTCTGATGA AGTCAAGGCT GAAATCTTAA AACATGAAAG 
ACATGCCAGC ACTATCTTAG AACAAACTGG TCACATTGAG ATTTTAAATG AACTTCAAAT 
CGAGGAAGCG ATGAGGGAAG AAGC AGGCG C TGATGACCTT GAAACTGAGC AAGACCAACC 
TCAAAGTCAA GAACTAGAAG ACTTCGGCTT GAAAGTTGAA ACGAACTTTG ATATTGAACA 
AGTAGCTACG GAAGTAATGG CTTATGTTCA AACGATT ATT CATGACATGG ATCTTCAGGC 
TACACTTTCA AATGATTATA ACCGTCGTAG CATCAATCTA CAAATTGACA CCAACGAACC 
AGGTCCTATT ATCGGCTACC ATGGTAAACT CTTCAAGGCC TTGCAACTGT TGGCTCAAAA 
TTATCTTTAC AACCGCTATT CCAGAACCTT CTACGTTACA ATCAATGTCA ATGATTATGT 
CGAACACCGT GCAGAACTCT TGCAGACCTA TGCGCAAAAA TTGGCGACTC GTGTTTTGGA 
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ACTCATAAAA 


LAuA & >.Lrtrt 1 


CTCAAATAGT 


f.AACCCAACA 


TTATCCATCG 


10620 


TATTATTTCA 


CGTATGGATG 


GCvjTwAI. 1 Au 


T*P A. f—T*f*Tfl A. & 
I 1 Al» i v. 1 


wVi 1 1 UAUW 


CAAATCGCTA 


10680 


TGTTGTTGT A 


GATACAGAAT 


m fc /""f » ft » A TP 

AACiTAAAATL 


nb<j H In l^t 


1 \M% t H 1 1 1U 


CTAGTTAGAG 


10740 


GAGCTTAAAC 


TGATGTTGAA 


m< k/>tlpft ft P ft 

TAAGATAAGA 




1 1 i Am. 1 wu 




10800 


CGTAATCCTG 


ATAAAGCGGG 


AGCAUAGCuA 


IjAbAnun Iu(. 




rCACAAAGCA 


10860 


CAAGAGGCCC 


GAAAGGTTTT 


TACAGAACTG 


UCCAAAGCL 1 


•PtV & A ft ft/** 




10920 


TGCCAACTCC 


AACAGACTAG 


CCAGTGGATG 


AATCAGGCCC 


*> f*f*C "TTTT* Af* 


nULALA 1111 


10980 


TGGGTTTATC 


TACAGAGAGA 


CGGACAAGTG 


ACAGAACCTA 


TGATGGCL 1 i 




1 1 Cid fi 
X X\JH \J 


GGGACATCTA 


CTGACTTTGG 


AATTTC TTTG 


GAAGTCAGTT 


TCATCGAACG 


TAnunnuuAT 


111 OA 


GAGCAAACAC 


TGGGCAAGCA 


GGCCAAACTT 


TTAGACATTC 


& ft A/^/^WM ft ft 

CAACCGTTAA 


AGGGAH l AT 


1 1. lOU 


TATCTAACCT 


ACTCTAATGG 


TCAAAGTCAA 


CGGTGGGAGG 


CGAATGAAGA 


AAAGCGTCGT 




ACTTTACGCG 


AGAAGGTGAG 


AAGTCAAGAA 


GTTCGAAAAG 


jMMMh ft j-fcjMjT- ft ft 

TTTTAGTGAA 


GGTAGATGTT 


11280 


CCTATGACAG 


AAAATTCCTC 


TGAAGAAGAA 


ATCGTAGAAG 


GCTTATTGAA 


GTCTTATTCT 


1 1 T A f\ 


AAAATTCTTC 


CCTATTATCT 


AG CT ACG AGA 


AAATAAGATA 


ATTTGTAAAA 


\_/\ 1V.A i AAA I 


111UU 


CATACACTCC 


AAGAGTGAAC 


AGTCCGCTCT 


GTAATTCTTC 


CTCTTTTTGT 


TTGCGCTTTC 


1 1 4 60 


GCATTATATA 


ATAAACTTAC 


AAAAACAATT 


CAAAAGGAGA 


ft ^ ft ft (f«*ft ft ft*^^ 

ACAATTATGC 


AAG*rCGTTTO 


i i con 


AAGTGTTCTA 


AATTGGTTTT 


CTAGCAATAT 


TTTGCAGAAT 


CCCGCATTTT 


TCGTAGGTTT 




ATTGGTGTTG 


ATAGGATATG 


ft MlMlf|lf|V* ft ft 

CACTTTTGAA 


AAAACCTGCC 


CATGACCTTT 


TTTCAGGGTT 


1 1 £4 fl 
X 1 04 U 


TCTTAAAGCA 


ACAGTAGGGT 


ATATGTTGCT 


TAACGTGuoT 


GCTGGTGGTT 


^ Liu I i 


X X 'UU 


CTTTCGTCCA 


ATCTTAGCAG 


CTCTTAACTA 








11760 


CCCTTACTTT 


GGACTTGCTG 


CAGUAAAWAA 


tAAAATTtilA 


fir* a. n h. rTrr*™ 


CAGATTTTGT 


11820 


TCGAACTGCA 


ACTACAGCTC 


TATTGATTGG 


TTTTGGAATA 


AATATCTTGC 


TCGTAGCTCT 


11880 


TCGAAACATT 


ACGAAGGTAA 


GAACCCTCTT 


TATTACTGGT 


CACATCATGG 


TACAACAAGC 


11940 


TGCAACAGTA 


TCTCTTATGG 


TTCTATTCTT 


AGTACCACAA 


TTGCGCAATG 


CTTACGGTAC 


12000 


AGCAGCGATT 


GGTATCATCT 


GTGGACTTTA 


CTGGGCAGTT 


AGTTCAAATA 


TGACTGTTGA 


12060 


GGCAACTCAA 


CGCTTGACTG 


GTGGTGGCGG 


ATTTCCCATT 


GGTCACCAAC 


AGCAATTTGC 


12120 


AATCTGGTTT 


GTAGATAAAG 


TAGCAGGACG 


CTTTGCTAAG 


AAAGAAGAAA 


CTTTAGACAA 


12180 


TCTTAAATTA 


CCTAAGTTCC 


TCTCAATCTT 


CCACGATACA 


GTTGTTGCAT 


CTGCTACCTT 


12240 


CATGCTCGTA 


TTCTTCGGAG 


CCATTCTTTT 


AATCTTGGGT 


CCAGACATTA 


TCTCTAATAA 


123O0 
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AGAAGTCATC 
TATCCAAACA 
GTTCGTATCT 
ATTCCCAGCG 
AGGATTTACC 
AAATCCGATT 
GGTCTACGCT 
TGTCCTTCAA 
TGGCTACCAT 
ATACCTTGCT 
TCAATTTGCC 
TTAGTATCTA 
TGGGTTCATC 
CAGATTTTAC 
ACATCCTAAT 
TAATTGGGCT 
TACAGTAAAA 
CCTCTTTAAA 
ATCCGACTAG 
TTAATTGAAA 
GAGTATGGGC 
GCAGCTCTGC 
GATGGGAAAG 
AGTGTACCCA 
CAGGCTTGCC 
TATCTCGAAG 
ACCTAATTTA 
TTCTGTTGGT 
AAGTGAACTG 
AAAATGTGCT 



ACTTCAGGAA 
GCCTTTACCT 
GAGTTGACAA 
GTTGACGTTG 
TTTGGTTTGA 
CTTATTATTA 
CATAAACGCG 
GTTGCTCTAG 
GGAAATATCG 
ATTGTTGCTT 
AAACCAAAAG 
GAAAAGGAGA 
AATGCTTATC 
ACTCAATTCA 
CGCTTCTCTT 
TGATAACTTG 
GCTTGGAGGG 
TAAAGGAGGC 
GTTTAGAGGC 
GTGGGGCAAT 
CTTACTATAT 
AAAGTGATGC 
AGGTATCTGT 
TTCCACAAAT 
AGACTAAAGA 
GATTGGATTT 
CAAGTTGCAT 
CAGGAACTAG 
GCTGAACTCT 
GATGCTGGTG 



CTCTATTCAA 
TCTCACTTTA 
ACGCCTTCCA 
CAGCTTCTTA 
TTGGTCAATT 
CACGATTTCT 
GCGGATGGAA 
GAGCTCTTTG 
ACTTTGAATT 
ATGTACTTGT 
ATAAAGAGAA 
AATAAAATGG 
AAGATGAAGG 
TGCAGTGTCG 
CATTTGATTC 
ATGGATGATA 
GGCTGGACAG 
AGATATGAAT 
TAACAATTCG 
TTTGCCAGAC 
CTTGATGCCA 
CTTTTCATTG 
TTTGTTGGCA 
TATTGCCCTA 
AGATGTCTTC 
GGAAACTTAG 
TAGACCATTC 
ATATTATCGA 
TGCGTAGCCT 
GAACACTTGC 



382 
TCCTGCTAAA 



CTTGTTCCTT 
AGGTATTTCA 
TGGATTTGCT 
GATTACAATT 
ACCAGTGTTC 
ACCCCCTGTT 
TGTGGCCCTT 
CCCATGGCTT 
GTGTCTCTTC 
ATATTACAAC 
TTAAAGTATT 
TTGAAAATGC 
GTGAAGCTAA 
AAGAATTGGA 
AAGAAATCAC 
AAACTCAGAG 
TTAAAACAAG 
AAAGAAGCAG 
TATTACGATG 
GGTATGGCTA 
ATTACCTTAC 
CTAGCAGCAA 
TTTGAATTAG 
GCTATGATTG 
AAAGAGGAAT 
AGACTTGCAA 
AGCTGGAACT 
TTTCCCAGAT 
TAAAAATAAT 



CAAGATTTCT 
TTCATCCAAG 
AACAAATTGT 
TCTCCAAATG 
GTTTTGCTCA 
TTTGACAATG 
ATCCTTTCCT 
CTCGATTTGG 
GGATTTGGAT 
TTGCTTGTTA 
GGTGAACTTC 
AGCAGCGTGC 
TCTCCGTAAG 
AGGTTTAGCA 
AGGGCGAACT 
CCAAAAACTC 
TTATCGTTTC 
CTTTAATTGA 
TCAAGGTAGC 
CTATCATTGA 
TGCCCCACGC 
AAAATCCTGT 
CAAGTTCAAA 
AAGATTCTAT 
AAGAATCTAA 
AAAGAAATGA 
GGAGCGATTA 
GTTTGCTTGC 
AAGATTATTG 
GCGGTTCGTG 



TTATGTACAT 
GTGTCCGAAT 
TGCCAGGTTC 
CTGTCTTGTC 
TCGTCTTTAA 
CAGCCATTGC 
TTATATCAGG 
CATCTTATGG 
ATATCTTCAA 
TTCCTCAACT 
AAGAAGAAGC 
GGAAATGGAA 
CTTAATCAAA 
GTAGGATATG 
AATGGGAAGT 
AGTCAAGCAC 
TGTCCTTCTC 
CAATGACTCC 
AGTAGATCCC 
ATCGACTGAA 
TAGACCTGAA 
TGTATTTTCA 
AATTCACACA 
TGCACGTTTA 
GGATAGCCCT 
CAAAAAGAAT 
AAGCAGCTGT 
TTCAAGTTGG 
TGGCAGACAC 
GAGCAGACTG 



12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 
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CATGACTTGT 


ATCTGTTGTG 


CAACCATCCC 


TACTATGGAA 


GCAGCTCTAA 


AGGCTATCAA 


14160 


GACTGAACGA 


GGAGAACGAG 


GCGAAATCCA 


GATCGAGCTT 


TATGGCGATT 


GGACTTTTGA 


14220 


ACAAGCTCAG 


CTTTGGCTAG 


ATGCAGGTAT 


CTCACAAGCT 


ATTTATCACC 


AATCTCGTGA 


14280 


TGCTCTTCTT 


GCTGGTGAAA 


CTTGGGGTGA 


AAAAGACCTT 


AATAAGGTTA 


AAAAACTCAT 


14340 


TGACATGGGC 


TTCCCTCTAT 


CTGTAACAGG 


TGGTCTAGAT 


GTAGATACTC 


TCAAACTCTT 


14400 


TGAAGGTATT 


GATGTCTTTA 


CCTTTATCGC 


AGGTCGTGCA 


ATTACAGAGG 


CTGTGGATCC 


14460 


A rip ACCAfi C A 


GCGCGTGCCT 


TCAAGGATGA 


AATCAAACGA 


ATTTGGGGGT 


AAATCATGGT 


14520 


ArRTTCAATP 


GGAATTTATG 


AAAAGGCAAC 


CCCAACACAC 


TGTACTTGGC 


TAGAACGTTT 


14580 


/vvH 1 1 1 lVA>Vi 


AAGCAGTTAG 


GCTTTGATTT 


TCTCGACATG 


TCTATTGACG 


aacctgacga 


14640 


r* fWTT & Gf* A 


AGACTTGACT 


GGAGTAAGGA 


AGAACGCTTG 


GAAGTTGTCA 


AAGCAATCTA 


14700 




GTTCGTATTC 


CTTCTATCTG 


TTTTTCAGGC 


CATCGTCCCT 


ACCCATTGGG 


14760 


«twi*^» & k & »/* » m 




AGGAAAAATC 


TCTAGAACTC 


ATGAAAAAAT 


GTATCGAATT 


14820 




TTCCGAGTTC 


GTACGATTCA 


ATTAGCTGGT 


TACCATGTTT 


ACTATGACGA 


14880 


& &. anfPArTT* 

AaAu 1 v. \_ 


C AG AC ACG CC 


AACGTTTTAT 


CAAAAATTTG 


AGAAAAGCCT 


GTGACTGGGC 


14940 




CAGCTCGTAC 


TTGCTATTGA 


AATTATCGAT 


GATCCTTTCA 


TCAGTACCAT 


15000 




TTGGCTATAG 


AAAAAGAGAT 


TGACTCTCCC 


TTCCTCTTTG 


TATATCCAGA 


15060 




GTGTCTGCAT 


GGCATAATGA 


TATCTATAGT 


GAGTTTTATC 


TTGGTCATCA 


15120 




G CTCTCC ATC 


TCAAGGATAC 


TTATGCAGTG 


ACAGAAACTT 


CAAACGGCCA 


15180 


RTTCCCAGAT 


GTACCTTTCC 


GGCAAGGTTG 


TGTCAAATCG 


GAAGAACCTT 


TCGATATTTT 


15240 


AAAGGAAACC 


AATTATAATG 


GACCTTTCCT 


AATCGAAATG 


TGCTCTGAAA 


ATTCTGAAAC 


15300 


AGTAGAAGAA 


ACACGCGCAG 


CCATTCAACA 


GGCGCAAGCT 


TTTCTCTATC 


CACTCATTAA 


15360 


GAAAGCAGGT 


TTGATCTAAG 


ATGAATCAAG 


TAATCAATCC 


TATGCGTAAA 


CGACTCTGTG 


15420 


ATGCCAATCA 


ATCATTGCCA 


AAACATGGAC 


TTGTCAAATT 


TACCTGGGGG 


AATGTATCTG 


15480 


AAGTTAATCG 


CGAACTCGGT 


GTCATTGTTA 


TCAAACCATC 


AGGCGTGGAT 


TATGACGAAT 


15540 


TGACACCTCA 


AAACATGGTA 


GTGACTGATC 


T AG ATGGT AA 


GATCCTAGAA 


GGGGATTTAA 


15600 


GACCATCTTC 


CGACCTCCCA 


ACTCATGTGC 


AATTATATAA 


GACTTGGTCA 


GAAATTGGTA 


15660 


GTGTGGTTCA 


CACCCATTCG 


ACAGAAGCTG 


TTCGTTGGGC TCAGGCAGGT 


CGTGATATTC 


15720 


CTTTCTACGG 


AAC AACC CAT 


G CAGATT ATT 


TCTACGGTTC 


AATCCCTTGC 


CCCCCTAGTT 


15780 


TGACCAAGGA 


CGAAGTAGAA GTGGCCTATG 


AAAAAGATAC 


TGGCCTGGTT 


ATCGTAGAAG 


15840 
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AGTTTCAACA 


TCGCGGACTT 


AACCCGGTTG 


AACTACCAGG 


AATTGTTGTA 


CGCAATCACG 


15900 


GTCCATTCAC 


CTGGGGCAAA 


AATCCAGAGA 


ATGCTGTTTA 


TCACTCTCTC 


GTACTAGAGG 


15960 


AAGT AT C AAA 


GATGAATCGC 


TTTACAGAAC 


AAATCAATCC 


AAGAGTTGGA 


CCTGCTCCCC 


16020 


ACT ACAT ACT 


AGAAAAACAC 


TACCAACGTA 


AACATGGACC 


AAATGCTTAT 


TATGGTCAAA 


16080 


AGTAAGAACG 


ATGAAGGAGG 


AGAAAAAGAT 


AAATTTAGCT 


CCTCTTTTTA 


CATTTGATTT 


16140 


TTATTGAGAG 


TAAACTTGGA 


GTTGAAGTAA 


TTTTAAAAGA 


TTTTTTAGAA 


ATAGCGCTTG 


16200 


ATATATATAT 


GGTAAAATAA 


AAAGAATTGC 


TGTGATATCA 


ATAGATTTGC 


GGGATTTTTT 


16260 


AATATGGTAC 


TGGATAAGGC 


AAGTTGTGAT 


TTGCTTCAAT 


ATTTG ATGG A 


TCAAGAAACG 


16320 


TCCAAAACGA 


TTATGGCGAT 


TTCGAAAGAT 


TTGAAAGAGT 


CAAGAAGGAA 


AATTTATTAT 


16380 


CACATTGACA 


AAATCAATGC 


TGCTCTGGGT 


GACGAGGCGC 


TT CACATC AT 


TAGTATTCCA 


16440 


CGAATTGGTA 


TTCACTTAAC 


GGAAGAGCAG 


AGAGATGCTT 


GTTGTAAACT 


ATTATCGGAA 


16500 


GTAGATTCGT 


ACGATTATAT 


CATGAGTGCG 


CATGAACGTA 


TGATGATAAT 


GTT ACT ATGG 


16560 


ATAGGTATTT 


CTAAAGAACG 


TATTACGATT 


GAAAAATTGA 


TAGAGTTAAC 


AGAGGTATCT 


16620 


AGGAATACTG 


TTCTCAATGA 


TTTGAATAGT 


ATTCGTTATC 


AACTAACTTT 


GGAACAATAT 


16680 


CAGCTGATCT 


TGCAACTGAG 


CAAGTCACAG 


GGATACAACC 


TTCATGCCCA 


CCCTCTTAAT 


16740 


AAAATTCACT 


ATCTTCAATC 


G CTTCT AT AT 


CATATTTTTA 


TGGAAGAAAA 


TGCCACTTTT 


16800 


GTATCTATTT 


TAGAAGATAA 


GATGAAAGAG 


AGCTTAGATG 


ATCACTGTTT 


GCTTTCTUTT 


16860 


GAAATCAACC 


AATTTTTTAA 


GGAACAGGTT 


CCTTTAGTTG 


AACAAGATTT 


AGGGAAGAAA 


16920 


ATAAACCATC 


ATCAAATAAC 


TTTTATCTTG 


CAGGTTCTAC 


CTTATTTCCT 


GTTAAGCTGT 


16980 


CATAATGTTG 


AACACTATCA 


AGAAAGACAT 


CAGGATATAC 


AGAAACAATT 


TTCTTTGATA 


17040 


AGAAAAAGAA 


TAGAGTATCA 


GCTGTCTAAG 


AAATTAGGAG 


AACGGTTCTT 


TCAAAAGTTT 


17100 


GAAATTTCTT 


TGTCAGGACT 


TGAAGTTTCT 


CTTGTAGCTG 


TTCTCCTCCT 


CTCCTATCGT 


17160 


AAAGATTTGG 


ATATTCATCC 


ACAAAGTGAT 


CATTTTCGGC 


AATTAAAACT 


TCCTTTAGViA 


17220 


GAATTTATCT 


GGTATTTTGA 


ATCACAAATC 


CGAATGGAGA 


TTGAGAACAA 


GGATGATTTC 


L7280 


TTACGAAATT 


TGATGATCCA 


CTGTAAAGCC 


TTGTTATTTA 


GAAAGACTTA 


CGGTATTTTT 


17340 


TCTAAAAATC 


CTCTAACAAA 


ACAAATTCGA 


TCCAAGTATG 


GAGAATTATT 


TTTAGTCACT 


17400 


AGAAAATCTG 


CGGAAATTTT 


AGAAGGAGCA 


TGGTTTATTC 


GGCTAACAGA 


CGATGATATT 


17460 


GCCTATTTGA 


CGATTCATAT 


TGGAGGATTT 


TTAAAATATA 


CACCATCATC 


TCAAAAAAAT 


17520 


ATGAAAAAAG 


TTTATCTCGT 


TTGTGATGAA 


GGTGTTGCGG 


TTTCGAGACT 


TTTGCTGAAA 


17580 


CAATGCAAAC 


TTTATTTTCC 


AAATGAGCAA 


ATTGACACTG 


TATTTACAAC 


AGAACAATTT 


17640 
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AAG ACT CTGG AAGATATTGC ACAAGTTGAT GTAGTGATTA CTACTAATGA TGATTTGGAT 17700 

AGCACATTTC CGATTTTAAG GGTTAATCCT ATCCTTGAAC CAGAAGATAT TTTCAAAATG 17760 

CTAGACTATC TTAAACACAA TATATTTCGT AATAAGACCA AAAGTTTCAG TGAAAATCTT 17820 

TCTACTCTTA TTTCGTCTTA TATTGTAGAC AGCAAGTTGG CTAGTAAGTT CCAAGAAGAG 17880 

GTTCAAACAC TTATAAATCA AGAAATAGTA GTTCAAGCTT TTTTCGAAGr TATTTGAAGG 17 940 

ACAGTCCAAT GATGAACACA AACCTGTGT k TTTCsTGGTC TTTTtTACTG TTTTGAAGGG 18O00 

TGGkATACTA ATCTCAAAGA TAACAATTAT ATCCAAAGGA GGCAACATAT GCCAAACGTC 18060 

AAAGAAATTA CAAGAGAGTC ATGGATTTTA GCCACTTTCC CAGAGTGQCG AACATGGTTG 18120 

AACGAAGAAA TCGAAGAAGA AGTCCTACCT CAACGCAACT TTGCCATCTG GTGGCTAGGC 18180 

AACTGTCGTA CTTGGATTAA GACACCAGCT GGTGCTAACG TTGTCATGGA CCTTTGGTCA 18240 

AACCGTCGAA AATCAACCAA AAAAGTCAAA GATATGGTTC GTGGGCACCA AATGGCAAAT 18300 

ATGCCAGGTG TTCGTAAGCT GCAACCAAAC TTGCGTGTTC AGCCAATGGT TATCGATCCA 18360 

TTTCCTATCA ACGAACTAGA CTATTACTTA GTTTCACACT TCCACAGTGA TCATATCCAC 18420 

CCATACACAG CTGCAGCAAT TCTCAATAAT CCTAAGTTAC AGCATCTTAA GTTGG 18475 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7186 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNSSS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCAGGATTTG GTACCGTTGC AAGTGGTGTG CCTTTCCTCC TAAAGGAAAA TGGAGCAAAA 60 

ATCAATCAAT CACCACATTC ACATATCAAA GTTGCTAAGG TATTGGTCAA GGATGAAGAT 120 

GAAAAAAATC GCTTGCTTGC AGCAGGGAAT GACTTTAACT TTGTAACCAA TGTGGATGAT 180 

ATTTTATCAG ACCAGCATAT TACTATCGTA GTCGAATTGA TGGGGCGTAT TGAGCCTGCT 240 

AAAACCTTTA TCACTCGTGC C T T G G AACCT GGAAAACACG TTGTTACTGC TAACAAGGAC 300 

CTTTTAGCTG TCCATGGCGC AGAATTGCTA GAAATCGCTC AAGCTAACAA CCTAGCACTT 3 60 

TACTACGAAG CAGCAGTTGC TGGTGGGATT CCAATTCTTC GTACTTTAGC AAATTCCTTG 420 

GCTTCTGATA AAATTACGCG CGTGCTTGGA GTAGTCAACG GAACTTCCAA CTTCATGGTG 4 80 

ACCAAGATGG TGGAAGAAGG CTGGTCTTAC GATGATGCTC TTGCGGAACC AC AACCTCTA 540 
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GCATTTCCAG 


AAAGCGATCC 


GACGAATGAC 


CTAGATGGGA 


TTGATGCAGC 


CTACAAGATG 


600 


CTTATTTTCA 


CCCAATTTGC 


CTTTGGCATG 


AAGATTGCCT 


TTGATGATGT 


AGCCCACAAG 


660 


GGAATCCGCA 


ATATCACACC 


AGAAGACGTA 


GCTGTAGCTC 


AAGAGCTTGC 


TTACGTAGTG 


720 


AAATTGGTTG 


GTTCTATTGA 


GGAAACTTCT 


TCAGGTATTG 


CTGCAGAAGT 


GACTCCAACC 


780 


TTCCTACCTA 


AAGCGCACCC 


ACTTGCTAGT 


GTGAATGGCG TJOTTGAACCC 


TGTCTTTGTA 


840 


GAATCTATCG 


CTATTGGTGA 


GTCTATGTAC 


TACGGACCAG 


GTGCGGGTCA 


AAAACCAACT 


900 


GCAACAAGTG 


TTCTAGCTGA 


TATTGTCCGT 


ATCGTTCGTC 


GTTTGAATGA 


TCGT ACT ATT 


960 


GGCAAAGACT 


TCAACGAATA 


TAGCCCTGAC 


TTGGTCTTGG 


CAAATCCTGA 


AGATGTCAAA 


1020 


GCAAACTACT 


ATTTCTCAAT 


CTTGGCTCTA 


GACTCAAAAG 


GTCAGCTCTT 


GAAGTTGGCT 


1080 


GAAATCTTCA 


ATGCTCAAGA 


TATTTCCTTT 


AAGCAAATCC 


TTCAAGATGG 


CAAAGAGGGT 


1140 


GACAAGGCGC 


CTCTCCTTAT 


CATCACACAC 


AAGATTAATA 


AAGCCCAGCT 


TGAAAATGTC 


1200 


TCAGCTGAAT 


TGAAGAAGGT 


TTCAGAATTC 


GACCTCTTGA 


ATACCTTCAA 


GGTGCTAGGA 


1260 


GAATAAGATG 


AAGATTATTG 


TACCTGCAAC 


CACTGCCAAT 


ATCGGGCCAG 


GTTTTGACTC 


1320 


GGTCGGTGTA 


GCTGTAACCA 


AGTATCTTCA 


AATTGAGGTC 


TGCGAAGAAC 


GAGATGAGTG 


1380 


GCTGATTGAA 


CACCAGATTG 


GCAAATGGAT 


TCCACATGAC 


CAGCGTAATC 


TCTTGCTCAA 


1440 


AATCGCTTTG 


CAAATTCTAC 


CACACTTGCA 


ACCAAGACGC 


TTGAAAATGA 


CCAGTGATGT 


1500 


CCCTTTGCCG 


CGCGGTTTGG 


GTTCTTCCAC 


CTCGGTTATC 


GTTGCTGGGA 


TTGAACTAGC 


1560 


CAACCAACTG 


GGTCAACTCA 


ACTTATCAGA 


CCATGAAAAA 


TTGCAGTTAG 


CGACCAAGAT 


1620 


TGAAGGGCAT 


CCTGACAATG 


TCGCTCCAGC 


CATTTATGGT 


AATCTCGTTA 


TTCCAACTTC 


1680 


TGTTGAAGGG 


CAAGTCTCTG 


CTATCCTACC 


ACACTTTCCA 


GAGTGTGATT 


TTCTAGCTTA 


1740 


CATTCCAAAC 


TATGAATTAC 


GTACTCGCGA 


CAGCCGTACT 


GTCTTGCCTA 


AAAAATTGTC 


1800 


TTATAAGGAA 


GCTGTTGCTG 


CAACTTCTAT 


CGCCAATGTA 


GCGGTTGCTG 


CCTTGTTGCC 


1860 


AGGAGACATG 


GTGACCGCTG 


GGCAAGCAAT 


CGACGGAGAC CTCTTCCATG 


AGCGCTATCG 


1920 


TCAGGACTTG 


GTAAGAGAAT 


TTGCGATGAT 


TAAGCAAGTG 


ACCAAAGAAA 


ATGCGGCCTA 


1980 


TGCAACCTAC 


CTTTCTGGTG 


CTGGGCCGAC 


AGTTATGGTT 


CTGGCTTCTC 


ATGACAAGAT 


2040 


GCCAACAATT 


AAGG C AGAAT 


TGGAAAAGCA 


ACCTTTCAAA 


GGAAAACTGC 


ATGACTTGAG 


2100 


AGTTGATACC 


CAAGGTGTCC 


GTGTAGAAGC 


AAAATAAAGA 


ATAGAAGATA 


GGATGGGGAA 


2160 


ACT CTTGACC 


AGAGGGCTTC 


ATATCCTTTT 


TGTCAAAAGA 


AGTTTATACT 


CAATGAAAAT 


2220 


CAAAGAGCAA 


ACTAGGAAGC 


TAGCCGCAGG 


CTGCTCAAAA 


CAGTGTTTTG 


AGGTTGCAGA 


2280 


TAGAACTGAC 


CAAGTCAGCT 


CAAGACACTG 


TTTTGAGGTT 


GCAGATAGAA 


CTGACGAAGT 


2340 
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CACTAACCAT 
TAGTTAAAAA 
TTTTTGGGGC 
CTACGCTAAT 
AGAAACGGTC 
TTATTTCCCA 
ATATCCAACT 
GCAGGAGCAG 
CTACATTCTG 
TCATATCGAT 
TAGTCAAGAG 
TGCTACAGAG 
AGATATTACG 
TTGGCCAAGT 
CCATGGAATG 
TTTCACAGAT 
ACGCTTTGTG 
AAACAAATAA 
GCGATTTATG 
CCCCTTGTTC 
GATTGTTGAC 
GCTCCTTATC 
AAAGGCAGCA 
CTTCCCCAAG 
GGATACCTAC 
CATTATCGTT 
GTTCTTAGTC 
TCCTTTCTAC 
ATTGCAAGGG 



ACTACGGTAA 
CGTGATAAAG 
CTAGAGGAAT 
GGTCAACTCG 
CAAGTGATTT 
GTTATCGATC 
CGGATTTATT 
GAACGCATGC 
GCTGAAGACT 
GTGACCGATG 
GTGTTGAAGG 
GCTCCATTTA 
ACAGGTGACC 
TTTAGCCGTC 
GAGCGAATTG 
GGACCGCGGG 
GCCAAGCATG 
AACAGACACT 
AAACACCTAT 
AAGCTCTTAG 
CAATCTTTAC 
TTTGCAGTAA 
GTAGGTTCTG 
GACAGCAGAG 
CACATTCACA 
TTTGGTGCCA 
TTGGTTGCCA 
AGTAGTCTCA 
ATGCGGGTTA 



GGTGACGCTG 
GAGAAATAAA 
ATTTTTCACG 
AAACGACCAA 
ACGATGAGAA 
CTCTATCTAT 
ATCAGGATGA 
TGGGTCGAAA 
ACCACCAAGA 
CTGATAACCC 
CCAGTCTATC 
CCAATGCCTA 
CACTCTTTTT 
CGATTTCCAA 
AAGTTCGTTC 
AGTTAGGCGG 
AGATGGAAAA 
GGGGCTTCCC 
TATCTTACTT 
AAGCTGTTTT 
CTCAGGGAGA 
TTGGCGTTTT 
CTAAGGAATT 
ACCGTCTGAC 
CTGGTATCAA 
TTTTTATGGC 
TTTTGACCAT 
GAAAGAAAAC 
TTCGTGCTTT 



387 
ACGTGGTTTG 
GATGGCAGAA 
CATTTCTGGA 
TTACCAGTTG 
CGAACTCTCA 
CAATCAACAA 
AGCAGATTTG 
CATTGCAGTA 
CTATCTCACG 
ATTGATTGAT 
TGAAGACTCT 
TGACCAAACC 
TGCCAAGGAT 
AGAGTTGATT 
TCGTTCAGCC 
CCTCCCTTAC 
AGCACGATAT 
ACTTTCTTCA 
CAAACCCTAC 
TGAGCTCTTG 
TCAAGGTCAT 
AGTGG CCTTG 
GACAAACGAT 
AACTTCTAGT 
TCAATTCCTG 
TTATCCAATC 
TGTCATTGTA 
GGACCAACTG 
TGCTCAAGAA 



AAGAGATTTT 
ATTTATCTAG 
GTGCTAGAAA 
CTCAAGGAAA 
CTCAGAGAGA 
GGGAATGACC 
CCAGCTATCT 
GAAGTGGAGC 
AAGAATCCTT 
GCAGCAAACT 
TATCGTGTCA 
TTTGAAGAGG 
AAGTTTGCTT 
CATTATTACA 
ACTCCTCACT 
TGTATCAATT 
GGCTATCTAT 
TTTCTAGAAT 
ATCAAGGAAT 
GTTCCCATGC 
CTCTGGATGC 
ATAGCTCAAT 
CTTTATCGTC 
TTGGTCACTC 
CGTCTCTTTT 
TCAGCTGAGT 
GGGTTATCTC 
GTTCAGGAAA 
AAACGAGAGT 



CCAAGAGTAT 
CAGGTGGTTG 
CCACTGTTGG 
CAGACCATGC 
TTTTACTTTA 
CTGGTCGCCA 
ACACACTGGT 
AATTACGCCA 
CAGGTTACTG 
ATGAAAAGCC 
CACAAGAAGC 
CGATTTATGT 
CAGGTTGTGG 
AGGATCTCAG 
TGGCTCATGT 
CTGCTTCTTT 
TGCCTTACTT 
ATGAATAGAA 
CAATTTTAGC 
TGATTGCTGG 
AGATTGGCCT 
TTTACTCACC 
ATATTCTTTC 
GCTTGACTTC 
TACGAGCGCC 
TGACTTTCTG 
GATTGGTCAA 
CGCGCCAGCA 
TACAGATTTT 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3S40 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 
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I 



TCAAACCCTT 


AACCAAGTTT 


ATGCTAGATT 


ACAAGAAAAG 


ACAGGTTTCT 


GGTCTAGTTT 


4X40 


ATTAACACCT 


CTGACCTATC 


TGATTGTCAA 


TGGAACTCTT 


CTCGTTATTA 


TCTGGCAAGG 


4200 


CTATATTTCA 


ATTCAAGGAG 


GAGTGCTCAG 


TCAAGCTGCT 


CTCATTGCTC 


TTATCAATTA 


4260 


CCTCTTACAG 


ATTTTGGTGG 


AATTGGTCAA 


CCTAGCCATC 


TTGATCAATT 


CCCTCAACCA 


4320 


GTCCTATATC 


TCAGTCAAGC 


GAATCGAGGA 


AGTCTTTGTT 


GAGGCTCCAG 


ACGATATCCA 


4380 


TTCAGAGTTA 


GAACAAAAGC 


AAGCTACCAG 


ACATAAGGTT 


TTACAAGTCC 


AACAATTGAC 


4440 


CTTTACCTAT 


CCTGATGCGG 


CCCAGCCTTC 


TCTGAGATAC 


ATTTCCTTTG 


ATATGACTCA 


4500 


AGGACAAATT 


CTAGGTATCA 


TCGGGGGAAC 


TGGTTCTGGT 


AAATCAAGCT 


TGGTGCAACT 


4560 


CTTACTTGGA 


CTTTATCCAG 


TAGACAAGGG 


GAACATTGAC 


CTTTATCAAA 


ATGGACCTAG 


4620 


TCCTCTTAAT 


TTGGAGCAGT 


GGCGGTCTTG 


GATTGCCTAT 


CTACCTCAAA 


AGGTCGAACT 


46B0 


CTTTAAAGGA 


ACCATTCCTT 


CCAACTTGAC 


TCTAGCTTTC 


AATCAAGAAG 


TATCTCACCA 


4740 


GCAACTCTGG 


CAGGCCTTGG 


AGATTGCGCA 


AGCTAAGGAT 


TTTGTCAGTG 


AAAAGGAAGG 


4800 


ACTCTTGGAT 


GCTCTAGTTG 


AGGCAGGGGG 


CCGAAATTTC 


TCAGGTGGAC 


AAAAACAAAG 


4860 


ATTGTCTATC 


GCCCGAGCAG 


TCTTGCCCCA 


GGCTCCGTTT 


CTCATCCTAG 


ATGATGCAAC 


4920 


CTCGGCACTG 


GATACCATTA 


CAGAGTCCAA 


GCTCTTGAAA 


CCTATTAGAG 


AAAATTTTCC 


4980 


AAACACGAGC 


TTAATTTTGA 


TCTCTCAACG 


AACCTCAACT 


TTACAGATGG 


CGGACCAGAT 


5040 


TCTCCTCTTG 


GAAAAAGGTG 


AGTTGCTAGC 


TGTTGGCAAG 


CACGATGACT 


TGATGAAATC 


5100 


CAGCCAACTC 


TATTGTGAAA 


TCAATGCATC 


CCAACATGGA 


AAGGAGGACT 


AGAATGAAAC 


5160 


GACAAACTGT 


AAACCAGACC 


CTCAAACGTT 


TAGCCGTAGA 


TTTACCAACC 


CATCCTTTCC 


5220 


TCCTTTTCCT 


AGCCTTTCTA 


GGAACTATTG 


CCCAAGTTGG 


CTTATCAATT 


TACCTACCTA 


5280 


TTCTGATTGG 


GCAGGTCATT 


GACCAACTCC 


TAGTGCCTGG 


TTCATCACCA 


GTTTTTTGGC 


5340 


AGATTTTTCT 


CCAGATGCTC 


TTGGTCGTAA 


TAGGAAATAC 


TCTGGTACAA 


TGGCCCAATC 


5400 


CTCTCCTCTA 


TAATCGTCTA 


ATCTTCTCTT 


ATACCAGAGA 


TTTACGGGAG 


CGAATCATCC 


5460 


ATAAGCTCCA 


TCCTTTACCG 


ATTGCCTTTG 


TAGATAGGCA 


AGGTAGTGGA 


GAGATGGTTA 


SS20 


GTCGTGTAAC 


CACGGACATC 


GAACAGTTGG 


CAGCTGGCTT 


GACCATGATT 


TTTAACCAAT 


5580 


TTTTCATTCG 


TCTTTTGATG 


ATTTTGGTCA 


GTATTCTAGC 


CATGCTCCAA 


ATTCATCTCC 


5640 


TCATGACTCT 


CTTAGTCTTG 


CTGTTGACGC 


CACTGTCCAT 


GGTGATTTCA 


CGCTTTATTG 


5700 


CCAAGAAATC 


CTATCATCTC 


TTCCAGAAGC 


AAACAGAGAC 


GAGGGGAATT 


CAGACTCACT 


5760 


TGATTGAAGA 


ATCGCTTAGT 


CAGCAGACTA 


TAATCCAGTC 


CTTCAATGCT 


CAAACAGAAT 


5820 


TTATCCAAAG 


ATTGCGTGAG 


GCTCATGACA 


ACTACTCAGG 


CTATTCTCAG 


TCAGCCATCT 


5880 
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TTTATTCTTC 


AACGGTCAAT 




1 1 l\i I AAA 




TATGCCCTTT 

ft#t4w%»S>W* 4 k 


5940 


TAGCTCGAGT 


AGGAGCTTAT 


r»/—0** fi * ft 
CGTATw* i«« 


1 wuu i AwV- 




4 A ft 4V ^^ftj 


6000 


TGAcrirnT 


GAACTATCTT 


A* AT* ft ft 

CAGCAATACA 


t\.AAOV.t.<- 11 


TAAr^GATATT 


1*^*T**T*^AC*fV5^ 

l W 4 * WW* ft 


6060 


TAGCTGACTT 


GCAAAGTGCT 


CTGGt-vTOL. Cj 




fT&TCCAGTC 


TTAGATACCC 


6120 


CTGAAGTCGC 


TGAAACAGGT 


AAGGAAGTCT 




1 UAt> V. AAW 1 1 




6180 


TTTCCTTTAA 


ACATGTCTCT 


TTTGGCTACC 


ATCCTGAAAA 


AATTTTGATT 




£2 Aft 


CTATCGATAT 


TCCAGCTCGT 


AGTAAGGTAG 


CCATCGTTGG 




UV. luuAnAAl 


OJUV 


CAACTCTTAT 


CAATCTCCTT 


ATGCGTTTTT 


ATCCCATT AG 


CTCGGGAGAT 


ATCTTGCTGG 


O J O v 


ATGGGCAATC 


CATTTATGAT 


TATACACGAG 


fvtm mm ft fM* ft ^% 

TATCATTGAG 


ACACCAGTTT 


GGTATGGTCC 




TTCAAGAAAC 


CTCGCTCACA 


CAAGGGACCA 


TTCATGATAA 


TATTGCCTTT 


r+r^C k ftm/*^**!^ 
GGCAATV.CTG 


040V 


AAGCCAGTCC 


AGAGCAAGTA 


ATTGCTGCTG 


CCAAAGCAGC 


TAATGCAGAC 


TTTTTCATCC 


634U 


AACAGTTGCC 


ACAGGGATAC 


GATACCAAGT 


TGGAAAATGC 


ft ft ft vn^fft 

TGGAGAATCT 


CTCTCTGTCG 


CCAA 

DOUU 


GCCAAGCTCA 


CCTCTTGACC 


ATAGCCCGAG 


TCTTTCTGGC 


TATTCCAAAG 


ATTCTTATCT 


ooou 


TAGACGAGGC 


AACTTCTTCC 


ATTGATACAC 


GGACAGAAGT 


GCTGCTACAG 


GATGCC rri G 


o t Z U 


CAAAACTCAT 


GAAGGGCCGC 


ACAAGTTTCA 


TCATTGCTCA 


CCGTTTGTCA 


AtCATTL Al»t» 


O 'DU 


ATGCGGATTT 


AATTCTTGTC 


TTAGTAGATG 


GTGATATTCT 


TGAATATGCT 


AACCATCAAG 


6840 


AACTCATCGA 


TAGAAAGGGT 


AAGTATTACC 


AAATGCAAAA 


AGCTGCGGCT 


TTTACTTCTG 


6900 


AATAAGCCAT 


TCTCTTTTCA 


AACTTTATGG 


ACGAAAAAAG 


TTGCCTTCGA 


CTCACTrTTT 


6960 


TGTTACAATA 


GCTAGAAAAA 


TTGTTCACTG 


TAATACTCAA 


TCAAAATCAA 


AGACCAAACT 


7020 


AGGAAGCTAG 


CCGTACGTTG 


CTCAAAGCAC 


ACCTTTGAGG 


TTGTAGATAA 


GACTGACGAA 


7080 


GTCAGTTCAA 


AACACTCTTT 


TGAGGTTGCA 


GATAGAACTG 


ACGAAGTCAG 


CTCAAAACAC 


7140 


TGTTTTGAGG 


TTGCAGATAG 


AACTGACGAA 


CTCAGCTCAA 


AACAGG 




7186 



(2) INFORMATION FOR SEQ ID NO: 40: 



<i) SEQUENCE CHARACTERISTICS : 

<A> LENGTH: 14273 base pairs 
<B) TYPE : nucleic acid 
- - (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGAAAATTC TAAAAAATTT ATAAGTAAGG AATTAATTAG TTATTTTTGT GATAAAGTTT 60 
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ATCATGAAAT ATTTGTTGAA GAGGTAGTTC CGCACGTTTT TCTGCCATAT GAATCTGACT 120 

TACTTCTTAT TTTACCAGCT ACCGCAAATG TCATTGGCAA AATTGCTAAT GGTATTGCTG 180 

ATGATTTAGT TACAGCAACT GTTTTAAACT TTAATAAAAA AATAATTTTT TGTCCCAATA 240 

TGAACTCTAC TATGTGGGAC AATCACATAG TTCAAAGAAA TGTATCAATT CTAAAGGAGT 300 

TGGGACATAT ATTTTTATTT GAGTCTAAAA AAACATATGA GCTAGGATTG CGTAAAGCAA 360 

TAGATTCAAC ATCTTCAATC TTA CAACCAC AGTCGTTAGT AAAAGAACTT ATCAAATTAG 420 

AAAATATTGT CCTTGAAGAG GGACATTAAA AACTACTGAG AATATTAATG AGGGGAAAAA 480 

ATGGAAAATT CATCAATCGA TGTAGATATG CTGTTGGAAG AATTGACACA AGAAGCAATG 540 

GTC G TT G TTG CTGTTGATAA GGACTGTTAA TTTAAACTTA TCGCAATATA TGAAAGGTTA 600 

CTGGATGTTT TAAATTATGC AGGCAGTAGC CTTTTATTAT ATACAAATGG ATAAAGTAAG 660 

GATAATACAA TGATTAATAA AAAAATACAA CAA G TT G TTT TGGAATCATT ACAGAATTTT 720 

TTGAATGGGA ACTTCATTTC GCCTTGTGTA GTCTATGATT TTGCCTTGCT GGAAACTGTA 780 

CTTGATGAAT TTAAAAATCA AATTCCTGTA ACATTCAATT ACCAACTTTT TTATGCCGTT 840 

AAAGCAAATT CAAATG AG AA GATACTTGAA TTCTTAGTAG ATAAAATTGA TGGAGTTCAT 900 

GTGGCGTCAT TATCTGAATT AGATGTGGCT AAAAAATTTT TCCCACCAAC TCAAATTTCT 960 

GTTAATGGTC CCGCATTTTC TTATGAAACT TTATATAATC TGATTAAAAA ACAATATAAA 1020 

GTTGATATTA ACTTTTTGGA ACATCTTCAA CAATTTTCCC CAAAAGAATC TGTTGGAATA 1080 

AGAGTAACGG AGCCAGATGA ACTTAATAAT CGTATGAGTC GATTTGGAAT AAATATTTGC 1140 

AGTGATAATT GGACTAGTAA TTTACAAAAT CCTTTAATTA CACGACTGCA TTTTCATTTT 1200 

GGAGAAAAAG ATGATAAATT TATTGTTAAG TTAGATAAAA TATTATTTAA GTTACAAGAA 1260 

ATTAATAAAC TTAGACAGGT TAGAGAAATA AATCTTGGAG GCGGTTTTAT GAAATTATTT 1320 

ATGGAAAATC GTTTGAAAGA ATTTTTTCTA TCACTTATGG AAATCTATAA AAAGTACGAT 1380 

ATTGATAGTA CTGTGACTAC AATAATAGAA CCACCTACTG CAATTACTTC ATTTTCTGCC 1440 

TATATGATTA CTAGCCCAGT TAATGTTAGT GAGGTGAATG AGCAGCAGGT TATCACGTTA 1500 

GACACATCAA TATACACCAA TACATTATGG TTTGTTCCGC ATATTATTAC AACGTTAAAT 1560 

TCAACTAGTA AAGAGCGTTA TAGTACTATT CTCTATGGTA ATACCTCTTA TGAACATGAC 1620 

AAGTATAAAA TGAAAGTTTC GCTTCCAAGG TTAACTCAAA ATAGCAGTAT AGTGTTTTTT 1680 

CCTGTAGGAG CTTATATAAA AAGCAATCAT TCAAATTTAC ATCGTAATGA TTTTATGCCG 1740 

GAGGTATATT TGT GG AC AAA AAACTTGACA TATTAGATAA AGTTAAGGAA TATTTAGGAA 1800 

ATAAAACTAC TCAAATTCTG GATAATCAAT ATAAAGAATT TTTGAAACTT AATGATATAA 1860 
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GGCGACCCTT 


TGGTATTTCA 


GAAAAAGTAT 


taaacaattc 


TTTTAATTTT 


ACGAGTAAAG 


1920 


AATTTAATGA 


.TTTAATTAAT 


AACGAAAATT 


atttattcga 


ATATGCATGT 


AGAATTAGAG 


1980 


AHCAATGGAG 


AAAAAAATGC 


TTTAATCATT 


cttatcgttt 


TCTATGCTCA 


CCTATAATTA 


2040 




TCTTAACACG 


AAGACATTGA 


gaagtagcca 


AATTGAATAT 


AAATATGAGC 


2100 


V# A 1 ^ A i iniv 


GAAAAGTTCG 


ATAGGCGATA 


gagcgcttga 


TGGCTTTGTT 


TCCTTCAATA 


2160 


Li i * AALAwL 


TAATGGTATG 


TCTGCTATTA 


aactatgtct 


TGAGATATTA 


AACTCTATTT 


2220 


ILl 1 V» A/\V*r\A 


GAAGATTGAT 


TT ATT AT ATT 


caaccggata 


TTATGAAACA 


AGATTTTTAT 


2280 


1 AAA 4 AAA *- i 


TGCTAAATCA 


GGTATTAGTT 


gctatgaggt 


AAGTAATTGT 


GAATTGGATA 


2340 


AAUA 4 AAA 1 i 


TTATAATGTA 


TTCATGATGG 


aacccaatcg 


AGCCGATTTA 


ACATTACAAA 


2400 


AAAwlunl 1 1 




GAATATTTTG 


ttaagtataa 


AAATAATTCA 


ATAAAAGTCG 


2460 


TTATTTTAGA 


TiTTTTATAT 

1A1 1 


CAAGGTTCTA 


attttaaatt 


AGTAGAATTT 


TTACACAAAT 


2520 


TTAAATTTGt 


V^AA I w 1 1 4 


^\ A » * 4 » w * 


T A CG ATCTTT 


CATAAAATTA 


GATCAAATGG 


2580 


GATTAGAATT 


UAL AAA I VAA» 


CZCt AAT AAT AG 


AACTGTTTAT 


TCCTAATCAT 


TTGAGAAAGT 


2640 


TtiAAAAA ill 


TATTflAAflAn 


G AATTC AAT A 


AATTTAGAAA 


TTCTCACGGA 


GCTAATCTAA 


2700 


GCCTL,TATGA 


A J AL I I 1 




CTTT AACTTT 


AAAAAATCAT 


TGGAACTATT 


2760 


CTGATTTAGT 


TATftAAATTT 


ACCAGTAATT 


TTTATCCTCA 


TATAAAAGAC 


TTGTTCATGG 


2820 


AAAA 1 Ivl w A 


TATTGAAATC 


ATCCATGAAG 


AGGGAGTACC 


TTTTGTATTT 


TT AG ATTTAA 


2880 


1 Au v 1 unnuu 


TAAAAAAGAA 


TATGAAATGT 


TTTTTCAATG 


GTTAAACTTC 


TTTTACAAAC 


2940 




CACATTCTAT 


GCTAGAAATA 


CTTTTGGGTT 


TCGGAATCTA 


ACAGTAGAGT 


3000 


AT TTTGG AAT 


TATTGGGACA 


GAAAGATATA 


TATTTAAGAT 


TTGTCCAGCT 


GTTTATAAAG 


3060 


GGTTAAGTTA 


TTATTTGATG 


AAATTTTTAT 


TAAAATCTTT 


TTCAAATGAA 


TATTTAAAAA 


3120 


CTACTCATGA 


GGTTAATAGA 


TGAAAAATTT 


GATAAAGTTG 


f*»*r> k k«k k**pi 

CT AAT AA I I A 


uni 1 L> A 1 iul 


1 1 AO 


TAACTT AG CA 


GACACTGTAT 


TTTATATAGT 


AGCATTGTGG 


CACGTTAGCA 


ATAATTATTC 


3240 


TTCGAGCATG 


TTCTTAGGAA 


TATTTATTGC 


ACTAAATTAT 


CTACCGGATT 


TGTTACTAAT 


3 300 


CTTTTTTGGA 


CCAGTTATTG 


acagagtaaa 


TCCGCAAAAA 


ATTCTT AT AA 


TATCAATTTT 


3360 


GGTTCAATTA 


CCAGTGGCTG 


TAATATTTTT 


ATTATTATTA 


AACCAAATAT 


CATTTTGGGT 


3420 


GATAATGAGT 


CTACTGTTTA 


tttcagtaat 


GGCTAGCTCC 


ATAACTTACG 


TGATAGAAGA 


3480 


TCTGTTGATT 


CCT C AAGTGG 


tagaatatga 


TAAGATTCTA 


TTTGCAAATT 


CTCTTTTTAG 


3540 


TATTTCGTAT 


AAAGTATTAG 


attctatttt 


TAATTCATTC 


GCATCATTTT 


TACAGGTGGC 


3600 
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AGTACCATTT 


ATTTTATTGG 


TTAAGATAGA 


TATAGGCATA 


•P3PWC\CTTG 


CTCTATTTAT 


3660 


ATTGTTGTTG 


TTAAAATTTA 


GAACTAGCAA 


TGCGAATATA 


GAAAACTTCT 


CTTTCAAATA 


3720 


TTACAAGAGA 


GAAGTGTTGC 


AAGGTACAAA 


CTTTATTTTA 


AATAATAAAT 


TATTATTTAA 


3780 


AACCAGTATT 


TCTTTAACGC 


TTATAAACTT 


TTTTTATTCA 


TTTCAGACAG 


TAGTTGTACC 


3840 


GATTTTTTCT 


ATTCGATATT 


TTGATGGTCC 


GATTTTTTAT 


GGTATTTTTT 


TAACTATTGC 


3900 


TGGTTTGGGT 


GCTATATTGC 


GAAATATGCT 


AGCGCCAATC 


GTAATAAAAT 


ATTTAAAATC 


3960 


GAATCAAATT 


GTTGGTGTAT 


TTCTTTTTTT 


GAACGGCTCA 


AGTTCGTTAG 


TAGCAATTGT 


4020 


TATAAAACAC 


TATACTTTAT 


CACTTATTTT 


ATTTTTCGTT 


TGTTTTATGT 


CTAAAGGAGT 


4080 


CTTCAATATT 


ATTTTTAATT 


CGTTCTACCA 


ACAAATACCT 


CCACATCAAC 


TTCTTGGTAG 


4140 


GGTAAATACT 


ACCATTCATT 


CTATTATTTC 


TTTTGGAATG 


CCAATTGCTA 


GTTTAGTTGC 


4200 


AGCAACCCTT 


ATTGATTTGA 


ATATTGAATT 


AGTGTTAATT 


GCTATTAGCA 


TACCTTATTT 


4260 


TTTGTTTTCT 


TATATTTTTT 


ATACGGATAA 


TGGATTGAAA 


CAATTTACTA 


TATATTAGAA 


4320 


ATGTTTATGT 


TCATTCAAAA 


GCATAATGAC 


TATAACTGAA 


AAAGAAAAGT 


GATATCTTTA 


4380 


ACGTTCTTCT 


TCTTGGTGGT 


CAGATTCGTG 


AGACAACCCA 


AGCTTTTGTC 


GGAAAGATTA 


4440 


CCAATGCTTT 


GATGCATAGG 


ATCTACTTTA 


CCAAGATGTT 


TTTAGTGGTA 


ACGCTATCGT 


4500 


GGATCGACCT 


GTAATAACCT 


CTTCTTTCGA 


GGAGTATTTT 


ACTAAAAAAC 


T AG CCTTGG A 


4560 


GCGTTCCCCA 


GAAACGCACT 


TACTCATTCA 


CTCTTCAAAG 


ATTTGGGGAG 


AAGATTTTGC 


4620 


TTCATCTGTT 


CCTTGAAAAA 


AGTCACAGCA 


GTCATCACAG 


ACGATAGTAC 


TGAACAAAAC 


4680 


TATGAAGAGT 


TAGAAATTTA 


TACCCAGCTG 


ATTGTATAAA 


GGATCTGGAA 


ATAGATAAGA 


4740 


AGTTGATTAG 


TATTCACCTA 


GGTGGTACAA 


ATATTAAGAT 


tactgttctt 


TCAAATGACG 


4800 


GTGAGATTCA 


AACTTTGTGG 


AGTATTACAA 


CAGATACAAG 


TGACAAAGCT 


TCTCAAATTA 


4860 


TATCGGACAT 


CATCAGTTCT 


ATTAAAAATA 


AATTGACCGA 


ACGGAATATT 


CCTGATAGCG 


4920 


ACCTTCTTGG 


AATCGCTATG 


GGAAGTTGCT 


CATCATACTT 


TCCTTGTAAA 


TCATAGGGGC 


4960 


TATAAACTCT 


CCGTCTACTT 


GTCCTGCAAC 


AATTGAAGTC 


TCCTCAAAAC 


GCCGTCCGCT 


3040 


AATCTTTTCA 


TAGACTTTCT 


CCCTTTTAGG 


AGCCTAGCTT 


TCTAGTTTGT 


TCTTTGATTT 


5100 


TTATTGAGTA 


TACCACTATT 


TTACTCCCTC 


TGGCAAGGGA 


CTTTGTCTAT 


GTGGAGGGAT 


5160 


TGGCCTCCTA 


TGTGGTGGAG 


CTTTTCTGTT 


CTTTCTGAAA 


TATGGTATAA 


TAGCACTAAT 


5220 


CAATTTCTAG 


G AAAAT AG AT 


ACAGAAAGGG 


CCTGAAAGAT 


GTCT C AT ATT 


ATTGAATTGC 


5280 


CAGAGATGCT 


GGCAAACCAA 


ATCGCGGCTG 


GAGAGGTCAT 


TGAACGTCCT 


GCCAGTGTGG 


5340 


TCAAAGAGTT 


GGTAGAAAAT 


GCCATTGACG 


CGGGCTCT AG 


TCAGATTATC 


ATTGAGATTG 


5400 



WO 98/18931 



PCT/US97/19588 



452 

(B) TYPE : nucleic acid 

(C) ST HANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



CCAATTACAT 


TATATTATCA 


AAATCGTCGA 


AACTGGCTCC 


ATGAATCAGG 


CAGCCAAGCA 


60 


ACTCTTTATC 


ACTCAGCCAA 


GTCTCTCCAA 


TCCAGTGCGA 


GATTTGGAAA 


ATGAAATGGG 


120 


CATTGAGATC 


TTTATCCGCA 


ATCCCAAGGG 


AATCACCTTG 


ACCCGTGATG 


GCATCGAGTT 


180 


TCTCTCTTAT 


GCCCGTCAGG 


TTGTCGAGCA 


GACCCAGCTT 


CTGGAGGAAC 


GCTATAAAAA 


240 


TCCTCTCGCC 


CACCGCGAAC 


TCTTT AG CGT 


TTCCTCTCAA 


CACTATCCCT 


TTGTGGTCAA 


300 


TGCCTTTGTC 


TCTTTGCTCA 


AGAAAAGCGA 


TATCGAGAAA 


TACGAACTCT 


TCCTTCGTGA 


360 


AACTCCGACT 


TGGGAGATTA 


TCGACGACCT 


CAAGAACTTC 


CGCAGTGAGG 


TCGGGGTCCT 


420 


CTTCTTAAAC 


AGTTACAACC 


GTGATGTTTT 


AACCAAGATG 


CTGGATCACA 


ATCACCTGCT 


480 


AGCCCACCAT 


CTCTTCACAG 


CGCAACCGCA 


TATCTTTGTC 


AGCAAGACCA 


ACCCTCTGGC 


540 


AAAGAAAGAC 


AAGGTGAAAC 


TG TCTGATTT 


GGAGAATTTC 


CCTTACCTCA 


GCTATGACCA 


600 


AGGGACGCAC 


AACTCCTTCT 


ACTTTTCAGA 


AGAGATTCTT 


TCTCAAGAAC 


ACCACAAGAA 


660 


ATCCATTGTG GTCAGTGACC 


CTGCCACCCT 


CTTTAATCTC 


TTGATTGGTT 


TGGATGGTTA 


720 


TACCATTGCG 


ACAGGGATTT 


TGAACAGCAA 


CCTAAACGGA 


GACAATATCG 


TTTCTATCCC 


780 


ACTGGATATT 


GATGACCCGA 


TCGAGCTGCT 


CTATATCCAG 


CATGAGAAAA 


CCAGCCTATC 


840 


TAAGATGCGC 


GAACGCTTTA 


TAGACTATCT 


CCTAGAAGAA 


GTTCAGTTTG 


ATACTTGAGA 


900 


AATGATAAGA 


ACCAATATGT 


AGGCTAGCAA 


CAACCTGCAC 


ATTGGTTCTT 


TTTACTTATA 


960 


ATTAAAAGTT 


TCCCCTGCCA 


ACTTATCACC 


TAGCTTGGGA 


AAGAGAGTAT 


AAAACTTATC 


1020 


GGCTAGGTTC 


AACAAAATCG 


GGAGATTGAG 


TTCTCGTTTC 


TTTTTTCCTA 


TAATCTTGAC 


1080 


AATCTTTTTA 


GCCACTGCAT 


CTGGTTCTAG 


CAGCAACCCA 


TCAACCGATT 


TAACATAAGT 


1140 


TCCATCTGGG 


TCCGCTTGCT 


CGAAAAATCC 


TGTACGGATT 


GGTCCTGGAT 


TGACTCTTGT 


1200 


CACATAGACT 


CCATAGGGCA 


TAAGTTCGAG 


TCGCAGACCA 


TTTGAAAAAC 


CAATAGCCGC 


1260 


AAACTTGCTC 


GCTGAGTAAA 


GACTAGACTT 


GCCAGTACCT 


ATTAGACCTG 


CCATGCTGAC 


1320 


GATGTTGATG 


ATATGCC CTT 


TGCTGCTTTC 


CTTCATACGA 


GC CGCAAGGT 


GACGAGACAG 


1380 


ATTCATCAGG 


G CAAAGGT AT 


TGACCTCAAA 


CATCTGGTGA 


ATATCTTTAT 


CAGCAATCTG 


1440 


GTCAAATCCC 


TCAAAAATCC 


CGTAACCAGC 


GTTGTTAATC 


AAGACATCAA 


TCTTGCCATA 


1500 


GCGGAGATAA 


AGATCAGTTA 


CCACACCTTC 


TAGGGCTGAA 


TCCTCGCTAA 


TATCAATTTC 


1560 
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AGCGGATCCC 


CGTGAAACTC 


AAGCGGCTTG 


GCATCATGCC 


TTG AC w A L» I A 




9900 


AACTGTCATT 


CTCTTAACCC 


GTCAAAACTT 


GGTAGTTGAA 


GAAGGGACAG 


ACTTTGGTAA 




CGTCGCTAAA 


GGAGCCTACG 


TCGTGTATGA 


TACCCCGGGA 


TTTGATACTA 


JAMA % k fltfMM 

TTATCATTGC 


10020 


TACAGGATCT 


GAGGTCAATC 


TAGCTATCAA 


AGCTGCTAAG 


GAATTGGTTT 


TACAAGGTGG 


10080 


TAAAGTACCT 


GTCCTATCTA 


TGCCCTCAAC 


CGAACTATTT 


GATGCTCAAG 


ATGCTACCTA 


10140 


CAAGGAAGAC 


ATTTTACCAT 


CTAAGACTCG 


TCGTCGTGTG 


GCCATTGAAA 


TGGCAGCGAC 


10200 


CCAAACTTCG 


TACAAGTATG 


TTGCTTTGGA 


TGGCGCGGTC 


ATCGGTATTG 


ACATCTTCGG 


10260 


TGCGTCTGCC 


CCAGCTCAGA 


CTGTGATTGA 


TAATTATGGA 


TTTACGCTAG 


AGAATATCGT 


10320 


TGCTCAAGTT 


AAGTCCCTAT 


AGAAACCAAT 


TACAATGAAG 


ATACAGCTGT 


TGTCAGACTA 


10380 


GCAGATCTAG 


TGATACACAC 


TAATCAGATG 


ATTGGTTATT 


TAAAAACTCT 


AATGAAAATG 


10440 


TAATAATTTA 


TCTACGAAAG 


TTATAGTAGA 


TAGTATACAC 


AATAGAGTAT 


ACCCTGAAAC 


10500 


GGTTGCGAAG 


TACGCTAATC 


ACTTTGCTAC 


TGATCTAGAT 


actttcttta 


ATCAATAAAC 


10560 


ACAGCATCCA 


CAGATTGACT 


TAGGATATTG 


TAACTTTTTT 


GAAAGCTAGA 


GAGAAGGTCT 


10620 


CTAAAATTAA 


AAAACCCATA 


GTATAGGATG 


TTGAAATGAT 


GAACTGCACC 


CCAAAACTTA 


10680 


GACAGAAAAA 


AATCTAACTT 


TTGGCGTGTT 


TTTATTATGA 


AATTAACTTA 


TG ATG AT AAA 


10740 


GTTCAGTTCT 


ATGAACTTAC 


AAAACAAGGA 


TATATCTTAG 


AGAAGCTTTC 


AAATAAATTT 


10800 


GGGATAAATA 


ATTCTAATCT 


TACCTACATG 


ATTAAATTGA 


TTGATCGTTA 


CGGAATAGAG 


10860 


TTCGTCAAAA 


AAGGCAAAAA 


TCCTTACTAT 


TCTCCTGATT 


TAAAACAAGA 


AATGATTCAT 


10920 


AAAGTCTGAC 


ATGAAGCCTC 


GACTAAAGAT 


AGAGTTTCTC 


TTGAATACGG 


TCTCCCAAGT 


10980 


CGTACGATAC 


TTCTTAACTG 


GCTAGCACAA 


TACAGGAAAA 


ACGGGTATAC 


TATTGTTGAG 


11040 


AAAACAAAAG 


GGAGAGTACC 


TGAGACCGGA 


GAATGCCATC 


CTAAAAAAGT 


TAAGAGAACT 


11100 


CCGATTGAAG 


GAGGAAAAAG 


AGAAATAAGA 


AAGACACAAA 


TTGTTCAAGA 


ATTAATGACT 


11160 


GAGTTTTCGT 


TAGATCTTCT 


TCTAAAACCC 


ATTAAACTAG 


CTCGTTGCAC 


CTACTACTAT 


11220 


CACTTGAAAC 


AGCTAGATAA 


ACCAGATAAG 


GACCAACAGC 


TTAAACCTGA 


AATTCAATCC 


11280 


ATCTTTATCG 


AACACAAGGG 


AGATTATGCT 


TATCGCCGGG 


TTCATTTAGA 


ACTAAGAAAT 


11340 


CGTGCTTATC 


TGGTAAATCA 


TAAAAGAGTT 


CAAGGCTTGA 


TG AAAGT ACT 


CAATTTACAA 


11400 


GCTAGAATGC 


GACAGnAACG 


AAAATATTCT 


TCTCATAAAG 


GAG 




11443 



(21 INFORMATION FOR SEQ ID NO: 50: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5338 base pairs 



I 
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CAAACCAACT 


ATTATGACTG 


TAGATCCAGG TTTTGCAGGA 


CAACGCTTTT 


TGGAGTCTAC 


8100 


CTTGTATAAA 


ATCCAAGAAC 


TCCGTCAGCT 


TAGAGTTCAG 


AATGGTTATC 


ACTACATCAT 


8160 


TGAGATGGAT 


GGTTCTTCGA 


GTCGTAAGAC 


TTTCAAACAA 


ATTGATGTGC 


CAGGACCAGA 


8220 


TATTTATCTT 


ATAGGTCGCA 


GTGGATTATT 


TGGTTTGGAT 


GACGATATTG 


CCAAACCCTG 


8280 


CGATATCTGT 


TCTAGAGATT 


ACGAAGAAAT 


GACCGGAAAA 


ACAATGCCAA 


TCAAATAATG 


8340 


GTTTGAGAAG 


AAATTTATTA 


GTTAGGAGGA 


ATATATGTCA 


CTACAATCAC 


TTAACCCCAT 


8400 


TCGTTTTCTT 


GGCGTAGATG 


CTATTAACAA ATCTAATTCT 


GGTCACCCGG 


GAATTGTCAT 


8460 


GGGTGCTGCG 


CCAATGGCTT 


ATAGCCTATT 


TACAAAGCAC 


CTTAGAATTA 


CACCTGAGCA 


8520 


GCCAAACTGG 


ATTAACCGAG 


ATCGCTTTAT 


CTTGTCTGCG 


GGTCATGGAT 


CAATGCTACT 


8580 


GTATGCTCTC 


TTGCATTTAA 


CAGGGTATAA 


GGATGTATCC 


ATGGACGAGA 


TTAAAAATTT 


8640 


CCGGCAATGG 


GGATCTAAGA 


CACCTGGTCA 


TCCTGAAGTG 


ACGCATACGT 


CTGGTGTGGA 


8700 


TGCGACATCT 


GGTCCGCTTG 


GTCAGGGGAT TTCTACTGCC GTTGGTTTCG 


CCCAAGCAGA 


8760 


GCGTTTTTTA 


GCTCCTAAGT 


ACAACAAAGA 


TGGTTTCCCT 


ATTTTTGACC 


ATTATACTTA 


6620 


TGTTATCGCT 


GGAGACGGTG 


ACTTCATGGA AGGAGTGTCT 


GCGGAGGCGG 


CTTCTTATGC 


8880 


AGGTCATCAA 


GCTTTAGATA 


AGCTTATCGT 


CCTCTACGAC 


TCCAACGACA 


TCTGCTTGGA 


8940 


TGGTGAGACC 


AAACATACTT 


TCTCTGAAAA 


TCTTCGCGTC 


CGTTACGATG 


CTTATGCTTC 


9000 


GCATACAGTT 


CTGGTAGAAG 


ATGGAACAGA TTTAGCAGCA ATTTCTACAC 


CAATTGAGAC 


9060 


GGCCAAGTTT 


TCTGGTAAAC 


CCAGTTTGAT 


TGAAGTGAAA 


ACGCTAATTG 


GTTACGGCTC 


9120 


ACCCAATAAA 


AGTGGTACAA 


ATGCTCTTCA 


TGGTGCACCA 


CTAGGAGCAG 


AAGAAACAGG. 


9160 


ACCAACTCGT 


AAGTTTTTGG 


CATGGGATTA 


CGATCCATTT 


GAAGTACCAG 


AGGAAGTATA 


9240 


TTCTGATTTC 


AAGACAAATG 


TAGCGGATCG 


TGGTCAGGAG 


GCATACGATG 


CTTGGGCTAG 


9300 


TTTGGTGTCT 


GATTACAAGG 


TTGCTTATCC 


CGAAGTTGCT 


AGTGAGATTG 


ACGCTATTGT 


9360 


AGCTGGAAAA 


TCCCCTGTAA 


CCATTACTGA 


AAAAGACTTC 


CCTGTCTATG 


AGAATGGCTT 


9420 


CTCTCAAGCA 


ACTCGTAATT 


CGTCCCAAGA 


TGCTATTAAT 


ACAGCAGCAG 


TTTTACCAAC 


9480 


CTTCTTAGGT 


GGATCGGCAC 


ACTTAGCTCA 


CTCTAACATG 


ACCTACATCA 


AGGCAGATGG 


9S40 


CTTACAAGAT 


AAATATAATC 


CATTAAACCG 


CAATATTCAG 


TTTGGGGTAC 


GTGAATTTGC 


9600 


CATGGGAACA 


ATCCTCAATG 


GAATGGCTCT 


TCATGGTGGT 


TTACGAGTTT 


ATGGCGGAAC 


9660 


CTTCTTTGTT 


TTCTCTGACT 


ACGTCAAAGC 


TGCTATTCGG 


CTATCAGCCA 


TTCACCACTT 


9720 


GCCTGTAACT 


TATGTCTTTA 


CCCATGATTC 


AATTGCCGTT 


GGTGAAGATG 


GTCCAACTCA 


9780 


TGAACCAGTT 


GAACATTTGG 


CAGGTTTACG 


CTCAATGCCA 


AACTTGACTG 


TTATCCGTCC 


9840 
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GACATCTGAT TCATCTTGAC ACTCAGGGCA CAATAGGGGT AGAAAATGAA TTGAGTCAAG 6360 
AGCAGATTGA TGCAGCGGAT GTAGTTATTT TAGCACTTGA TGTTAAGATT TCTGCTATGG 6420 
AACCCTTTGA GCGTAAAAAG ATTATCAAGG TTCCAACAGA AGTGGCAGTC AAATCTCCCA 6480 
ATAAACTGAT TGCTAAAGCT GTTGAGATTG TTACGAAATA ACTGAAAATA TTTAAGGAGA 6540 
AAATATATGT TGAAACACTT AAACTTAAAA GGTCACTTAT TGACAGCCAT TTCCTATATG 6600 
ATTCCAATTG TTTGTGCTGC AGGATTCTTA GTTGCCATTG GTTTAGCAAT GGGGGGTGGT 6660 
GTTCCTGACC CTCTTGTAGC AGGAAAATTC ACTATCTGGG ATGCTTTAGC AACTATGGGT 6720 
CCTAAAGCCC TTGGTCTCTT GCCACTTCTT ATTGCTACAG GTTTGTCTTA CTCGATTGCT 6780 
GCTAAGCCAG GGATTGCACC AGGTTTTGTT GTTGGTCTAA TTGCCAATTC TGTTGGTTCA 6840 

GGGTTTATCG GTGGTATCTT GGGAGGTTAT ATAGCTGGTT TCTTGGTTCA AG CG ATT ATT 6900 

AAAAAGGTCA AAGTACCAAA CTGGATTAAA GGTTTAATCC CAACCTTCAT TATTCCTTTT 6960 

l GTAGCCTCTT TGGTAACTAG TTTCATTATC ATTTATATTA TTGGAGCGCC TATCCCAGCC 7020 

TTTACCAACT CGTTGACGAG CTTATTACAA AGCTTGGGAA GTGCTTCAAA TGGTTTGATG 7080 

GGGGCAGTTA TTGGAATTCT CAGTGCTCTT GACTTTGGTG GCCCACTTAA TAAAACAGTC 714 0 

TATGCGTTTC TGTTGACTTT ACAGGCTGAA GCTGTGAAAC AACCATTGAC TGCTTTACAA 7200 

TTGGTGAATA CTGCTACACC AGTTGGATTT GGATTGGCCT ATTTTATCGC GAAATTACTC 7260 

AAAAAAAATA TCTATACTCA AGAGGAAATC GAAACATTGA AATCCGCTGT TCCTATGGGG 7320 

ATTGTCAATA TTGTTGAAGG TGTAATTCCG ATTGTTATGA ATAACTTGGT TCCAGCTCTC 7380 

ATTGCAACAG GTATCGGTGG TCCTGTTGGT GGTGCTGTTT CTTTGACAAT GGGTGCTGAT 7440 

TCTGCTCTGC CATTTCGTGC ACTCCTTATC TTACCAACCA TGACTCGTCC ACTAGCTGGT 7 500 

ATTTGTGCCT TGTTAGCTAA CATTGTAGTC ACAGGACTTG TCTACGCGAT TTTGAAAAAA 7 560 

CCAATAAAAC ATGCAGAACC AGTTATGACT GTTGAAGAAG AGATTGATTT GTCAGATATT 7620 

GAAATTTTCT AAGAGGGTAA CGATGTCAAG AATTGAATTT TCACCATCTT TGATGACCAT 7680 

GGATTTGGAC AAATTCAAAG AGCAGATTAC TTTTTTGAAT GATAAAGTAG CATCTTATCA 7740 

TATCGATATT ATGGATGGCC ATTTTGTTCC CAATATTACC TTGTCTCCTT GGTTCATTCA 7800 

AGAAGTTCAA AAAATTAGTG ACACACCTTT ATCAGTTCAT CTGATGGTCA CAGACCCAAC 7860 

CTTTTGGGTA GATCAAGTTC TCGATTTACA ATGTGAGTAT ATTTGTATTC ATGCTGAAGT 7920 

TCTGAATGGT CTTGCTTTTC GTTTGATTGA TAAAATTCAT GATGCAGGTC TAAAGCCTGG 7980 

TGTTGTCCTT AATCCTGAAA CACCTGTTTC TACAATCTTT CCCTACATTG ATTTACTTGA 8040 
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ATTTTCTCTC 


ACCTGTATAT 


TTTTATCAGT 


448 

imiTfCTC GTAGTACTAG 


TATTCATCTT 


4560 


CTAGCACCTT 


CAAAACCTAC 


TATTGTTGAT 


racaa iiTTT ACAGTGTCTG 


TCAAAAAATT 


4620 


ATTCAAGAAA 


TTGAACAATA 


TTTTAGGATG 


& ft rrj "*t**t*t* iTY^ {"* JX.GTTGAGAT 


TGACTATCTT 


4680 


TATCAATACG 


TTGTATCTTC 


GAGATTGCAA 


ksftrr&*p*TTT <™TTf*f* OCC AA 
AAALLAt 111 ^ * i w*-*»***ww» 


GCTTCCTTTT 


4740 


TCTCAGCGAG 


TTTTAGATGT 


CACTCATTAC 




GGACAATAGA 


4800 


GAGATTGAAA 


CGACAGATCC 


TGACTTTGTT 


^k/«mwri ^?*rr*ATATf*AC 


TCCCTTACTG 


4860 


AGGAGATTAG 


ATAATAGAGT 


ACAGATTAAG 


AATAGTCTTT TA I ualaaa a 


TCTTTTAACC 


4920 


TATCCTAATC 


TGGTTAAAGA 


GTTAACAACT 


ATTTCTAAAG AAGTGAuTtT 


#\Vj ft A ft ft ft w ft 


4980 


TTTGCTTCCT 


TGAGTCTGGA 


CGAGATTGGT 


TTTCTAGTCT TATATTTTGC 


ACGGTTXCAA 


5040 


CAAAAGCGAG 


CACCTCCTCT 


AAAAACAGTA 


GTGATGTGTA CATCAGGTGT 


LuunAW ft ft 


5100 


GAG CTTTT AC 


GAGCACGATT 


AGAAAAGCAA 




TuATu ft ft * 


5160 


GCTTATCATC 


AATTAGATGA 


GCTCATAAAT 


CTATATCCAG ATTTAGATTT 


f*Jk*TTGTGACG 

I 1 VJ ft 


5220 


ACGCTAGCTT 


TGCAGGAACC 


AGCAAGTGTC 


CCGTTTGTCC TACTTAGTGT 


1 I 1 I 1 AnWw 


5280 


GAGGGTGATA 


AACAACGTCT 


TCAACCAAAA 


ATTCAGGAGA TAAACTATGA 


A 1 n/\ 1 W ft ft ft W 


5340 


GCTTGTCCTT 


ATGGATATAT 


CTGTTCAAAA 


TCCTCAAGAA GCCTACAAAG 




54O0 


TCAAATCAGC 


CTTCTTGTTT 


CTGAACATAC 


AGAAAAAATA CAAGAGCTTC 




5460 


TGAGAGACAG 


GGAAGTATAG 


AGGTTGCTAA 


AGCTGTTCTT CTACCACATT 


la* 4 \*rt/*V^»*W*« 


5520 


CTTTCAACAT 


CATGTCTTAG 


TGATTACTAG 


ATTAAAATCA CCTATCAUAU 




5580 


GGATATCCAC 


TGTGTTGACC 


TTATTATCGG 


TTTGGCCATT GCAGTATCAu 


ACGACAAGTC 


5640 


ATGTATTAAA 


ACATTCATGA 


GAAGACTAGC 


AGATGAATCA TTCATAAATC 


JLATTAAAACA 


5700 


GTTAACAAAA 


GAAGAATTAC 


GGGAGATAAT 


ATATGGAAAT LAAAvjAlMi i 


cttaatgtga 


5760 


CTCTGATCCA 


GACGGATTTA 


CAGATCCAGA 


GCAAAGAAGA Wi n l l l vjm* 


GCATTAGCTC 


5820 


AACTATTGGT 


TGAGACGGGT 


TATGTGTCTG 


ATAGAGACCA Al i i«iv.v»An 


GGTCTTT AT C 


5880 


AGAGAGAGGC 


AGAAGGACAG 


ACCGGTATTG 


GGAATTATAT TGCTATTCCC 


CATAGCAAGA 


5940 


CTTCTGCTGT 


GGAGAACGCG 


GGGGTAGTCA 


TAGCTATAAA TCACAATGAG 


ATTCCTTGGG 


6000 


AGACCATTGA 


TGGGAAAGGG 


GTCAAAGTAA 


TTGTACTCTT TGCAGTTGGT 


GATGA1WCAC 


5060 


AAGCTCCTAG 


GGAGCATTTG 


AAGACCTTAT 


CACTCTTTGC TCGAAAACTT 


GGTAATGACG 


6120 


AAGTTGTTGC 


CAAATTAGTT 


CCGGCTCAGA 


CATCTGATGA TGTGATTGCA 


G rjTTTTTGTT 


6180 


AATAAGAAAA 


AATTTTGGAG 


GGTATCCGTA 


TGAAAATTGT TGGTCTTGCA 


t GCTTGTACTG 


6240 


TGGGAATTGC 


CCACACTTAT 


ATTGCACAGG 


AAAAATTAGA GAATGCCGCA AAGGTAGCTG 


6300 



fi 
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& A AfrrftATCT 


ACTGACTCGA 


ATAACGTGAG 


CTGGTCTGCT 


ACTCTGTCTT 


CTTGTAATTG 


2820 


Ml 1L1 i av 


TATTCAGCTA 


TCACTTTCTG 


ATTACGGCCT 


ACCCTATCTA 


CATAATAGCC 


2880 


L L 1 ALALLAA 


AACTTGCGAT 


TGCCATATTT 


GTATTTTAAA 


TTCGCATGCT 


TATCAAAAAT 


2940 


tn I L AAAw, t V* 


t "I'CTTGCCCT 

W * W, A 4 WW WW * 


TTAAATAGCC 


CATAAAGGAC 


GAAACACTAA 


GTTTCGGAGG 


3000 


AA 1 AL X L»A 1 A 


ACCATGTGAA 

#^^^W#^ 4 *J A 


TATGGTCTGA 


ACAAGCATTC 


GCTTCATCGA 


TTATTACACC 


3060 


L I l ACGt i v. A 


f & Tl AfTTf AC 


GTATGATTCT 


TCCCATACTA 


CCTTTGTATC 


TGCCATAAAT 


3120 


^* k fiamff*^ ft ft 

GATTTGACGA 


pr» ft T ft 'fT"TV*in 


CTGCAAAAAC 


AATATGATAT 


TTACAATTCC 


ATCTCGTATG 


3180 


TGATAAACTT 


ll>Al i A i w w i 


CTCTCATGAG 


GTACCTCCTG 


TATGATATGT 


TGTAGTGGCG 


3240 


CAGAAACCAC 


TTCTATCTTA 


TCATTTTAGG 


AGGTTCTTTT 


TCTTACCACG 


CTAAAAGCTC 


3300 


TATGGAAc C A 


CTAOLA 1 MuL 


T1RTCCTTTT 


CGCGAGACAA 


CAAGAAAGAC 


TGCAATCTGT 


3360 


GGATTGCAGT 


H»« WW^f ft ^ ft 

n H i atala» 




CCTAGATCTG 


ATGTCCAAGG 


CCTACGTGCC 


3420 


GATCATCTAT 


CGGTGaALLL 


& A^Aftrvr Af*C 

/WtwAU w, w> w 


CTCAAGCCTG 


CTTGGATTGA 


GGTAATAGAT 


3480 


TCAAATATCT 


Ml ft / ■»l»l»>f* ft 

GTAGTTAGAL 


1 A 1 1 1 V*/W%V* * 


TTGATGTAAG 


AAAGAGAAAG 


CGACAGATTG 


3540 


AAGTAATTTT 


AACTCTLi TL 


TaTrr.rTACA 

x r\ i i uw a. 


ACAAATGGTC 


GGATAGGTTG 


GTACTTTGAA 


3600 


AATGAAGATG 


j Mi* ft mi "#i • h (wp 


TT & AATCiC AA 


CATAGTGTTA 


TTTATTAGAA 


AATCCTTTGC 


3660 


TTTATTTCTT 


ft ft ft ft T ft. ST* 
ATL AAA I ALu 


AAAACCAACT 


TAAATATTTC 


AACTAAAATA 


GATGTTATGA 


3720 


AGAAAAGGTA 


ft ft ft ^Ni^ % »^^p^pt 
AAA TuA 1 111 


flRCATAGTGA 


GGTTCTGTTC 


TATTTGATAT 


CATATTTTTG 


3780 


ft M k ft Ik % ft ft, ft 

ATAAAAACAA 


ft ft ft <p<" T'ff 6.T 
AAAr\# 1 LLA 1 


TfZCAAAGGAC 


AAAATGCGAA 


GT AT ATT ATT 


TTTTCAAACC 


3840 


GAT ATAATGG 


All LA I Anftu 


GACGTGTATC 


GTGTCTAGAA 


AACAAGAACA 


AATGGAAACG 


3 900 


Tib l 1 wL ILL 


^P^P^P^^^^^rt A^ A 
ill » 


TAGTAAGGAT 


TATATATCTG 


CTAAAGTATT 


GGGAGAAAAA 


3960 


11 AAA i 1 I* L A 




GGTTTATCGC 


CTTCTCAAGG 


GAATCAACAA 


ACATTCTCCG 


4020 


GTAGAAGCAT 


TCATTTTATC 


TGAAAAAGGC 


ALAGLfT 1 i LA 


A/\ 1 i AAA 1 w>w^ 


AAGAAGTTCC 


4080 


CTCGTGCACG 


TTGATGGGAA 


TTTTACAGAG 


GCTTTTGATC 


CTCAAGTAAG 


GCGTGAAAAA 


4140 


TTACTAGAAC 


GTCTCTTGTT 


GACTGCTCCT 


AAGCCACATT 


CTATTTATGA 


TTTAGGAGAG 


4200 


GAATTCTACG 


TAAGCGAGTC 


AGTAGTACTA 


AAAGATCGTC 


AGATATTACA 


AGAGAGTCTA 


4260 


GCAATTTATG 


GCTTAGATTT 


AAAAATGAGA 


CAACGAAAGC 


TTTTTATTGA 


TGGGGATGAG 


4320 


GCTCAAATTC 


GTTCAGCCAT 


TCTAAATCTA 


CTGCCAATGT 


TTAATCAGTT 


GGATTTAGAG 


4380 


CAAATTACAC 


AGAATAAGGT 


TCAGCCTCTT 


GACGGAGAAC 


TTCCTCACTT 


TTCTTTGGGA 


4440 


TT ACTG ATT A 


CACTTGAGAG 


AGAATTGGGG 


GTAAACATTC 


CCTATCCATA 


TAATATAAAT 


4500 
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TGCTTTTAAA 


AGGCCTGCCT 


CTCAAGTCAT 


CCGTTGTAAA 


CTCGTCGACA 


GCTCTTTTGG 


1020 


CAATGTTGCC 


TAAGGGAATT 


GCCCTTTTGA 


CCATTACTTC 


GCTCTTGACT 


GCAGTGATTA 


1080 


ACTTGGGCTT 


GAAAAAGGTC 


TTGGTGCAGG 


AGATGTACTC 


TCTTGAGACC 


TTGGCGCGCG 


1140 


TGCATATGCT 


CTGTCTGGAC 


AAGACGGGTA 


CCATCACCCA 


AGGAAAGATG 


CAGGTGGAGG 


1200 


CTGTTCTTCC 


GTTGACGGAA 


ACGTATGGTG 


AAGAGCCTAT 


TGCCACCATC 


TTGACTAGCT 


1260 


ACATGGCCCA 


TAGTGACCAT 


AAGAATCCAA 


CTGCCCAAGC 


CATTCGCCAG 


CGTTTTGTGG 


1320 


GAGATGTTGC 


TTATCCTATG 


ATTTCCAATC 


TTCCCTTCTC 


GAGCGACCGC 


AAGTGGGGGG 


1380 


CTATGGACTT 


AGAACGCTTG 


GGGACAGTTT 


TCTTAGGGGC 


ACCTGAGATG 


TTGCTTGATT 


1440 


CTGAAGTCCC 


AGAAGCTAGG 


GACGCCTTGG 


AGAGAGGATC 


ACGTGTCTTG 


GTCTTAGCTC 


1SOO 


TCAGTCAGGA 


GAAATTAGAC 


CATCACAAAC 


CACAGAAACC 


ATCTGATATT 


CAGCCTCTAG 


1560 


CCTTCCTGGA 


AATCTTGGAC 


CCCATTCGAG 


AGGGACCAGC 


AGAGACGCTG 


CACTATCTCC 


1620 


GTTCTCAGGA 


GGTGGGACTC 


AAGATTATCT 


CTGGTGACAA 


TCCAGTTACG 


GTGTCCAGCA 


1680 


TTGCCCAGAA 


GGCTGGTTTT 


GCGCACTATC 


ACAGCTATGT 


AGATTGCTCA 


AAAATCACCG 


1740 


ATGAGGAATT 


GATGGCCATG 


GCGGAGGAGA 


CAGCTATTTT 


CGGACGTGTT 


TCCCCTCATC 


1800 


AAAACAAACT 


CATC ATCC AA 


ACGTTGAAAA 


AAGCGGGACA 


TACAACGGCT 


ATG AC AGGGG 


1860 


ACGGGGTTAA 


TGATATCTTG 


CCCCTTCGTC 


AGGCGGATTC 


TTCTATCGTG 


ATGCCGGAGG 


1920 


GGGATCCAGC 


AACCCCTCAG 


ATTGCCAATC 


TGGTTCTCTT 


GAACTCAGAC 


TTTAATGATG 


1980 


TTCCTGAGAT 


TCTCTTCGAG 


GGTCGTCGCG 


TGGTCAATAA 


CATTGCCCAC 


ATCGCCCCGA 


2040 


TTTTCTTGAT 


AAAGACCATC 


TATTCCTTCC 


TCTTACCACT. 


CATCTGTATT 


GCCAGTGCTT 


2100 


TACTAGGTCG 


GTCAGAGTGC 


ATTTTCATTT 


TCCCCTTCAT 


TCCGATCCAG 


ATTACCATGA 


2160 


TTGACCAGTT 


TGTGGAAGGT 


TTCCCACCAT 


TCGTTCTGAC 


TTTTGAGCGA 


AATATCAAAC 


2220 


CTCTTGAGCA 


GAATTTCCTC 


AGAAAATCCA 


TGCTTCCTGC 


CCTACCAAGC 


GCTCTCATGG 


2280 


TCGTCTTCAG 


CGTCCTGTTT 


GTCAAAATCT 


TTCGCGCGAG 


TCAAGGTTGG 


TCTGACTTAG 


2340 


AAATCTCAAC 


TCTACTCTAT 


TATCTCTTGG 


GGTCAATTGG 


TTTCTTATCC 


GTATTTACAG 


2400 


CCTGCATGCC 


ATTTACCCTA 


TGGCGTGTCC 


TCTTGATTGT 


TTGGTCAGTA 


GGAGGTTTCC 


2460 


TAGCCACAGC 


TCTCTTCCCA 


AGAATTCAAA 


AACTGCTTGA 


AATTTCAACC 


TTAACAGAAC 


2520 


AAACCTTGCC 


TGTTTATGGT 


GTCATCATGT 


TGGTCTTTAC 


CGTGATTTTC 


ATCCTGACCA 


2S80 


GTCGTTACCA 


AGCGAAAAAA 


TAAATCAAAA 


CCACCAGTGT 


GAACTGGTGG 


TTTGTTCTGC 


2640 


GGCTATAAGC 


CGCTTCTACC 


GGCCAGGGCC 


AAAGGCCCAC 


CGAAATAGCT 


TCCTCGCGCA 


2700 


CCACTTTCCC 


GAGCAGGTGC 


TAAAGCACCT 


TACTTACTTC 


CTCTTATTTA 


TTTCGCCAGT 


2760 
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TTTTCCCCAT TGATTAACAC TTCTCCATAA GACCGACTAT ATAAACCAGC 

AACAATACAC TTTTCCCTGA GCCATTCCCA CCAGTAATTC CTATAATTTC 

CAACTAAAGT TAAG G TTTTC AAAAACACAT GTCTTTTTTA ATTTCAACTC 

AATGTAATTA TTTCATTCAT TCTATAAACC TCCTCTTTTG ACGAGTCAAA 

TTTGAAAAAG AAAGACTAAA AATAGCAACT GAAGAAATAA ATCTCGTCCT 

TCCCTCGATT CAAAATATAA AATAGATAAT TAGTTCGATT TCCTACAAAT 

ACACAATCAT GAGTAAAAAG AAACTAACGC AAGCAAAGTT CG 

(2) IN FORMAT I OK FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11443 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



TATTAATTTA 
CCCCTGTTTA 
AATATTTTTT 
TAGAAAATGC 
ATATCTCCAT 
AGACCACCAA 



24660 
24720 
24780 
24840 
24900 
24960 
2S002 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CAGCTACGGT GAGGCGCAAC TAAAATATAA TTTTCATCTT GATTAGGAAT TTTATCACTA 60 

TTATGATAGT GAGCATTGCC ATTGATGGAC CATAAGAGCA AT ACAACT AA TCCACGCAAA 120 

TAAGTATAAA ACATGCGATC TCCTTCGATT CTTTTCTTGT T ATT ATT AT A CCTTATCAAA 180 

GGAGGGCTGG CAAACTTTTC CCTTGACTAG ATACATATTT AGGATGAAAT TAGAATTCTG 240 

TTAAAAAAAA TGATATAATA GAATTTATGG ATAAAAATAA GATTATCGGA TTAACCCAAA 300 

GAGAAGTCAA GGAAAGACAG CCTGAGGGTT TCGTCAATCA CTTTACCGCA TCAGCCAGTA 360 

CCAGCACTTG GCAAATCGTT AAACGAAATG TCTTTACCCT TTTTAACGCT TTGAACTTTC 420 

CCATTGCTTT GCCTCTTCCC TTTGTGCAGC CTTGGAGCAA TCTGGTCTTC TTTGCTGTTA 480 

TCTGCTTTAA CGCTTTTTCT GGGATTGTGA CCGAGCTACG AGCCAAACAC ATGGTGGACA 540 

AGCTCAATCT CATGACCAAG GAAAAGCTCA AAACCATCCC TGATGGTCAG GAAGTTGCTC 600 

TTAATCCTGA AGAATTAGTG CTAGGAGATG TCATTCCTTT GTCTGCACGA GAGCAGATTC 660 

CTAGTGATGC CTTGGTTTTG GAAGGCTTTG CGGAAGTCAA TGAAGCCATG TTAACGGGAG 720 

AAAGTGATTT GGTGCAAAAG GAAGTTGACG GCTTACTTTT GTCAGGAAGT TTCCTAGCCA 780 

GTGCCTCAGT TTTATCTCAA GTTCACCATG TCGGTCCAGA CAACTATCCT GCCAAACTCA 840 

TGCTTGAGGC TAAGACCCTT AAACCCATCA ACTCCCGTAT CATGAAATCG CTGGACAAGT 900 

TGGCTGGTTT TACTGGGAAG ATTATCATTC CCTTTGGTCT CGCTCTCTTG CTGGAAGCCT 960 



DMcnnrtn- ^-w/n opiooh a? i ^ 
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AGCGTGTCGC 


TTTGCTGGAT 


AAACAACATA 


GCTTTGCCCG 


TCAATTATCT 


CGTGGACAGA 


22660 


AGCAACGTGT 


TCCAATTCTC 


CCTGCCCTCC 


TAATGCATCC 


AGAAATCATC 


CTTTTTGACG 


22920 


AGGTGACTGC 


TTCCCTGGAT 


CCAGAAATGG 


TGCGTGAGGT 


CCTGGAACTT 


ATCAATGATT 


22980 


TGGCCCAACA 


AGGCCGTACC 


ATGATTTTAG 


TAACCCACGA 


AATGCAGTTT 


GCCCAAGCCA 


23040 


TTACTGACCG 


GATTATCTTC 


CTCGACCAAG 


GGAAAATCGC 


TGAAGAAGGA 


ACAGCTCAAG 


23100 


CCTTCTTTAC 


CAATCCGCAA 


ACCAAACGAG 


CCCAGGAATT 


TTTAAACGTC 


TTTGACTTTA 


23160 


GCCAATTCGG 


CTCATATCTA 


TAAAGGACAT 


TCTTATGAAA 


CTATTCAAAC 


CACTCTTAAC 


23220 


TGTTTTAGCA 


CTTGCCTTTG 


CCCTTATCTT 


TATCACTCCT 


TGTACCTCAG 


CTGGAAACGC 


23280 


TGGTTCATCC 


TCTGGAAAAA 


CAACTGCCAA 


AGCTCCCACT 


ATCCATGAAA 


TCAAAAAAAG 


23340 


CGGTGAACTG 


CGAATCGCCG 


TGTTTGGAGA 


TAAAAAACCG 


TTTGGCTACG 


TTGACAATGA 


23400 


TGGTTCTTAC 


CAAGGCTACG 


CTACGATATT 


GAACTAGGGA 


ACCAACTAGC 


TCAAGACCTT 


23460 


GCTGTCAAGG 


TTAAATACAT 


TTCAGTCGAT 


GCTGCCAACC 


CTGCCGAATA 


CTTCATTTCA 


23S20 


AACAAGGTAC 


AT ATT ACT CT 


TGCTAACTTT 


ACAGTAACTG 


ACGAACGTAA 


GAAACAAGTT 


23580 


GATTTTGCCC 


TTCCATATAT 


GAAAGTTTCT 


CTGGCTGTCG 


TATCACCTAA 


GACTGGTCTC 


23640 


ATTACAGACG 


TCAAACAACT 


TCAAGGTAAA 


ACCTTAATTG 


TCACAAAAGG 


AACCACTGCT 


23*700 


GAGA CTT ATT 


TTCAAAAGAA 


TCATCCAGAA 


ATCAAACTCC 


AAAAATACGA 


CCAATACACT 


23760 


GACTCTTACC 


AAGCTCTTCT 


TGACGGACCT 


CGAGATGCCT 


TTTCAACTGA 


CAATACGGAA 


23820 


GTTCTAGCTT 


GGGCCCTTGA 


AAATAAAGGA 


TTTGAAGTAG 


GAATTACTTC 


CCTCGGTGAT 


23880 


CCCGATACCA 


TTGCCCCACC 


AGTTCAAAAA 


CGCAACCAAG 


AATTCCTAGA 


CTTCATCAAT 


23940 


AAAGATATTG 


AAAAATTAGG 


CAAGGAAAAC 


TTCTTCCACA 


AGGCCTATGA 


AAAGACACTT 


24000 


CACCCAACCT 


ACGGTGACGC 


TGCTAAAGCA 


GATGACCTGG 


TTGTTGAAGG 


TGGAAAAGTT 


24060 


GATTAGTCAT 


TAACTCTTAA 


AAGCAACTGG 


ATTTTAAGCT 


CCAATCCCTT 


TTTAAGATTT 


24120 


TACCTATAAC 


ATCCTGACTC 


TATCTAAGAT 


GTTCAATCTG 


AACACAGTGT 


ACATACTTTA 


24180 


TCTTCTATTG 


CATATACTTT 


ATCACATAAG 


ATACGAATAT 


CCTCTTCACT 


ATGACTAGCA 


2 1240 


ATCAAAATTG 


TTGTCCCTTT 


TTCACTAGAG 


AGCTTTCTAA 


ACAATGTTCT 


CATATTTTCT 


24300 


ACACTTGATT 


TATCCAAGGC 


ATTCATAGGT 


TCATCTAGTA 


AAAGAATACA 


GGGATTCTCC 


24360 


ATAATTGCTT 


GACCAATCCC 


TACCTTTTTC 


CTCATACCTA 


GCGAATAAGT 


TTTAACTTTC 


24420 


TGGTCTTTTT 


GCTCATATAG 


ACCAACTATT 


TTC AGTGT AT 


CATTGATTTC 


CTGATTACCA 


24480 


ACTACTCCTC 


GTATGCTTGC 


CAAATATTGT 


AAATTCTTAA 


AGCCACTATA 


ATAATTTATA 


24540 


AAACCAGGTT 


CTTCAATCAA 


AGCTCCCAAA 


TTACCTGGAA 


TTTTTCTCTC 


AGGAACAATA 


24600 
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i Anu l\jk. 1 nw 


AAT AACTGTJV 

1 /\^\w 1 \J A r\ 


TTATTTCTAG 


ATGGCACGTT 


CTATTTTTGG 


ATTGATCCAT 


21120 


lull Vjn/M»/V\ 


TATCTACCAC 


TATATCAAAA 


GGCATTCTTT 


CTGACCTTGC 


ATATTGCAGT 


21180 




TTC3 GG ATC C*T 


TTCTCCTCGG 


TTTAATCCTT 


AGTATCATCC 


GACATTATCG 


21240 


AATCCTTGTT 


1 1 uvtbCnnu 


TAGCCACAGC 


CTACATTGAA 


TTGTCACGTA 


ATACGCCCCT 


21300 


TTTGATTCAA 


CTCTTCTTTC 


TPTAfTTCGG 
1 1_ l nw i l \»vjvj 


TCTTCCCCGA 


ATCGGGATTG 


TCCTATCTTC 


21360 


AGAAGTCTGT 


GCAACGw 1 it» 


GGCTTGTCTT 


TTTAGGACGC 


TCCTATATGG 


CAGAATCTTT 


21420 


CCGAAGTGGG 


CTGGAAGCCA 


* I, AG ItnAAL 


w w AwwAwWW 


ATTGGCCTCG 


CTATTGGTCT 


21480 


GACACCTCTA 


CAGGTCTTTT 


/(V. 1 A lu 1 uu A 


TCTTCCGCAA 

1 w 1 1 Cv\A*no 


GCAACAGCGG 


TGGCACTCCC 


21S40 


CTCC TTTAGT 


GCCAATGTCA 


7TTTCCTTAT 


WAnuunnAV. w 


TCTflTTTTCT 

1 W 1 1 1 A * W A 


CAGCAGTGGC 


21600 


TTTGGCCGAC 


CTCATGTACG 


TCGCCAAGUA 


L 1 I AjA 1 1 w 1 


W 1 W 1 W 1 1 w 


AGACAGACAT 


21660 


TGCGCTAGCT 


ATGTTGGTAG 


TTGCTTATCT 


AA I w A IVV. * w 


W 1 AW.W Wn 1 w 1 


CACTGGTCTT 


21720 


TAGCTGGATA 


GAAAGGAGGC 


TCCGCCATGC 


AOGATTCuVjIj 


A ATf*P A. AfiT A 
AA 1 \_L_ AAw 1 " 


CTCTTTCAA(3 

W A W AAA W*^^**# 


21780 


GAAATAATCT 


CCTGAGAATC 


TTACAGGGAT 


IaAiOLu 1 1 Aw 


Li A I 1 uUun l A 


A W A W W A A 


21840 


CTGTCCTCTT 


ATCCATGATG 


TTCACAACAO 


1 LAI uuuftA 1 


r atcatgac c 


TCCCATTCTA 


21900 


GAATCATACG 


ATTTTTAACA 


CGAl Idi Alt. 


1 UUAA 1 1 I r\ 1 


CCGTATCATG 


CCCCAGCTGG 


21960 


TGCTACTCTT 


CATCGTTTAC 


TTTGGCTTGG 


V. 1 wwAAAw 1 1 


T AAT ATCAAT 


ATCTCAGCTG 


22020 


AGACTTCAGC 


TATTATCGTT 


TTTACCCTCT 


fifV^GAACAGC 


TGAAATGGGA 


GACTTGGTAC 


22080 


GTGGAGCTAT 


CACTTCTCTC 


CCTAAACATC 


ALj I i »\»n«nvj 


TGCACAGGCA 


CTCGGCTTGA 


22140 


CTAATGTTCA 


ACTTTACTAC 


LALi\ * wA 1 w A 




CTTAAGAAGA 


CTGCTACCGC 


22200 


AGGCTATCAA 


IL1 l\» 1 wAw 1 


ff^fiATCATTA 


AAACCACTTC 




TTG ATTGGGG 


22260 


TTGTGGAAGT 




f?f?A£*AA£^AAA 


TC AT CGAT AG 


CAATCGCCTG 


ACCATCCCAA 


22320 


CTGCTTCATT 


TTGGATTTAT 


GGAACCATTC 


TAATCTTATA 


TTTCGCAGTT 


TGCTACCCTA 




TTTCCAAACT 


ATCCACTCAC 


TTAGAAAAAC 


ATTGGAGAAA 


CTAAATCTCT 


GAAACTATCT 


22440 


TAG AAATCAA 


GGAACTAAAA 


AAATCCTTCG 


GAGACAATCC 


CATCCTCCAA 


CGACTTTCTC 


22500 


TAGAAATCAA 


AAAAGGGGAA 


CTTCTTGTCA 


TCCTAGGGCC 


ATCTGGTTGT 


GGGAAAAGTA 


22560 


CCCTCCTTCG 


TTGCCTCAAC 


GGCTTAGAAA 


GTATTCAAGG 


TGGAGATATT 


CTTCTCGATG 


22620 


GTCAGTCTAT 


CGTTGAAAAT 


AAAAAAGATT 


TTCACCTAGT 


TCGCCAAAAG 


ATTGGCATGG 


22680 


TCTTTCAAAG 


TTATGAACTC 


TTTCCCCATC 


TGGATGTCTT 


AC AAAACCTC 


ATCCTAGGCC 


22740 


CTATCAAAGC 


TCAAGGAAGG 


GACAAGAAAG 


AAGTAACGGA 


AGAAGCTTTG 


CAATTACTAG 


22800 



WO 98/18931 



PCT7US97/19588 



442 

AATCGTAATT CCACTTGATC CTTTTAAAGC ACATCGAGAG CAW 8 ' - Z AGA GAGCTAACTA 
AACAAGCCTA TCCAGTTTAT ATAAACAAAA AACTCCAATT ACAATCAAGA ATTAGAGTTG 
ACTTACAAGA TTAGACCCTT CATTTCACCA TACGAAAAAA CTGTTCACAT TTCCCTTCGC 
CAGTCTTAAC TGTATCAGCT TCAATGGGTA TTATCTCAGC CTAAAGCACC CCAAATGTCT 
CTATTATTTA ACTACTGAAC C ACT AT AGC A AAAAATGAAA CCCCTACCAA CATATTTGAC 
CCAAAAATAT CTTT AT AT AT AATATATTGA AACTAGAATA GTACACCTCT ACTTATAAAA 
CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC CT ATTCTT AT TTCATTTTAC 

TATAcrrrrc gatagcaatt tattcttcca atacacgaag aaaaacctcc acattcagtg 

GAGGCAATCT GTTTTATCAA TACAATTTTA AGTCACGAGG CTCAACTGGG AAGGTTGGGT 
TGTATGGATT GTGACGGAGC TTGAAGTGTT TGACATCTTC AATGGTCTGA GTTCCAGACA 
ATTGCATAAC TGTCTTCAAT TCCGCATTCA ACTGTTCAAA CACTTGACGC ACACCGACAC 
TACCACCGAG AGCCAAGCCA TAGATGACAG GCCGTCCAAT ACCAACCAAG TCTGCTCCTG 
ATGCCAAGGC TTTAAAGACG TGTTGACCAC GACGAACACC AGAGTCAAAG AC AAT CGGC A 
CACCTCTATC AACTGCTTCT GCCACTTCTT GAAGCGAGTC AAAGGCAGCT GGTCCACCCT 
CGATTTGACC ACCACCGTGG TTGGTTACCC AGATACCAGA AGCTCCT GC A GCAAGCGAAC 
CTTCAACGTC CTCACGGCAT TGTGGTCCCT TGACATACAC AGGAAGACCA GAGTATTCAG 
CGATAAATTC TACATCGCGT GGAGACAAGC GTTGTTTAGC TGATTTGTAA ACAAAGTCCA 
TTGATTTACC ACCACCTTCT GGCAGGTATT CTTCAACAAT CGGCATGCCA ACTGGGAAGA 
CAAAACCATT ACGCTTATCC ACTTCACGAT TCCCCCCTAC AGTAGCATCT GCCGTCAAGA 
CAATCGCTTT ATAACCTTCA GCCTTCACAC GCTCCATGAT GTGGCCGTTG ATACCGTCAT 
CCTTACTAAA CTAAAATTGA AACCAATGAG GTGTCCCTTG GAGCCCTTCA GAAATCTCTG 
GAAGGTCAAC AGTAGAGTAA GAACTGGTTG TATAAACAGA ACCAAACTCA TGCACACCAC 
GCCCAGTCGC CACTTCCCCC TGTTCATTTG CCAATTTATG AGCCGCAACA CGTCCCATAA 
TG ATT GG AGA AGATAGTTTT TCACCTGCAA ATT CAATCTC TGTACTTGGA TTTTCTACAT 
TGCAAAGTGT ATGAGGAACG ATGAGCTTGT GGTTAAAGGC ACCGATATTC TCTCTTAAAG 
TCAAAGTATC TTCCCCCCCA CTAGCGATAT AGCCAAATGC TG CTTT AGG A ATAACTTGTT 
GCGCCATTCG CTCCAAATCA TAGGTATTGA TGAArTCTAC ATGACCTTCT GCATTCCTTG 
TTTTCTATGA CATAAAATGT CCTCCTTAAT AAGTAAGCGT TTACTTTGTG TATTACAAAA 
ATATCTTAAC TCTTTTTCAA AACTTTTAAA ATATTTTCTT TGGAAATTTC AGAAATTTTA 
TGTCTATGAT AAAAATCCTT ATAACGGCAA TAAAAAATAG ATATTATCCA AAGAAGATTT 
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ACTCTTTTGA ATTCCTTTAC ATCTTCATCT GTAATCTCGT ATAAGGCATT TATAAATTCA 17580 

ACTTTAAATG TCCCAGGAAG ATGTCCATTT GGACGTTTTT CTGCTATTTC TCCAGCGATA 17640 

TTGTAAACCA ACACTGCTGT TTTTAATCAT TTCAATT C TT GACCTTTTTC TAGTCCGATA 17700 

AAGCTTGCTA CTACAGCTCC TAATAAGCAT CCTGTCCCAA TGACTTTCGG CATCATAGCA 17760 

CTACCATTAT GAATCATTAC CACTTCTCCA TTAACAGCAA TGGCATCCAC TTCACCTCTT 17820 

ACTACTATTG GAATATTGAA CTTCTCATTT GCTGCTAGAG CAATTTCCTC AATATTATCT 17880 

ACGCCCGCAC TATCTACTCC TTTAGATGCC ACATCTATTC CTACTAAAGA GGCAATCTCG 17940 

CCAGCATTTC CTCTAATCGC TGCTAGTTTA TAATTGTTGA TTAGATCATC TGCTACTTTT 18000 

TTTCTATATT CTCCTGCTCC ACAGGCTACA GGATCTAAAA CTGCTGGGAC ATTATATTTC 18060 

TCTGCAATTT TCAGAGCAGC TTGGTATAAT TTCCAATTTT CATCTGTCAA TGTTCCTATG 18120 

TTTATTAATA AACCACCAGC ATACTTTAAC AAATCCTCTA AATCTGCTGG AAACTCACTC 18180 

ATGGCTGGTG ACGCGCCCAG TGCTACTAAT CCATTTGCTG TGAAATTTTT TACTACATCA 18240 

TTGGTTATAC AAATGACCAA TGGTGCTTTT TCTTTTAATA ATTTTAAACT TGTCATATTC 18300 

AAATCCTTCC TTTTCACTTT ATACCATCTA CT AATTTCG A TTTATCTTTA GTTGAGAATT 18360 

TTTTTCATTT ACATTCAATC ATTT AT ACT C AATGAAAATC AAAGAGCAAA CTACCACCCT 18420 

AACCGCAGGT TGCTCAAAAC ACTCTTTTGA GCTTCTCGAT AGAACTCACC TGCTTTGAAG 184 BO 

AGATTTTCGA AGAGT C TTAC CTCATCAAAT TTGTAAATAT CATCAGCCTT CTCTAGACAT 18540 

CCTAACCAAT ATCAAAAAAA GCTAATTCTA AAGCCACTGC TTGATTCCAG CCTTCCTGAA 18600 

GTTCTCTCAA ATCTTCTCGA TTTTTACCGA CACGATTGAG TTCGTCAACC AGAAATTCAA 18660 

CCCACTCTGC AAAGAAAGGA CCTCTGTGGA GATTGATCCA TTCCGAATGA ATATAGACTT 18720 

CAGCTAAAGC CAAATCTTTA CAACCCCAGT CT AAAT AG AG ACCTTCTGCA ATGACCAGCA 18780 

TGACCAAAAG ATGGGCATAG TCTGATGAAG CCACCGCCGA ATACATTACA TCCTGAAAGC 18840 

C T TTTGTT AC AGGGTGCAAA CTCACTTCTA GATAGTCATT CTCTGCTACT TTTAACTCTT 18900 

TAAAAGCCTT TTGGAAATAA CCATCTTCAT CTGCTTCAAG AAAGCCTAGT TGCTTGGCAA 18960 

AACGAAGCTT GGATTCAACT TTATCTGCGT GACTACGCAG GCACCCAGCA TGGATAAGAA 19020 

GGCATCAAAG AAGTCATAAT CTTCAATCAG ATACTCCTTT AAGACCTTAT TCTCAATTGT 19080 

CCCCGCAAAA AGTTCCTTAA CAAAACGATG ATTGATTGCA GCCTGCC AAT CCTTCTGACT 19140 

GCTTTTTAAT AATTCTCCAA CACTCAAACC TGGCTGAAAT GCATAGTCTT GTGTTTCCAT 19200 

ATTTACTTCT CCTCTCTTTA CTTGTTAGTA ATTAATAAAA CACCAAGAAA TATCAACCAA 19260 
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C AG AG AACAT 
TTGCAAACTG 
CAGAAATAAA 
CTAACTTAAT 
GAGAGAGCCA 
CTTGAACAAA 
CAAGCGCAGC 
CAATGGGCGC 
CAAGACCTGT 
CACCAAAAAA 
ACATAAAAAA 
TTTCTATACC 
AATTTGAATT 
CCAGTTTCGT 
TTAGACAGTT 
CTCTATCAGG 
AATTATGTGA 
AGCCTTCCAC 
TGAATTAGCC 
CTTAGAATCC 
TTTTTGCATA 
CTTTAATATG 
TTCTCATCAA 
TACCAAATAC 
CTTCTTCGTT 
CAACACCAAT 
CTATATCGTC 
TACCTACCTC 
AT AAGGTT AT 
AAGTAGTCGT 



TGCGGACTGA 
ATG AT AGAGT 
ACGTACCACA 
GTATTCATAG 
ACTTTCATAG 
GCCAGACAAA 
TAGCACTTCT 
AGCCATACAC 
TAAGACTAGA 
GATAGACAAG 
CTCCTTTTTT 
AGACTGAATT 
TTCTAATTGA 
AAAAGAAAAA 
CTTTTCGACA 
TTCAATGGCT 
TTATTATAAC 
TCACAAAGAC 
ATTCAAGCTG 
AAGGATAGAT 
CGATATTGCC 
TTTCGTCTGT 
TTCAAGACCT 
AGGCCCAACA 
TCTTATAGAA 
ATCATCTTGA 
ATTAACGATA 
AACTTTTTCT 
ACCACCTTTT 
TCCACAAATA 



440 

ATCGCTCGTA GTTCCAGACC 



TTGACAATGG CACGAACCAA 
AGCAAGGAAA GGACAAACGG 
ACAAAGCTAA CAAGCGTAAT 
CGATTCTTAG TTACGATAAA 
AGAGCTCCTA GACCAAATTG 
CCAATCGTTG CACTTCCGAC 
CAGAGACCGA AGACGATTTC 
CTGAGAATAT TATACACATA 
AAAGCAAGCA AGATAACATC 
TAAAGAAAAG TGAGGCACTC 
TAGAACAGTA CACAAGAACA 
TTTGTTCGCA TCTTATTTCA 
ACTCTAATTA CAGATACAAA 
TACGAAAAAA ACATTTCACA 
ATCATCTCAG CCTAAAGCAC 
ACACATTTTA TACTAGTTCA 
AGCAGATCTT TCTTTTGCAA 
AATCTCGACA TAGCTTTTTA 
ATCTATTGTT CACTCATTTC 
GAAATGATTG AAACGCCATC 
ATCCCACCAA TTGCAACTAA 
TCATAACCTA TAGTACCACC 
CCTACATAAT CTACATATTC 
AGACCAATTA TTTTATCTCC 
CCTACATGTA CGCCATCGGC 
AATGG AACAT TGTATTTTTT 
AAGCCTTCTA AAGCACCCTC 
AAGGCTTCCT CAACGACTGT 
AAATATAGTT TTAGTAATTC 



CATCTATGCT 
GATGGTACAG 
AAGGGAAAAC 
CC C AAT ACT A 
ACCAAATTCA 
GCTACCATAA 
TCTCGGAACA 
ATTGGCAAAG 
TCCTGAACCA 
TTTTAACTGC 
AAGAAGACCG 
CTAAAATATT 
ATCTACTATA 
TTAGAGTTCA 
TTTCCCTTCC 
CCCAAATGTC 
AGAAATTGAA 
AAAACAAATG 
AAAAAGGAAA 
CCGAACAGTT 
CATATTGGTC 
AGGCATTTGT 
AGCATCATCC 
AACTTTTGAT 
CATCAATTTT 
GTCAATTTCC 
ACAAACTTCT 
ACCTTTTTCT 
ATATAGATTT 
TTTATGAAAC 



TTTCCTCCAG 
AAGAAATAAA 
GCGTAGTAAC 
TTAGCAGTTA 
CTTCCCAAAC 
AGGACTTCAG 
AAGATGGCAG 
GCCTCCAAAC 
ACGAAAACCC 
CATTTTTTCA 
ACCTAAATAC 
TCTAGAAATT 
TCATCTTCAT 
GCTTACAAGA 
CCACTCTTAA 
TTTATTATTT 
CTGGAAATAC 
ACCTGTTTGA 
ATCCTACTTA 
TTTTCTATAT 
TTTATAATGT 
GGCAATAGTT 
TTTGACTGCG 
TGTTGAAATT 
CTAATTTCAT 
AYTGCTAAAT 
TTAATTTGGA 
CGAAATTGAA 
TTTCCTTGGC 
ATCTTACTTC 
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CTGAATGAGA 


GGAAAAGAAT 


TGAATACCAC 


AATCAAGGCA 


TAGCACCAAG 


AGCCTGATAG 


14040 


CCCCTTTTGA 


GCCAAGTACA 


AGAGAAGCTC 


TTTTAGTGAA 


ACAGAGCAAA 


CAAAGACAAG 


14100 


GCCGATACAA 


ACTGTCACAA 


AGGCCCTCCT 


TCCAAGCATG 


ACTGCCTGTG 


AACCATCTCC 


14160 


GTGTAACTGA 


ACTGCCCAGT 


AGTTGGCAAA 


AGATGGTAAA 


ATGCCAAGTA 


TCATCATCCA 


14220 


AGCTAACATT 


TTAAATCCAC 


GGTAATAGAG 


CATAAAGAGA 


ATACAAAATG 


CGACTACCGA 


14280 


AAGAGTCAGA 


GCAATCGAAG 


GAATGAAAGA 


TGTTTCCAAG 


GATAAAATCA 


GCAAGAAGAG 


14340 


ACTCATAATC 


GGTGTCTGGG 


TTGCTACTTT 


GACCATACTA 


TCTCACCTCC 


CCTTGGGTAT 


14400 


TG CT ACTCTG 


AGATGTAAGT 


GGTTTGGTAA 


TGGTCACTTC 


TTTCACATGC 


CGAAGACCCT 


14460 


GACTAGTCAT 


CTCAATCCAA 


TAATCAACCA 


CAGAAATCAA 


AGGGTCTAAA 


CGATGACTAA 


14S20 


TGAGCAGAAA 


ACTTCTTCCT 


TGATTCCTCT 


CCTCCACAAT 


CCACTTGCAA 


AAATAATGGC 


14580 


AGGCTCTATC 


ATCCAAACCT 


CCAAAAGGTT 


CATCTAGCAA 


GATCACGGAA 


GCCTTACTGC 


14640 


TCAAGATCCT 


CAGGAGCTGA 


AGAATTTTTT 


CCTGACCACC 


ACTTAATTGA 


TAGGGACTCT 


14700 


TATCCACTGC 


CTGCTCCAAA 


TCAAAATATC 


GTAAAGCTTG 


AAAAATCCGC 


TGATTTCTTT 


14760 


CAGAATCAGG 


TCCATCTAAT 


TGAAGCTCCT 


CTCGCAGACT 


GACTCCGATA 


AACTGCTTCT 


14820 


CAGCTTCCTG 


AACAACACCA 


CTCAGATCAC 


GATACAAACT 


CTTTTTCTTT 


TTCAGG AC CG 


14880 


AACCCTTCCA 


AGTAATGCTC 


CCCTTATACT 


TTTCAAATTG 


AAGAATAGAC 


CGAAAGAGGG 


14940 


TTGATTTCCC 


GACACCATTG 


TCACCCAGCA 


TACAGGAAAT 


CCCTTGATAG 


AATGTGAAAT 


15000 


CAGCAATTGA 


AAAGAGGGGG 


CGATTACCAA 


CCTCACCACT 


CACACGCTTC 


ATATGCAATA 


15060 


GTTCCGCGCT 


ACAAGCAACT 


TCCTTTGAAG 


CAACCTCTCT 


CATCTCATAG 


GAAGGGATTT 


15120 


GAAACACTTC 


CCTTAGTTTT 


CCCTCTCTTA 


GCTCCACCAT 


ATGGTCGATA 


TAGGCTTTAT 


15180 


AGTCAGATAA 


ATCATCGTCG 


CACAAAATAA 


CTCTCTTCCC 


ATCATAGACC 


AACTCTTTTA 


15240 


GAATCTCCAA 


TATCTCGATT 


CTGCTCTTGC 


GGTCAATGGA 


AGCGAAGGGC 


TCATCCAAGA 


15300 


GATAGACCCT 


AGGATTCATG 


GCAAAGAGGA 


CAGCCAGCGC 


TCCTTTTTGC 


TTtTCCCCAC 


15360 


CTGATAAGTG 


ATGCATGACA 


CGGTCCAAGA 


TGTCCTTGCA 


ACGACATTGC 


TGGACAACCT 


15420 


CTGCTATTTT 


AGAATCAATT 


TCCTGAAGGT 


GATAGCCGAT 


ATTTTCCATG 


GTAAAAACCA 


15480 


ACTCCTCAAA 


CAAGCTCTCC 


ATGGTAAATT 


GATGATTAGG 


ATTTTGCAAG 


AGAATACCAA 


1S540 


CCGTCTGGAC 


ACGTTCGACG 


ATAGAAAGCT 


GACTGACCTC 


GCTCCCATCT 


ATCAGGACTT 


15600 


GACCGCTATA 


GGGAAGAGAA 


CTAACTTGGG 


CAATCATTTG 


AAAGAGGCTG 


GATTTTCCAG 


15660 


ACCCACTACT 


CCCAACTAAC 


AAGGTAAAGG 


CTTCCGCATG 


AAAAGTAAAA 


TCAAACGCCT 


15720 
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TTAACTAAGC 


TCTTACGAAT 


CGATGAAGTC 


CTTACGCCAA 


CCGCATCTAC 


TACCATCGGG 


12240 


AGAGAAGATT 


GGTTTGCATA 


CGAAGCTGCC 


ATGCGGATTG 


CTTTTTCCTT 


CTCAGCTGAC 


12300 


AAATGCCCCA 


AATTGATCAA 


GAGAGCCTGA 


CTTTGCTTAG 


TAAAATCAAG 


AACTTCACGG 


12360 


GAATCATCTG 


CCATGACAGG 


TTTGCATCCC 


AGAGCCAAAA 


TCCCATTTGC 


CAGCATCTCA 


12420 


CAAGAAATCT 


CATTGGTAAT 


GCAGTGAATG 


AGGGAACTAG 


AGCCTATAGG 


AAAGGGATTT 


12480 


GTAAATTCCT 


GCATCAGTCT 


ATCCTTTCAC 


TAAAGAAATA 


TCCCTGCACT 


TTTTTAAAGA 


12540 


ATTCCTGCTT 


GATTAAAAAT 


CGAAAGGCAA 


TAAAGGAAAT 


CGCTGTACCA 


ATCAAGGTTG 


12600 


CTCCGAAAAA 


TCGAGGCGTG 


TAGATAAACC 


AGCTAAGCTT 


AGCAGCTGAT 


CCTGTAAAGA 


12660 


CTACCATAAC 


AGGATAGGAA 


ACAATGGAAC 


CAATAATACC 


TCTTCCCAAA 


ATCTCTCCTA 


12720 


GAGCAGAATA 


GTGAAATTTT 


CGACCGTACT 


TATAAAACAG 


ACCTGCTAGA 


AGGGCTCCAA 


12780 


AAGTCGCTCC 


TGTGAGAGCT 


AAAGGCGGAA 


TCCCTTGAGT 


CGTCATACGG 


ATAAAGGCTG 


12840 


TGACTGTAGC 


CATAGCCAAG 


GCATAAACAG 


GTCCCATCAT 


GATTCCTGCT 


AG AAT ATTGA 


12900 


CTACACTGGA 


CATCCGTGCC 


ATTCCCTCAA 


TTCCAAAGAT 


ACGTGTAAGG 


ACTACATCAA 


12960 


CGGCAATCAT 


CATAGATAAA 


ATGCTTAATT 


TGTGAACTTG 


TAATTGGTGC 


TTTCTCATGC 


13020 


TTCTATTCTT 


CTCCTTTTTC 


TAAAGACTGT 


AAATCGCTCT 


TCCATGTCTG 


GTGTTGGTAG 


13080 


GCCATTTCCC 


AAAACTTGGC 


TTCCATATGA 


ACACTGATGT 


GGAAGCCATC 


TAGCATTTTT 


13140 


TGCTTGTCTG 


TCTCGTCACT 


TTCTCGATAG 


AGCTGATTGA 


CCACTCCTCC 


CTCCTCTCTG 


13200 


ATCTGTTGCT 


CTAACTCATC 


CGTAATATAA 


GTTTCAATCC 


ATTGTTGATA 


GAGAGCATTT 


13260 


GGTGATGGTT 


TAAGATTAAG 


TGATTTGCCT 


ATATCATCCT 


ATAACCAAGG 


ACAAGGAACC 


13320 


AAGCTTGCAA 


AAGCGATCGC 


TAAGTTCGGT 


TCTGCAAATT 


GCCTATAAAT 


ATGAGAAATG 


133BO 


TAATGATAAC 


AGGTTGGAGC 


GATTGGATGT 


TGCTCCATTT 


CCTGGTCGCT 


GATTTCCAAT 


13440 


TCCTTGAAAA 


ATTGTTGGCG 


AATAAATAAC 


TCACCCTCCA 


CTAAACCCTG 


AGCATTTTGT 


13500 


TTCAAGAGTC 


TTTTCATCTC 


TTGCTTTCAA 


CTCTTATCAG 


CCAAAAGATG 


ATAGATTTCT 


13560 


GAGAAAGCCT 


TCAGATAGTA 


GGCATCCTGA 


ATCAGGTAAT 


AGCGGAAAAT 


GGCAGGTTCT 


13620 


AAATTCCCCT 


CTTGTAATTG 


TAAAATAAAG 


GGATGATGAA 


AGGAACCCTG 


CCAAGCTTTC 


13680 


TTGGATAATT 


CCATCGCAAT 


ATCTGTAAAT 


TCCATAATAA 


CTCCTTTATA 


AAAATAGACT 


13740 


GGTTTGAAGC 


AATAAAAAGA 


AAAGCAGGTA 


GATTAATTTT 


GTTTTTTTAG 


GAATATAAAA 


13800 


AGTCCGATAG 


CTATTCTTCA 


ACTGTGCATG 


TTCGTCATAT 


CCGTGAGCAG 


ATAGAGCTCT 


13860 


CAGGTAAAGA 


TGGCGCCACC 


TAAAGACTGT 


CATCAGAACC 


TTACTGTAAA 


TC AAGGGCG A 


13920 


CCAAAAATGT 


AGTTCTTGAC 


CACGTAATAG 


GCAACCTTCT 


TTGAGGGACT 


TGATTTCTTG 


13980 
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AAACAAACTT ACTACCTCTA GCAACTATTA TTACGCCAAA TCTTCCTGAA GCAGAAGAGA 10500 
TTGTTCCTTT TTCAATCCAT GACCCCGAAG ACATGCAGCG TGCTGGTCGC CTGATTTTAA 10S60 
AAGAATTTGG TCCTCAGTCT CTGGTTATCA AAGGCGGACA TCTCAAAGGT GGTGCTAAAG 10620 
ATTTCCTCTT TACCAAGAAT GAACAATTTG TCTGGGAAAG CCCACGAATT CAAACCTGTC 10680 
ACACCCATGG TACTGGATGT ACCTTTGCTG CAGTGATTAC TGCTGAACTA GCCAAGGGCA 10740 
AGAGTCTTTA CCAGGCAGTT GATAAGGCCA AGGCCTTTAT CACAAAAGCT ATTCAAGATG 10800 
CCCCTCAACT CGGTCATGGT TCTGCTCCAG TCAACCATAC AACTTTTAAA GATTAAGAAA 10860 
AAAAACTCTC TAGTTCCCAC TTTAAGCGAA TTAGAGAGTT TTTATACTCT TCGAAAATCT 10920 
CTTCAAACTA CCTCAGCTTC CATCTGCAGC CTCAAAACAC TCTTTTGAGC TGACTTCGTC 10980 
AGTCTTATCT AAAACCTCAA GGCAGTACTT TGAGCAACCT GCGACTAGCT TTCTA G TTTA 11040 
CTCTTTGATT TTCATTGAGT ATTAATTAGG AAAGAATGTT ATGCAACTTT TTTAAAAAGG 11100 
CTTGCGTTTT TGCCTCAATA TCTTCTGCTT GCATCAAATC ACGTACAACA GCTACACCAG 11160 

CTATGCCAGT GCCCATAAGC TCATCAATAT TCTCCGAAGT CAAGCCTCCA ATAGCAACTA 11220 
CTGGAATGCC AACCGTTTGG CAAATTCTTT TCAAGGTCGA TATCAGAGTA ATGGGCGCAT 112 80 

TTTCCTTGCT CGT GGTTGGG AAAATGGCTC CTGTACCCAA GTAATCTGCA CCTGATTTCT 11340 

CCGCTTCCAG AGCTCTTTTA ACCGTTTTAG CGGTGACACC GAGGATTTTT TCAGGACCCA 11400 

ACACTTTCCC AGCTACCGAA ACTGGTAATT CATCATCTCC GATATGCAGA CCTCCTGCAT 11460 

CAACCGCAAG ACAAACATCC AACCGATCAT CGATTATCAA GGGTACCTGA TAAGCATCTG 11520 

TTATTTCCTT GACTTGTTTT GCCAGTTGAT AATATTGATT GCTTGTGAGA TTTTTTTCTC 11580 

CCAATTGGAC TATGGTAACC CCTCAACGGC AGGCCGTCTC AACTTTTGCA AGAAAGCTTT 11640 

CCACGGAATC TTGATAGCGA TTGCTTACCA GATATACTCT AAGTCCTTCT CTATTCATAA 11700 

ACCTCTCCTT TGATGGTATC TAGCCAATTT TCATCTCTTC TTAGGACCGA AAGCTGATTC 11760 

AGTACTTGGT AACGAAATTC TTCCAATCCC ATTCCTTGAA CAACTATTTT CTCAGCAGCG 11820 

ATATTGAGAT AAGAGACTGC TAAGCAAGAA GCTTCAAAAC CAGTCTTTCC TTGGCTGAGA 11880 

AAAACAGCTG TTAAGGCTCC AACCAAGTCT CCTGTCCCTG TTATCCACTC TAATTCAGTA 11940 

CAGCCATTTC CCAGTACAGC GACCTGATTT TTCGAAACGA CGAGGTCCTT GGCACCTCTG 12 000 

ACTAAGAAAG ACATACCAGG ATAGCTCTGA CACCAGTCTT TCAAGACTTG AAGCAAATCC 12060 

TCCGTTTCTT GATCTTTAGC ACTCGCATCG ACCCCAACGC CGTGGTCCTT TAATCCAACA 12120 

AGACTTCGAA TTTCTGACAT GTTTCCTTTA AGGACCGTAG GTCTATAGTC TAAAAGGTCT 12180 
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TGCATCTTCA 


TGCCACAAGA 


AAATTCATGT 


CGCCCTAATT 


CTTGAGGCCT 


AAAACGAATG 


8700 


ACTTTCTCCT 


CATCTACGCC 


GATTGGTTCC 


AACATACCTT 


CTTCTTCAAA 


CAGAATTTCC 


8760 


TTATAACAGT 


TTGAACGAGT 


AGCACGATGA 


AACCTAATCT 


CAGCTGGAAT 


TCCCTTTTGA 


8820 


AGCTGGATAT 


GGGCTGGATG 


ATACCCTTTT 


TCAGCTCGGA 


TACGGATTTT 


TTGAATGCCA 


8880 


TTTTCTAAGC 


TTGCTTTCAC 


AATTTCTGTC 


ATAGTCTCCA 


CCTACTCTAC 


AATCATCTTG 


8940 


CCGTGCATCA 


TGTTCATACC 


ACAAGCAAAG 


CCAAACTCTC 


CAGCCTGTTC 


AGGCGTCATT 


9000 


TCCACTACAT 


ACTCTTCCCC 


CATTCCCAGG 


TTCGCATCTA 


CACCAAAATC 


TGGAAAAACA 


9060 


ATTTGATCCA 


GACATGGTGA 


AGGATCCTTG 


CGGTCAAAGA 


CAATGCGTGC 


TGGCACTGAT 


9120 


TTCTTGAGGA 


CAATCAACTC 


AGGAGTATAG 


CCTCCCATGA 


CTTCCACTCG 


AATCTCTTGG 


9180 


TATCCGTTTT 


TTTGCTGGGC 


TTTTTGTCCA 


GATTTTTCAG 


GCTTTTTGAA 


AAACCAAAAC 


9240 


AAGATAAACG 


CGATAAGGGC 


AATACAAATA 


ATGGTTACAA 


TACTATTTAA 


CATGACGTCT 


9300 


CCTTTACATA 


CAATTACATC 


TTACTTCTCT 


TACAGCACTT 


GATTTCTTCT 


CTGAAATCAC 


9360 


AGCTTCCAAG 


TCTTCCAAGT 


CACTCTGAGT 


AAATTCACAT 


TCTACAATCA 


ACTCAGCCAA 


9420 


CAAATTCCTA 


ATCCTACGGG 


AACAAACCTT 


GTCTTTGATA 


TCTTGGACAA 


GTAAATCCCG 


9480 


ACTTTGGTCT 


AGAGTTAAAA 


GGGCTGAATA 


AACAAAGGAC 


TTGCCTTCTT 


TTTTCCGAGT 


9540 


CAAACACTCT 


TTATCAACCA 


GACGAGCCAA 


AAGTCTCTCA 


ACCCTGCACT 


TGGACCAGTC 


9600 


AAACCGCTCT 


CCCAAAACCC 


TAATCAAATC 


TCTACTGGTC 


TGCTCCCCCT 


GCATCCAAAT 


9660 


AATCTTCATG 


ACCTGCCATT 


CTGCATCTGA 


AATCTGCATT 


ACCATACCTC 


CAAAATCTAC 


9720 


ATTTGTCAAT 


TACACTCATC 


AGTATACTCT 


TAAAATCTAC 


ATTTGTCAAT 


TATAGAAATA 


9780 


ATATTTTCTT 


CGAAAAATAG 


AATTTTAATC 


ATTTGAAAAA 


CGATTTGCAG 


TCAAATATTA 


9840 


CTATATAAAC 


AATAAAAATA 


TGCTATACTA 


AAGAAAAAAG 


AAAACAACCA 


CTAGGGGTGC 


9900 


GTAAACCTGA 


GATTAACGAC 


TCTTAGATCC 


CTCTGACTCA 


ATCTACGTAA 


TGCTACCTGA 


9960 


TGGAAGTGCA 


AATGATAATG 


GGGACTACCA 


GTCTTCTATT 


GCCTTTCTAA 


AACAGACT/vG 


10020 




AGAATACAAA 


CTTCAGTTGG 


TTGGGACCTT 


TTAGATGACT 


TATTTACCCG 


10080 


TTGCTTTGAC 


CATTGCAGGG 


ACTGACCCTA 


GTGGTGGTGC 


TGGCATTATG 


CCAGATTTAA 


10140 


ACTCATTCCA 


AGCGAGAGAT 


CTCTATGGAA 


TGGCTGTTG7 


AACCAGTCTT 


CTCCCTCAAA 


1C2C0 


ATACCAGAGG 


TGTTCAGCTA 


ATCGAGCACG 


TTTCTCCTCA 


AATGTTGAAA 


GCCCAATTGG 


10260 


AGAGTGTCTT 


TTCTGATATT 


CCACCTCAGG 


C7CTAAAAAC 


TGCAATCTTG 


GCTACTACTG 


10320 


AAATCATCGA 


AATCATCCAA 


CCCTATCTTA 


AAAAACTGGA 


TTGTCCCTAT 


GTCCTTGATC 


10380 


CTGTTATGGT 


TGCTACAAGT 


GGAGATGCCT 


TGATTGACTC 


AAATGCTAGA 


GACTATCTCA 


10440 
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* * ^» a * viv 4 * \* 


TCAACACCAA 


ATCTGCTGAC 


TCGATGGCGA 


TATCTGTTCC 


AGCTCCCATA 


€960 


GCAATCCCCA 

VJ *^^\f*. A 




ACTAACGGCA 


GCAGCGTCAT 


TGATACCGTC 


C CCAAC AAAG 


7020 


WW 1 rtW * * 4 WW 


w * unw ivii i w 


CACTTTATGG 


ATTTCATGGG 




TGGCAAGACG 


7080 




WW 1 W i 1 WAA l 




TCTCTaATAG 
X W 1 ww*v*. A 


CACGCGCCAC 




7140 


l-W 1 Lwb 1 Ln 




t WL*wAWV%WWA 


ww a t i i l tin 


HCTGArTCAT 

UW I unw i. m 


WJW i *»wN-n A & irk 


7200 


GCATTTTCCT 


1 AUunn 




nVjUAAuLL I i 


lunl 1 tV«Ai i 


W 1 W/W\W/\UW A 


7260 


AAGAAwACAA 


CTGTCTTAGC 


TTCTTTTTCT 


AGTT C'TTCTA 


GTTTTATCTTG 


A 1 AAu 1 A 1 1 A 


r J«U 


6AAATATCCA 


m* ****** » /pf"»<» 

TCCCATCCAG 


CATTTTAuLA 


*********** • 


AAACTTGTTT 


I WW A 1 I 1 


' JOU 


******** f*f9*** H K R 

wGCCCTGAAA 


CACCTTT www 


uTuLAAuunL 


iuAAAnl 111 


wAAwAui 1 lu 


& & ftr*rrkkri* 

AAAW 1 LAnuL 


/ 4 4 U 


CCAGCTTCAC 


TCGCTCGCTT 


ft. ft f*>r+ ft. 1* ft /^/^/** 

AACGATAGwC 


TwAGwwAGTG 


GGTCTTGAGA 


AGCATCTTCC 


ten A 
'DUO 


AAGGAGG CTG 


CCAACCCAAA 


CACTTCTACT 


TwGTwGCCGA 


TGACATCTGT 


TAwwAw AGGT 


Teen 


TTCCCTTCCG 


TCAAAGTCCC 


GGTCTTATwA 


AAGACAAGGG 


TTTGAACTTT 


wTGCjATTTwC 


•t jC ^ A 


TGTAAGACAG 


TTCCATVl'TT 


GAGGAGAACC 


CCCATCTTGG 


CACTACGTCC 


TGTCCCCACC 


7680 


ATAAGGGCTG 


T CGGTGTTGC 


ft. & .r** ft ft a 

AAGTCCCAAG 


GCACAAGGAC 


AGGCGATAA * 


CAAAACCGCC 


7 74 0 


A CT C CGT AG A 


GAAGAGAGGA 


CACAAAu CTA 


GCTCwAAGCA 


CAAL LACAC * 


A 4 WWW 1 bA(j(. 


'DUU 


AAGACGAACC 


AAACCCAAAA 


GGTCATGATT 


CwTAAAATGA 


CAAwTAGTGG 


GACAAAAATw 


' ObO 


CCTGAAATCT 


TATCCGTCAA 


CTCCTGAATC 


GGCGCAwGAw 


TTGTCTCACC 


TTTCTTCACA 




AAATCCACAA 


n* ******* ft f^fT* X A 

TCTGAGCCAA 


AAC ALt i W IVT 


unbW w AALvr 


TT*rC**r*GG*TCT 


AAA W^ AW AAWvW 




CTTCCACTAT 


GATTGATGGT 


TGAGCCAATlj 


ACACTATCTC 


CAACTGTCTT 


W 1 WWAWAwww 


OU4 U 


AGACH.1CAC 


CiVitACCAT 


uuAI IWulWA 


n 1 /\W i nuAun 


CACCTTCTAC 


i AWWAWAWWA 






TCTTTTCACC 


WHA»AWW>WAW 1 


LVj/i/\ l^nuu 1 


WuWW 1 AWW i 1 


unLl Iwl I WW 


8 160 


AAAGGAACTT 


GGACATAACT 


ATCATCACTC 


AACACTTCTG 


CGCTTTTAGC 


TTGCAAGTCC 


8220 


A GT AATTTCT 


CCACAGCTTC 


GCACGTATTT 


TTTCTCATTT 


TTTCCTCAAA 


AACTGCTCCC 


8280 


AAAAGAACGA 


AAAAGAGGAT 


AAATCCAGCA 


CTTTCGAAGT 


AAACAGGGAG 


ACCAGCAAAG 


8340 


AGACCAACTA 


GGCTATAGAA 


ATAAGCCACT 


AGAGTTCCCA 


GCGCAACCAA 


GGTATCCATC 


8400 


TTGGCATTGT 


GCTTTTTAAA 


ACTGGCCCAA 


GCACTCTGGA 


TATATGGCTT 


ACCTGCAACT 


8460 


AACATAATAG 


GCCTTGTTGC 


TAGAAAGCTT 


CCCCAATGCA 


TGACTTGATG 


ACTAATGCTA 


8520 


CCTGTCAACA 


TCCCAATCAT 


GAGAATCACA 


AGAGGCACAG 


TAAAGATACT 


ACTAATCCAA 


8580 


AAACCTTG C A 


GGAGAGATAG 


AGATTTTCGA 


GTCTTCTCAA 


CGACTGTATA 


CCTTCCCTTT 


8640 
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GTTTGTCATA 


ACGTCTGGGT 


AG C AC ATGTT 


GAATGCTCCG 


TCACCCATGA 


TGTTCCATAC 


5160 


TTGGCGATCT 


GGATTGTCTT 


TCTTAGCAGC 


GATACCACCA 


GGAAGGGCAA 


TACCCATTGT 


5220 


CGCAAAGAGT 


GGAGATGTAC 


GCCACATGTT 


CTTAGGTGTC 


ATGTGAAGGT 


GACCAGTAGA 


5280 


TGTTTGAGTA 


GTCTTACCTA 


CGTCGATTGA 


GTAGATAGCG 


TCTTGATCAG 


CAl-CTTTCTT 


5340 


GATTGCATTG 


TAAACTTGAT 


AC AATTG C AA 


TTCACCCTCA 


GTTTTACCTT 


CCAGTTTGTT 


5400 


CATGTAATCA 


CGCCAGTTTT 


GGTTGTTCTT 


AACGTTTGCA 


CGCCACCATG 


GAGTTGATTC 


5460 


AACTGGGTTT 


ACTTTCTCAA 


GGATAGCTTT 


AGCTGCTTGA 


CCAGCATCAC 


CAAGGATTGA 


5520 


AGCCTCAAGG 


GCATGACGTT 


TACCAAGTTT 


GTAAGGGTCG 


ATATCGACTT 


GGATGAATTT 


5580 


TTCAGTGTTC 


TTGAATGCTT 


CGTAAACTTC 


ACCAAATGGG 


AAGTTTG AAC CAAGGAAAAG 


5640 


AACTGTGTCT 


GCTTCAAAGA 


CCACTTCCTT 


GGCTGGTTTC 


CAACCAACAC 


GGTAAGCAGA 


5700 


ACCTGTCAAA 


CCTTCATAGT 


TCCATTCGAA 


AGCTTCAAAG 


TTTTTACCAG 


TTGTGATGAT 


5760 


TGGTGCTTTG 


ATTTTACGTG 


ACAATTCAGT 


AATCACTTCA 


CCACCTTTAA 


CACCACCAAA 


5820 


TCCAGCATAG 


ATAACTGGGC 


GTTCAGCATT GTTCAAGATT TCAACAGCTT TCTCGATTTC 


S880 


AACTTCGTTC 


AAAGCAGGAG 


CGATGAATGA 


GCGTTCCTAT 


GAACCTGAAC 


CCTAGTATGA 


5940 


CTTTTCATCG 


ATTTCTTGGA 


AACCGAAGTT 


TACTGGAATT 


TCAACAACAG 


CTGGACCTTT 


6000 


TTTAGAAACT 


GCAGCACGGC 


AGGCTTCCTC 


AATTACTTTT 


GGCAATTGCT 


CAGCCTAAGC 


6060 


TACACCTTTG 


TTGTAAACAG 


CGATACCGTT 


GTACATTGGG 


TTTTGGTTAA 


GCTCTTGGAA 


6120 


AGCATCCATG 


TTCAATTCGT 


TAACTGGACG 


TGATCCAAGG 


ATCGCTAGCA 


ATGGAGTGTT 


6180 


ATCCATAGCT 


GCATCGTAAA 


CACCCTTAAT 


CAAGTGAGTC 


GCACCTCGAC 


CACCTGAACC 


6240 


AACTCCAACC 


CCGATTGAGC 


CGCCGAATTT 


ACCTTCCATA 


ACCCCTGCAA 


CAGCACCTGT 


6300 


CTCTTCGTGG 


CGAACTTGTA 


ACAAACGGAT 


ATCTTTGTCT 


TCAGCCAAAG 


CCTCCATCAA 


6360 


TCAGCTGAGT 


GTTCCTGATG 


G GAT ACCGT A 


GATTGTATCT 


ACGCCCCATG 


TTTTCAATAC 


6420 


CTTAAGCATT 


GCTGCAGATG 


CAGTAATTTT 


CCCTTGAGTC 


ATAATGATAA 


CTCTCCTTCA 


6480 


ATTTTTTTAA 


ACTTGGAGAA 


TACGATTACA 


TAGAATTGGA 


AACGTTCTCC 


AAATTTTTAC 


6540 


TATTCCACTG 


TATCATATTT 


ATGCTGACTT 


TTCTAAAAAT 


CTGCTCAAAA 


CTCTCTATTC 


6600 


TCTATTCTAA 


TACAGTTTTG 


AAAGTTCTGT 


CATTTCTGTT 


TTATAACAAA 


GAAATCTAGT 


6660 


CATTACTTTT 


AGTCTATTTT 


ACTAAAATTT 


AACAGAAGGG 


AACTGCTCAG 


AACAGATACA 


6720 


GAACTAAAGG 


CCATCGCTAG 


ACCTGCCAAT 


TCTGGGTTGA 


GAGCCAGTCC 


AACACCTGAA 


6780 


AAGACTCCTG 


CTGCAATCGG 


AATTCCGACA 


ACATTGTAGA 


TAAAAGCCCA 


GAAAACATTG 


6840 


AGTAGAATTC 


GATGAAAGGT 


TTTCTTACTC 


ATATCAAAGG 


CACCAACCAC 


TCCTAAAAGA 


6900 
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4 



GCTACAGACG 


CACGAGAATT 


TGTTACAATT 


GATGCCTTAT 


TTCAGGCTTA 


TAAAAAGGAG 


3420 


TTACGTCGTT 


CCTACACACG 


TGATGCCTTC 


TATCAACTGT 


TGAAGCGCCA 


TGGTTGGCGA 


3480 


AATATTACGC 


CACCTCCAGA 


ACATCCTAAG 


AAAGCAGACG 


CTCAAACCAT 


TGTTGCGTCT 


3540 


AAAAATAAAA 


TCTCAATCCA 


AGAAGGCAAG 


AAAGCGTTTT 


AAATATAGTA 


GACGTTTTCG 


3600 


TAAGGTTTGC 


TTGATGTACC 


AAGCTGAAGC 


TGGTTTCGGT 


AGAATCAGTA 


AACTGGGATC 


3660 


TTfiTTGCCCT 


CCAATAGGAG 


TAGGTCCACA 


TATCCATAGT 


CACTATATAC 


GAGAATTTCG 


3720 


Vttnl Iwl A#\ A 


GGAGCTGTTG 


ATGCCTATAC 


AGGCGAATCA 


TTTTTCTTAA 


TAGCTGGTAG 


3780 


A lu 1 nn 4 Aw 1 


GACTGGATGA 


ACGCCTTTTT 


AGAAGAGCTT 


TCACAAGCTT 


ATCCTTTTAC 


3840 


TLw 1 1 A 1\A«A 


C* A ATfl CT AT A 


TGGCATAAAT 


CAAGTACCTT 


AAAGATTCCC 


ACTAATATTG 


3900 


GTTTTGCATT 




TACACACCAG 


AGATGAACCC 


CATTCAACAA 


GTGTGGAAAG 


3960 


& f & *wrrT & & 

Aun 1 1 Wu i An 


ftWU lUUA AAA 


AAGAATAAAG 


CCTTTCGAAT 


TTTGGAAGAT 


GTCATGAATC 


4020 


AACTCCAAGA 


lu 1 I— /*» 1 AV» AA 


CCATTGGAGA 


AGGAGGTGAT 


AAAGTCCATC 


GTTAATCGGA 


4080 


GATGGACTAG 


AATGCTTTTT 


ftAAACCAGAT 


GAGTATTATA 


TCCAATTTCT 


TTATATAAAA 


4140 


AGACCGGATT 




TTCAATAGTT 


CATATTCTCA 


ATTTCT ATTT 


TAAAAATAGC 


4200 


TAAGG 1 J AAt 




TACGCGACCT 


ATTTCATACG 


ATAAAAATCA 


AGC ACT AG AC 


4260 


CAGCAGGTCu 


1 I \» AA w 1 AA 1 


AACCACTCTG 


TTCCCCAATC 


GGTTACAGTT 


GGTCCGTGTA 


4320 


AAAtL 111 A 1 


A V. V— A V— V* 


TTCAACCGTT 


TGTAGTTCTG 


GTCTACATCC 


TCAACCTCGA 


4380 


*** ft*TV* ft ft *P ft ft *P 
1 A luAft 1 AA 1 


f* 4TTPCTGAC 


TGAAAGTTTT 


CCAAAGGAAC 


CAAATGATTT 


TGTGACAACA 


4440 


TAAuGUAu 1 V* 


At 1 nttnn * 


GTAAACTGAG 


CAAAACCATC 


ATTAGCATAA 


TCTGCCTTTT 


4500 


1 A 1 /WtuAl 


ATCI^CCAAG 


TCAGCACAGA 


CTTGGGGAAC 


ATTTCAAACG 


ATAATATCTA 


4560 


All OA 111 AA 


ATTCATTTAC 


TCTCCTCCAT 


AAAAAGACCG 


GATTGCTCCG 


ATCTTTTAAA 


4620 


GTTCTGCTCT 


ATCAAAATCA 


AAGAATAAAG 




TCATATTTGA 


TTTTCGGCGA 


4680 


GAGGAATTAT 


TTAATTGCGC 


CTGATTGCAA 


TCCTTCTTCT 


TCCAACAAGA 


GACGGAATCG 


4740 


TACGAGTTCT 


TCTGCTTCGT 


ATTTTTCCTT 


GAACGCTTTG 


ATAGCTTCTT 


CTGAGTGAAG 


4800 


TTTTCGATCC 


AATTCAAGTA 


CTTCTACTGG 


AAGTGGACGG 


TCTTGAGTGA 


TGCGAGCATC 


4860 


GATCACAACA 


GTTTTACCTT 


CTTTGTTCAA 


TTTAACAGCT 


TCTGCAACAA 


CTCCATCGAT 


4920 


GTCTTCGATA 


CGCTCAACTG 


TGAATCCAAC 


AGCTCCTTGA 


GCTTCCGCAA 


TTTTAGCGTA 


4980 


GTCAGCGTTT 


GTGAAGTCTA 


CACCAAACAA 


GTGTTTGTTT 


GT AT CTTCGT 


ATTTGTTCTT 


5040 


GATGAAGCCG 


TACTCAGCAT 


TTGAGAAGAC 


AAGGTTGATA 


ACTGGAAGGT 


CGTATTGAAC 


5100 
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TGACCTTGAC 


CAGCCAAGAC 


ACTCAATAAG 


CTGTAGTCAG 


CTGTCTTTTC 


AAAGTCAACT 


1620 


TTTCCAGTAA 


TTTCCCACTT 


GTAAACTGCG 


AAGAAATCAT 


TACATACAAG 


GAGAGTGGAA 


1680 


CCCAAATCAT 


CTGCTTTAAC 


AGTTACAGGA 


CGGCTATTTG 


CTGGCTCACC 


AATGTTCAAG 


1740 


ACATCGATGG 


ATTTTTCAAG 


ATGAAGTTCA 


CGCAAGTTGC 


CTTTGTCATC 


CTTGCGGTCA 


1800 


AAGTCATACA 


CGCGATAGGT 


GGTATCGCTA 


GACTGCTGGG 


TTTCAAGCAT 


TAAGATACCC 


1860 


GCCCCGATAC 


CGTGCATAGT 


CCCGCTTGGT 


ACATAGAAGA 


AATCTCCACC 


CTTAACAGGC 


1920 


ACTTTGGTCA 


ACAAGTCATC 


CCACTTCTTG 


TCCTCGATTT 


CCTGGCGGAG 


TTCTTCTTTT 


1980 


GACTTGGCAT 


TGTGACCGTA 


GATAATCTCT 


GAACCTTCAT 


CCCCTGCGAT 


AATGTACCAG 


2040 


CATTCTGTTT 


TTCCGAGTTC 


GCCTTCATGC 


TCGAGTCCAT 


AAGCATCGTC 


TGGGTGAACT 


2100 


TGGACACTGA 


GCCAGTCGTT 


GGCATCGAGG 


ATCTTGGTCA 


AAAGTGGAAA 


TACAGGTTCT 


2160 


GGACGATTCC 


CAAATAATTC 


ACGGTGTTCC 


GCATACAAAG 


TAGCAACATC 


TGTTCCCTCG 


2220 


TAACGACCAT 


TGGCAACTTT 


AGACACTCCA 


TTTGGATGGG 


CTGAGATGGC 


CCAATATTCT 


2280 


CCGATTTTTT 


CACTTGGGAT 


GTCGTAGCCA 


AACTCATCAC 


GTAGCTTGGC 


TCCACCCCAC 


2340 


ATTTTTTCTT 


GCATAACTGA 


TTGTAAAAAT 


AATGGTTCTG 


ACATCTCGAT 


CTCCTGTCTG 


2400 


ATTTTTCTCC 


CCTCATTATA 


GCAAAAAAAG 


AGTTCGAATT 


GAACTCTTTT 


TTACATCTTA 


2460 


TAAAGCAGGG 


AGAAGATTTT 


ATAAAAATAC 


TAAACAAATG 


TGCTCTACCC 


GATGCTTGCA 


2520 


CCATTGCTAT 


AAATGACATC 


CTTGTACCAA 


TAGAAGGACT 


TCTTCTTGCT 


ACGTTTGAGA 


2580 


GCTCCGTTTC 


CTACATTATC 


TCGATCTACA 


TAGATAAAGC 


CATAGCGCTT 


ATTCATTTCC 


2640 


CCTGTGCCAG 


CTGAAACCGG 


ATCGATACAG 


CCCCAAGTCG 


TATAACCAAC 


CAACTCAACC 


2700 


CCGTCTTCGT 


AAATGGCATC 


TCGCATGGCC 


TTGATCTGGG 


CCTCTAAGTA 


AGTAATCCGA 


2760 


TAGTCATCTC 


CTACATAACC 


ATTCTCATCC 


GGTGTATCCA 


TAGCACCCAG 


TCCATTTTCT 


2820 


ACGATAATAC 


TAAACTAAAA 


TCAAAAAGCA 


TTATATAATA 


GTGATATGAA 


ATCAACTAAA 


2880 


GAAGAAATCC 


AAACCATCAA 


AACACTTTTA 


AAAGACTCTC 


GTACAGCTAA 


ATATCATAAA 


2940 


CGCCTTCAAA 


TCGTTCTATA 


GTAAAATGAA 


ATAAGAACAG 


TACAAATCGA 


TCAGGACAGT 


3000 


CAAATCGATT 


TCTAACAATG 


TTTTAGAAGT 


AGGGGTGTAC 


TATTCTACTT 


TCAATCTACT 


3060 


ATATTTCGTC 


TGATGCGCAA 


ATCTTATAAA 


GACATTATAG 


AACTTTTATA 


GTACTTTGAA 


3120 


ATAAGATGTG 


AACAACTCTA 


TCAGGAAAGT 


CAAATTAATT 


TATAGAAATA 


TTTTAGCAGC 


3180 


CAAGGTCTAC 


TGTTATAGAT 


TCAATACACT 


ATAGACTGTA 


ATCAAACAAC 


GATTTGCCGA 


3240 


AATGTAAAAA 


AATATGAGGA 


GTTCGG ACT C 


GACTCTCTCC 


TTCAAGAAAC 


ACGTGGTGGT 


3300 


CCTAACCATG 


CATATATGAC 


AGTTGAGGAA 


GAGAAAGCCT 


TTCTTGCCCG 


CCATTTGAAG 


3360 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GACAACTCAA CTAGCTTTTT CTTATTTTCA AAAAGGAGAT CAGA G TT T AA CTATGTCAGA 60 

AAAATCACAA TGGGGGTCGA AACTTGCTTT TATTCTAGCA TCTGCTGCCT CGCCATCGGG 120 

CTTGGTTCCG TTTGGAAGTT TCCCTACATG ACTGCTCCTA ATGGCGGTGG AGGCTTTTTA 180 

CTAATCTTTC TCATTTCCAC TATTTTAATC GGTTTCCCTC TCCTGCTGGC TGAGTTTGCC 240 

CTTGGCCGTA GTGCTGGCGT TTCCCCTATC AAAACCTTTG GAAAACTGGG CAAGAATAAC 300 

AAGTACAACT TTATCGGTTG GATTGGCGCC TTTCCCCTCT TTATCCTCTT ATCTTTTTAC 360 

AGTGTTATCG GAGGATGGAT TCTAGTCTAT CTAGGTATTG AGTTTGGGAA ATTGTTCCAA 420 

CTTGGTGGAA CGGGTGATTA TGCTCAGTTA TTTACTTCAA TCATTTCAAA TCCAGCCATT 480 

GCCCTAGGAG CTCAAGCGGC CTTTATCCTA TTGAATATCT TCATTCTATC ACGTGGGGTT 540 

CAAAAAGGGA TTGAAAGAGC TTCGAAAGTC ATCATCCCCC TGCTCTTTAT CGTCTTTGTT 600 

TTTATCATCG GTCGCTCTCT CAGTTTGCCA AATGCCATCG AAGGGGTTCT TTACTTCCTC 660 

AAACCAGACT TTTCAAAACT GACTAGCACT GGTCTCCTCT ATGCTCTGGG ACAATCTTTC 720 

TTTGCCCTCT CACTAGGGGT TACAGTCATG TTGACCTATC CTTCTTACTT AGACAAGAAA 780 

ACCAATCTAG TCCAGTCAGG AATCTCCATC GTAGCCATGA ATATCTCGAT ATCCATCATG 840 

GCAGCTCTAG CCATTTTCCA AGCTCGATCC CCCTTCAATA TCCACTCTGA AGCGGGACCC 900 

AGCCTCCTCT TTATCCTCTT GCCTCAACTC TTTGACAAGA TGCCTTTTGG AACCATTTTC 960 

TACCTCCTCT TCCTCTTGCT CTTCCTTTTT GCGACAGTCA CTTTTTCTGT CGTCATCCTG 1020 

CAAATCAATG TAGACAATAT CACCAACCAG GATAACAGCA AACGTGCCAA ATGGAGTGTT 1080 

ATTTTAGGAA TTTTGACCTT TGTCTTTGGC ATTCCTTCAC CCCTATCTTA CGGTGTCATG 1140 

CCGGATGTTC ACATTTTTGG TAACACCTTC TTTGACGCTA TGGACTTCTT GGTTTCCAAT 1200 

CTCCTCATGC C A TTT G GAGC TCTCT AC CTT TCACTTTTTA CAGGCTATAT CTTTAAAAAG 1260 

GCTCTTGCAA TGGAGGAACT CCATCTCGAT GAAAGAGCAT GGAAACAAGG ACTGTTCCAA 1320 

CTCTGGCTCT TCCTTCTTCG TTTCTTCGTT TCGTCATTCC AATCATCATC ATTGTGGTCT 1380 

TCATTGCC C A ATTTATGTAA TCAAAAAGGA CTTGAGTAGT GAACTCAGGC CCTTTCTTTT 1440 

TATGCATGGC TAACAATCAA TTCCAAACCT TGCCCTTCCA GAGTCCAAGC TTCAACATCA 1500 

CTTGGTAGGA TAAAGTGGCT GCCTTTTTGA ATTGGATAAT TTTTCCCCTC AACAGTTAGC 1560 



Diicrwin- *.\*tr~\ 
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AAGACGAAGC 


AAGTCCGAAA 


ACTATTTTGG 


GAATTGAACT 


AAOTCVIGAA 


CCGAAAAAAG 


3420 


ATTACCTAGT 


TGGTGATAGC 


TTAGACTTGT 


CTGAAGGACG 


CTTTGCAGTG 


GCTTATAGCA 


3480 


ATCACACCAT 


GGAAGAACAT 


TCCTTTACTG 


ATGAGGGAGT 


TGAAATTTCT 


GGTTACGATG 


3540 


CTCAAAAGAC 


TGGTCGTCAA 


ACCTTGACGC 


TTCATTACCA 


ACGCCATGAA 


CTTAGCTTTG 


3600 


ATGTTTTGGT 


ATCTCCAAAA 


GCAGCATTGA 


ACCATGAGTA 


CCTCAAACAA 


AAATTAGCAG 


3660 


AAGTTGAAGC 


TGCTAAGAAC 


AAGGTGGTCT 


ATAACTTTGC 


TTCATCAGAA 


GTAAAAGAAG 


3720 


CCTTCTTGAA 


AGCAATTGAA 


GCCCCCCAAC 


AAGTGTTGAA 


AGACCATGAA 


ACTAGCACCC 


3780 


AAGATCAAGT 


CAATGACCCA 


CTTAATAAAT 


TGACAGAAGC 


TCATAAAGCT 


CTGAATGGTC 


3840 


AAGAGAAATT 


TACGGAAGAA 


AAGACAGAGC 


TTGATCGCTT 


AACAGGTGAG 


CTTCAAGAAC 


3900 


TCTTGGCTGC 


CAAACCAAAC 


CATCCTTCAG 


GTTCTGCCCT 


AGCTCCGCTT 


CTTGAGAAAA 


3960 


ACAAGGCCTT 


GGTTGAAAAA 


GTAGATTTGA 


GTCCAGAAGA 


GCTTACAACA 


CCGAAACAGA 


4020 


GTCTAAAAGA 


TCTGGTTGCT 


TTATTGAAAG 


AAGACAAGCC 


AGCAGTCTTT 


TCTGATAGTA 


4080 


AAACAGGTGT 


TGAAGTACAC 


TTCTCAAATA 


AAGAGAAGAC 


TGTCATCAAG 


GGTTTGAAAG 


4X40 


TAGAGCCTGT 


TCAAGCAAGT 


CCTCAAGAGA 


AGAAATACTT 


TGCTGGAGAA 


GATGCTCATG 


4200 


TCTTTCAAAT 


AGAAGGTTTG 


GATGAAAAAG 


GTCAAGATCT 


TGATCTCTCT 


TATGCTTCTA 


4260 


TTGTGAAAAT 


CCCAATTGAA 


AAACATAAGA 


AAGTTAAGAA 


ACTATX"ITTC 


TTACCTGAAG 


4320 


GCAAAGAGGC 


AGTAGAATTG 


GCTTTTGAAC 


AAACGGATAG 


TCATGTTATC 


TTTACAGCAC 


4380 


CTCACTTTAC 


TCATTATGCC 


TTTGTTTATG 


AATCTGCTGA 


AAAACCACAA 


CCTGCTAAAC 


4440 


CAGCACCACA 


AAACACAGTC 


CTTCCAAAAC 


CTACTTATCA 


ACCGACTTCT 


GATCAACAAA 


4500 


AGGCTCCTAA 


ATTGGAAGTT 


CAAGAGGAAA 


AGCTTGCCTT 


TCATCGTCAA 


CAGCATGAAA 


4560 


ATACTGAGAT 


GCTAGTTGGG 


GAACAACGAG 


TCATCATACA 


GGGACGAGAT 


GGACTGTTAA 


4620 


GACATGTCTT 


TGAAGTTGAT 


GAAAACGGTC 


AGCGTCGTCT 


TCGTTCAACA 


GAAGTCATCC 


4680 


AAGAAGCGAT 


TCCAGAAATT 


GTTGAAATTG 


GAACAAAAGT 


AAAAACAGTA 


CCAGCAGTAG 


4740 


TAGCTACACA 


GGAAAAACCA 


GCTCAAAATA 


CAGCAGTTAA 


ATCAGAAGAA 


GCAAGCAAAC 


4800 


AATTGCCAAA 


TACAGGAACA 


GCTGATGCTA 


ATGAAGCCCT 


AATAGCAGGC 


TTACCCACCC 


4860 


TTGGTCTTGC 


TAGTTTAGCC 


TTGACCTTGA 


GACGGAAAAG 


AGAAGATAAA 


GATTAAATAT 


4920 


CGAAAAATCT 


TCTGAAATCT 


TTCCG 








4945 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25002 base pairs 

(B) TYPE: nucleic acid 
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TTCCAACATC GCCCTGGTCG CAGACTTCAA CACGGCAAAA ACTTCGTCCA GAATATGATT 1680 

TTACAGATCC CTATAATGCC GGAAATTCCC TTAAATTCTC TGGTGATGTA GCCGGTAAGA 174 0 

CAGATCAGGA TGTGAGACTT TATTCTACTA AGTTAGAAGT AACTGAGAAG ACCAAACTTC 1800 

GTGTTGCCCA CAAGCGAGGA AAAGGTTCTA AAGTTTATAT GGCATTCTCT ACAACTCCAG 1860 

ACTACAAATT CGATGATGCA GATGCATGGA AAGAGCTAAC CCTTTCTGAC AACTGGACAA 1920 

ATGAAGAATT TGATCTTAGC TCACTAGCGG GTAAAACCAT CTATGCAGTC AAACTATTTT 1980 

TCGAGCATGA AGGTGCTCTA AAAGATTATC AGTTTAACCT AGGACAATTA ACTATCTCGC 2040 

ACAATCACCA AGAGCCACAA TCGCCGACAA GCTTTTCTGT AGTGAAACAA TCTCTTAAAA 2100 

ATGCCCAAGA AGCGGAAGCA GTTGTGCAAT TTAAAGGCAA CAAGGATGCA GATTTCTATG 2160 

AAGTTTATGA AAAAGATGGA GACAGCTGGA AATTACTAAC TGGCTCATCT TCTACAACTA 2220 

TTTATCTACC AAAAGTTAGC CGCTCAGCAA GTGCTCAGGG TACAACTCAA GAACTGAACG 2280 

TTGTAGCAGT CGCTAAAAAT GGAGTTCGTT CAGAAGCTGC AACCACAACC TTTGATTGGG 2340 

GTATGACTGT AAAAGATACC AGCCTACCAA AACCACTAGC TGAAAATATC GTTCCAGGTG 24 00 

CAACAGTTAT TC AT ACT ACT TTCCCTAAGA CTCAAGGTGG AGAAGGTATT GAAGGTATGT 24 60 

TGAACGGTAC CATTACTAGC TTGTCAGATA AATGGTCTTC ACCTCAGTTG ACTGGTAGTG 2 5 20 

TGGATATTCG TTTGACCAAG CCACGTACCG TTGTTAGATG GGTCATGCAT CATGCACGAG 2580 

CTGGTCCTGA GTCTGTTAAC GATGCCTTGA TGAACACTAA AGACTTTGAC CTTTATTATA 2640 

AAGATGCAGA TCCTGAGTGC AAGCTAGCTA AGGAAGTCCG TGGTAACAAA GCACACCTCA 2700 

CAGATATCAC TCTTGATAAA CCAATCACTG CTCAAGACTG GCGCTTCAAT CTTGTCACTT 2760 

CTGACAATGG AACTCCATGG AAGGCTATTC CTATCTATAA CTGGAAAATG TATGAAAAGC 2820 

TTGATACTGA CAGTGTCAAT ATTCCGATGG CCAAGGCTGC AGCCCCTTCT CT AGGCAAT A 2880 

ACAAGGTACA AGTTCGCTTT GCAGATGTAC CCCCTGGAGC AACTATTACC GTTTATGATA 2940 

ATCCAAATTC TCAAACTCCG CTCGCAACCT TGAAG AG CG A AGTTGGAGGA GACCTAGCAA 3000 

GTGCACCATT GGATTTGACA AATCAATCTG GTCTTCTTTA TTATCGTACC CAGTTGCCAG 3060 

GCAAGGAAAT TAGTAATGTC CTAGCAGTTT CCGTTCCAAA AGATGACAGA AGAATCAAGT 3120 

CAGTCAGCCT AGAAACAGGA CCTAAGAAAA CAAGCTACGC CCAACCGCAG GATTTGGACC 3180 

TTAGAGGTGG TGTTCTTCGA GTTCAGTATG AACGAGGAAC TGAGGACGAA CTCATTCGCC 324 0 

TAACTCACGC AGGTGTATCA GTATCAGGTT TTGATACGCA TCATAAGGGA GAACAGAATC 3 300 

TTACTCTCCA ATATTTGGGA CAACCGGTAA ATGCTAATTT GTCAGTGACT GTCACTGGCC 3 360 
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(D) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CCTCGCTGAT GATTGGTCCT GTTTTATTTG CTCGTCCACC CTTGGCTGAA GAAACTGCAG 60 

TTCCTGAAAA TAGCGGAnCT AATACACAGC TTCTTTCAGG AGAGACTCAG CATTCGACCA 120 

ATGAAGCTGA TAAGCAGAAT GAACGGGAAC ATGCTAGAGA AAACAAGCTA GAAAAGGCAG 180 

AAGGAGTAGC GATAGCATCT GAAACTGCTT CGCCAGCAAG CAATGAACCT GCAACTACTG 240 

AAACTGCAGA AGCAGCTAGC GCAGCTAAAC CAGAGGAAAA AGCAAGTGAG GTGGTTGCAG 300 

AAACACCATC TGCAGAAGCA AAACCTAAGT CTGACAAGGA AACAGAAGCA AAGCCCGAAG 360 

CAACTAACCA AGGGGATGAG TCTAAACCAG CAGCAGAAGC TAATAAGACT GAAAAAGAAG 420 

TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA ACCAAAGGAA ATCAAATTTA 480 

ATTCTTGGGA AGAATTGTTA AAATGGGAAC CACCTGCTCG TGAAGATGAT GCTATTAACC 540 

GCGGATCTGT TGTCCTCGCT TCACGTCGGA CAGGTCATTT AGTCAATGAA AAACCTAGCA 600 

AGGAAGCAAA AGTTCAAGCC TTATCAAACA CCAATTCTAA AGCAAAAGAC CATGCTTCTG 660 

TTGGTGGAGA AGACTTCAAG GCCTATGCTT TTGACTATTG GCAATATCTA GATTCAATGG 720 

TCTTCTGGGA AGGTCTCGTA CCAACTCCTG ACGTTATTGA TGCAGGTCAC CGTAACGGGG 7 80 

TTCCTGTATA CGGTACACTC TTCTTCAACT GGTCTAATAC TATTGCAGAT CAAGAAAGAT 840 

TTGCTGAAGC TTTGAAGCAA GACGCAGATG GTACCTTCCC AATTGCCCGT AAATTGGTAG 900 

ACATGGCCAA GTATTATGGC TATGATGCCT ATTTCATCAA CCAAGAAACA ACTGGAGATT 960 

TCGTTAAACC TCTTGGAGAA AAGATGCGCC AGTTTATGCT CTATAGCAAG GAATATGCTG 1020 

CTAAGGTAAA CCATCCAATC AACTATTCTT GGTACGATGC CATCACCTAT AACTATGGAC 1080 

GTTATCATCA AGATGGTTTG GGAGAATACA ACTACCAATT CATGCAACCA GAAGCAGATA 1140 

AGGTTCCGGC AGATAACTTC TTTGCTAACT TTAACTGGGA TAAGGCTAAA AATGATTACA 1200 

CTATTGCAAC TGCCAACTGG ATTGGTCGTA ATCCTTATGA TGTATTTGCA CGTTTGGAAT 1260 

TGCAACAGCC TGGTTCCTAC AAGACAAAGG TTAAGTGGAA TGACATTTTA GACGAAAATG 1320 

GGAAATTGCG CCTTTCTCTT GGTTTATTTG CCCCAGATAC CATTACAAGT TTAGGAAAAA 1360 

CTGGTGAAGA TTATCATAAA AATGAAGATA TCTTCTTTAC AGGTTATCAA GGAGACCCTA 1440 

CTGGCCAAAA ACCAGGTGAC AAAGATTGGT ATGCTATTCC TAACCTAGTT GCGGACCGTA 1500 

CGCCAGCGGT AGCTAATACT TTTACTACTT CTTTTAATAC AGCTCATGGT AAAAAATGGT 1560 

TCGTAGATGG TAAGGTTTCT AAGGATTCTG AGTGGAATTA TCGTTCAGTA TCAGGTGTTC 1620 
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AATTGATTAG 


CAATAAAATC 


CAATGAATTT 


i i I IAuLaaA 


^.Ini l • tv\ l 1 




6120 


■** m ft ifi^ » gm * ft ff% 

TAATCATAAT 


ft ^+ ft ^ ft 0+ ft ^+ ft ft 

AGAGACACAA 


ATTCTOATTG 


TAALAni 111 






6180 


TGGCCTTACT 


CCTGGGCAAC 


TCATAC 1CAT 


Abn 1 luunU 1 






6240 


TAATTTCCCA 


AGATATTTTA 


AATACTCTCT 


TwAAATTuAC 


LLTUAAIVl A 


UALALAA 1 (.A 


£ inn 


ATTATACAAA 


TTAGGATACT 


TCACTAAAAA 


TAAGACTTTA 


H. ft* It WWII * 

TCATATCTTA 


CACTAGTAGA 


© J DU 


ATTAAAAACT 


ATATTATCTA 


AACATAATTT 


AGCTACTTCT 


GCAAAAAAAu 


LAbAA 11 AAT 


0 4 « U 


TACAAGAATA 


ATTAATAATG 


TTAACATTGA 


CAATTTAGAT 


ATTCCGTTCG 


AATTTAAACT 


c it on 
D4BU 


AACAAAAGAA 


GCACAAAATC 


TTATTATCGA 


ACATAGTGAC 


#V* ft fit ft ft ft. ft ^ 

TATATCAAAG 


CATACTATGA 


C Cil ft 
6540 


TAAAGACATA 


ACTATCGAAG 


ATTATTGTAA 


AGAAAAAAAC 


AATATCTCTT 


■MA ft ft ft ^^^^ ft\ ft 

TTAAAGCAAC 


C £ Aft 


TTTTGCTGAT 


ATAAAATGGA 


GTCTCTTAAA 


TAAACAAGCT 


CATAGGAATA 


CTGTATCAGG 


6660 


AGATTTTGGA 


TCCTTATCTA 


ACACACGAAA 


GGCTCAGGGA 


AGACATTTGG 


AACAAGAAGG 


6720 


TAATATTAAA 


CATGCTTTAA 


TATATTACAT 


AGAATCTTTG 


ATAATTACTA 


TTTCAGGATT 


6780 


AGAAAACAAT 


TTTTCAGCCA 


CTGATTATCC 


AGTATATTAT 


CCCGATTCGA 


TACCTGACTA 


6840 


CTCACTAAAA 


CATATTCAAA 


CATTAATGGA 


ATCATTATCT 


GATGACGATT 


ATGATTTTGC 


6900 


TTTTGATGAA 


GCATTATTTC 


GCTTCTCAAT 


TTTGAATGCA 


AATCATTTTT 


T ATCT AAGG A 


6960 


AGATATTGAC 


TATTTAAGAG 


TTAATTTACC 


TCGTTCCACT 


GCTGAAGAAA 


TAAACAATTA 


7020 


CTTAAAGAAA 


TATGAATGTT 


ATAGTCCTTT 


AAATAATTTA 


GAACTTGACG 


ATTTTGAATA 


7080 


AATTGACTAT 


ACAAACATTT 


ATATACTCGA 


TATAGTCTCA 


ATTTTATCTG 


ft ft nwv^^^^ 

ATGATTGCCC 


1 1 A ft 

7140 


AAATTTTTCA 


ATAATAAAAC 


GCATAATATT 


ft ft ft ^+ ft ft 

ATGGAGACAA 


TCCCCTATA . 


TATG CGTTCT 


T*> Aft 


TTTAATATCA 


AAGACTTTTT 


GACAAACTTC 


TTTGATATCT 


AATTACATGC 


CCCCTGCAGC 


7260 


AATCGAACCT 


GCAACTACTC 


CTTAGGAGGG 


AGTTGTTATA 


TCCATTGAAC 


TAAGGGAGCT 


7320 


AGATAAAAAC 


TCTGCTAAAT 


GAGCAGAGTT 


TTTTAGTCGA 


ATTAACGACC 


GATTTCTTTG 


7380 


ATACGAGCTG 


CTTTACCTTG 


AAGAGCACGC 


AAGTAGTACA 


ATTTCGCACG 


ACGTACTTTA 


7440 


CCGTAACGAA 


CAACTTCGAT 


TTTTTCAACA 


CGTGGAGTGT 


GGATTGGGAA 


GATACGCTCA 


7500 


ACACCTACAC 


CGTTAGAGAT 


TTTACGAACT 


GTGTAGTTTT 


CTGAGATTCC 


AGCACCTTTA 


7560 


CGTGCGATAA 


CAACACG 










7577 


(2) INFORMATION FOR SEQ ID NO: 47: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



Dwcnnnn- -wn 
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CATTACTCAG 


GAGATTATTC 


AAAATCCTGA 


GACAGAGATT 


CACCAAGTCC 


TCCTACAAGT 


4320 


GGCACCTGAT 


TTCCCAGAAA 


AGGTGGCCAG 


TCTCATTGAC 


CATACCTATC 


CTAATAAGCA 


4380 


GGTCCTGCAC 


CTCATTTCTA 


CTCAGATTGA 


CGCAGATCGC 


ATGGACTATC 


TCTTCCGCGA 


4440 


CTCCTATTTT 


ACAGGAGCAT 


CCTATGGGGA 


ATTTCACCTC 


ACTCGAATCC 


TCCGAGTCAT 


4S00 


TCCTCCTATC 


GAAAATGGTA 


TCGCCTTTCA 


CCGCAATGGC 


ATGCACGCCA 


TCGAAGACTA 


4S60 


CGTCCTCAGT 


CGCTACCAGA 


TGTACATGCA 


GGTTTATTTC 


CACCCCGCAA 


CACGCGCCAT 


4620 


GGAAGTTCTC 


CTACAGAATC 


TTCTCAAACG 


CGCCAAGGAA 


CTCTATCCTG 


AGGACAAGCA 


4660 


TTTCTTTGCC 


CGAACTTCTC 


CACACCTCCT 


GCCTTTCTTC 


GAAAAAAATG 


TGACCTTGAC 


4740 


TGACTATCTC 


GCTCTGGATG 


ATGGCGTGAT 


GAATACCTAC 


TTCCAGCTTT 


GGATGACCAG 


4800 


TCCTGACAAG 


ATTCTTGCAG 


ATTTATCGCA 


TCCCTTTGTC 


AACCGCAAGG 


TCTTTAAATC 


4860 


CATTACCTTT 


TCACAAGAGG 


ACCAACATCA 


ACTTACTAGC 


ATGAGAAAAT 


TGGTTGAGGA 


4920 


TATCGGCTTT 


GATCCCGACT 


ACTACACTGC 


CATTCATAAG 


AACTTPGACC 


TCCCTTATGA 


4980 


TATCTATCGT 


CCCGAATCTC 


AAAACCCACG 


GACAGAGATT 


GAGATTTTAC 


AAAAAAATGG 


S040 


AGAACTGGCC 


GAACTCTCTA 


GCCTGTCTCC 


TATCCTCCAA 


TCCCTTGCTG 


GCAGTCGCCA 


5100 


CGGAGATAAT 


CGCTTTTATT 


TTCCAAAAGA 


AATGTTGGAC 


CAAAACAGCA 


TCTTTGCAAG 


5160 


CATTACCCAG 


CAATTTTTAC 


ACTTGATTGA 


CAACGATCAT 


TTTACCCCAA 


ATAAAAACTA 


5220 


GAAGAGGAAA 


TTTATGAGTA 


TTAAACTAAT 


TGCCGTTGAT 


ATCGACGGAA 


CCCTTGTCAA 


5280 


CAGCCAAAAG 


GAAATCACTC 


CTGAACTTTT 


TTCTGCCATC 


CAAGATGCCA 


AAGAAGCTGG 


5340 


TGTCAAAGTC 


GTGATTGCAA 


CTGGCCGCCC 


TATCGCAGGC 


GTTCCCAAAC 


TTCTAGACGA 


5400 


CTTGCAGTTG 


AGAGACGAGG 


GGGACTATGT 


GCTAACCTTC 


AACGGTGCCC 


TTGTCCAAGA 


5460 


AACTGCTACA 


GGACATGACA 


TTATCAGCCA 


ATCCTTGACT 


TATGAGGATT 


ATCTAGATAT 


SS20 


GGAATTCCTC 


AGTCGCAAGC 


TCGGTGTCCA 


CATGCATGCC 


ATTACCAAGG 


ACCGTATCTA 


5580 


TACTGCAAAT 


CCCAATATCG 


GAAAATACAC 


TGTACACGAA 


TCAACCCTCC 


TCAGCATGCC 


5640 


TATCTTCTAC 


CGTACCCCTG 


AAGAAATGGC 


TGGCAAAGAA 


ATTGTTAAAT 


GTATGTTTAT 


5700 


CGATGAACCA 


GAAATTCTCG 


ATCCTGCGAT 


TGAAAAAATT 


CCAGCAGAAT 


TTTACGAGCG 


5760 


CTACTCCATC 


AACAAATCTG 


CTCCTTTCTA 


CCTCGAACTC 


CTTAAAAAGA 


ATGTAGACAA 


SB20 


GGCTTCAGCC 


ATTACTCACT 


TGGCTGAAAA 


ACTCGGATTG 


ACCAAAGATG 


AAACCATGGC 


5B80 


AATCGGTCAT 


GAAGAAAATG 


ACCGTGCCAT 


GCTCGAAGTC 


GTTGGAAACC 


CCCTTCTCAT 


5940 


GGAAAATGGA 


AATCCAGAAA 


TCAAAAAAAT 


CGCCAAATAC 


ATCACCAAAA 


CAAATGACGA 


6000 


ATCCGGCGTT 


GCCCATGCCA 


TCCGAACATG 


GGTACTGTAA 


AAGTATCATT 


TTTCAATAAG 


6060 
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CTCAACAGCT GGTACTTCTG TTCCAACTGC AAAGACAGGC ACATCAATCT GTTGTCCCAA 
CGTCTTAAGC TGGTCAATGG CAGCTGGACG ATAAATATCC GCCGCAATCA TCAAAGCACG 
AGCATTTTCT TCTTTCTTGA CTTT G TTGGC CAATTTACCA GCAAAGGTTG TTTTACCAGC 
CCCTTGTAAA CCAACCATCA TGATGATGCT TGGAATCTTA GGTGACTTGA TAATTTCTGC 
CCTATCAGAA CCTAAAACGG CTCTCAATTC CTCATCAACG ATTTTAATAA TCTGTTGCGC 
AGGATTAAGT GTATCAATGA CCTCATGCCC GACTGCACGC TCACGAACTT TCTTGATAAA 
GTCCTTTACA ACAGGCAAGG CAACGTCGGC CTCGAGCAAG GCCAAGCGAA T TT C TTTGCT 
TGCCTCTTGG ACATCAGATT CAGAGATTTT TCCTTTTTTA CGTAGATTTT TAAACACGTT 
CTGCAAACGT TCTGTTAAAC TTTCAAATGC CATTTTTCTT CCTCTTATTC TCTATTATCA 
ATG CTTGTTA AAATTTCTAT CTGCTCCTGC AGAAAGTCAT CCTTGGGATA GCGCTCCAAA 
ATCTGATCAA AAATCTGACT CCGGACAATA TAGTCCCAGT ACATGTGCAA TTTCATCTCA 
TAATCTTCCA GAATCTTTTC TGTTCGCTTG ATATTGTCAT AGACAGCCTG ACGACTGACA 
CCGAACTCCT CGGCAATTTC AGCAAGGCTG TAATCATCAG CGTAGTAGAG CTCGATATAA 
TTCATTTGCT TATCTCTCAA AAGCGCCGCA TAAAATTCAA AGACCGCATT CATACGATTG 
GTTTTTTCGA TTTCCATAAC TTTTATTATA CCAAAAATTA GCCTAATCTA CCACACTAGG 
AAGCCGATCC AAGAAGATAG ATAGCTAAAT TTGAAAAAGA CATGAGCCTA CCCCCAAGTA 
ATTTCCAATT GATAGCTGGC AAAGGGATGT CCCTCTTCAT TTTGTAGTTG ATAATCTAGT 
TCAATCTTTT GCCTATCAAC TTGATAATCG CTCGTTTCGA TGATAAACTC CTCCATCCCC 
ATACGTGTAG GAATATAGCC TAAACTATCG CTATCCTTTA CAAAGCCCAT AATGGTCTTG 
GGATTAGAAA ATCGCCTCA? CACAAGTTCT TGACCATGAA ATTTAATCAC T A C TTTTTCC 
TTTTCCTCAT TATAGAAAAG CAGGTAGCTA TAATCTCCTT TTTCATGCAC TTCCACATCA 
TAAAGCTGGT CAATCACTTC CAACTGCTCA TCAAACTCAA TCGTATTTCG CATCCGAATC 
TTCACATCAG GCCCTCTTTC TTGTCTCTTG TCCTACTATT TTACCAAAAA GAGCAGGATT 
TTGCTATAAT GGTCATATGA ACGAAAAAGT ATTCCCTGAC CCTGTTCACA ACTACATCCA 
TGTCAATAAT CAAATCATCT ATG A CTTG AT TAATACAAAA GAATTTCAGC CTTTGCGCCG 
CATCAAACAA CTGGGAACTT CCACTTATAC CTTCCACGGT GGAGAACACA GTCGC TTCTC 
TCACTGTCTA GGAGTCTATC AAATTGCACG ACGCATCACA GAGATTTTCG AACAAAAATA 
TCCTGAGGAA TGGAATCCTG CCGAGTCTCT CTTCACCATG ACCCCTCCTC TCCTACACGA 
CCTTGGGCAT GGTGCCTACT CCCATACTTT TGAACATCTC TTTGATACAG ACCATGAAGC 



2S80 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 
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CTTCCCAATC 


GCTATCGTAA 


CAATTTCTGA 


ACATATCGGA 


GACCATACTG 


TTTTGGGTCA 


780 


AATCTCTCCT 


CGTCAATTCT 


TAAAAGAACC 


AGGTCTTCAC 


CGTACTCTTC 


TTGGTGACGG 


840 


TATCGCAACT 


TCTGTTTCTG 


CCTTCCTTGG 


TGGACCAGCC 


AATACAACTT 


ACGGAGAAAA 


900 


TACAGGCGTT 


ATCGGTATGA 


CTCGTATCGC 


TTCTGTCTCA 


GTTATCCGTA 


ACGCTGCCTT 


960 


CATCGCCATT 


GCCCTCAGCT 


TCCTTGGTAA 


ATTCACTGCC 


TTCATTTCAA 


CTATTCCAAA 


1020 


CGCTGTACTT 


GGTGGTATGT 


CAATCCTTCT 


CTATGGGGTT 


ATCGCCACCA 


ATGGTTTGAA 


1080 


AGTCTTGATT 


AAAGAACGTG 


TTGATTTCGC 


TCAAATGCGA 


AACCTCATCA 


TCGCAAGTGC 


1140 


TATGTTGCTT 


CTTGGACTTG 


GAGGAGCTAT 


CCTTAAACTT 


GGTCCAGTTA 


CACTTTCAGG 


1200 


TACTGCCCTT 


TCAGCCATGA 


CAGGAATCAT 


CTTGAACTTG 


ATCTTGCCAT 


ACGAAAATAA 


1260 


AGACTAAGAG 


TCTAAATACA 


CCTAATCCAC 


TCAGACAGCT 


GAGTGGATTT 


TTCGTATACC 


1320 


ATAATAAAAG 


TGTCTTAACA 


AAATTATTAA 


AATCAAAAAA 


CGTATAATAT 


CACATATTCT 


1380 


AAAACCTTGA 


TACTGTACGT 


TTTATCATAG 


AAATTTTTAC 


TTTATTTTCT 


CATCAAATGA 


1440 


GATTTCCATC 


AATCTCTTGT 


CTTACTTGCG 


TTTCTTCTTC 


GCTTTCTTCA 


TTTTGTTAGC 


1500 


CATACGTTTC 


ATGGACTGTT 


TCATCGCAAA 


TTCACCAATT 


TTACCTTTCA 


AACCGCCACC 


1560 


AAACATCTGC 


CTCATATCTG 


GCATTCCTCC 


TCCTCCGAGA 


GCTGATAAGT 


CAGCCATACC 


1620 


CCCTTGTCCC 


ATCATTCCTT 


CAAGGGCAGA 


CATATCCATT 


CCTCCCATAT 


TTGGCATATT 


1680 


TTTAGGAAGG 


TTATTTGGAT 


TAATCCCCAT 


TTCCTTCATC 


ATTTT ATTC A 


TATCCCCACA 


1740 


CATAACACCC 


TGCATGAGCT 


GTT T AGCCTG 


CTTAAACTCC 


TTGATGAATT 


TATTGACTTC 


1800 


GACGAATCTA 


TTTCCAGAAC 


CAGCAGCAAT 


ACGACGGCCA 


CGGCTTGGAT 


TTAACAAATC 


I860 


TGGGTTTTCA 


CCCTCTTCAG 


CTCTCATCGA 


AGACACAATG 


GCACGTTTAC 


GAGCAATCTG 


1920 


GCGTTCATCC 


ACCTTCATGT 


TTTGAAGGGC 


TGGATTCTTG 


GCCATACCTG 


GAATCATCTT 


1980 


GAGCAAGTCT 


TCCATCGGCC 


CCATATTTTG 


CACCTGATCT 


AATTGATCGA 


TGAAATCATT 


2040 


AAAATCAAAG 


GTGTTTTCGC 


GCATCTTCTC 


ACCCATTTCA 


AGGGCTTTTT 


GTTC ATCC r A 


2100 


TTCCTGAGAA 


GCTTTCTCAA 


TCAAAGTGAG 


CATATCCCCC 


ATACCAAGGA 


TACCCCTAGA 


2160 


CATGCGCTCT 


GCGTGGAAGC 


TTTCAATGTC 


CCTAATCTTT 


TCACCTGTAC 


CAGTGAACTT 


2220 


GATTGGTTTT 


CCAGTAATGT 


GACGAACAGA 


CAGAGCAGCA 


CCACCACGAG 


TATCGCCATC 


2280 


AATCTTGCTA 


AGGATGACCC 


CAGTCACTTC 


CAACTGAGCA 


TTAAACTCAC 


GCGCAACATT 


2340 


GGCTGCTTCC 


TGACCAATCA 


TAGCATCAAC 


GACAAGCAAG 


ATTTCATTTG 


GTTGAGCCAA 


2400 


TGCTTTCACA 


TCACGAAGCT 


CATTCATGAG 


GAGCTCATCA 


ATCTGCAAAC 


GACCCGCACT 


2460 


ATCAATCAAG 


ACATAGTCGT 


TATGATTAGT 


TTGGGCTTCC 


TCCAAACCTT 


GACGTACAAT 


2520 
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TTCTTCTCAC GAAGCTCTGT TACGATTGCC CCTGTTCCTG CACGACCCTC ATACCTGACA 10800 

GCAAAAGGGT CAATCTTCAT CAGTTGATGA CCATTTTGAC GTGTGACATG ATATTTGTAA 10860 

ATATGCCCTT CTTGAGCCAT ATTGGTAAAG ACTTCCCAGA CCCCAAAATC ATTTCTTACC 10920 

ATTCCAATCT GATTTTCAAT CCAGTTGGTA AAATCACCAA CCAACTGAAC AGCCTGAGCA 10980 

TTAGGTGCCC AAACACGGAA GGTATAGCCA TGCTCTCCAT TTAGTTCTTC CCTATGTGCT 11040 

CCTAGATAAT GTTGGAGATA AAAATTTTCA CCCGTCATAA AGGTTTTTAA TGCTTCTCTA 11100 

TTATCCATAT ACTCCCCTTC TCCTCTAAGC CTTTTCTATG TTTTTATTAT ACTACCTTTT 11160 

TAGAGAAGAT TCAAGTAAAT TACTATACTT CTTTAATTAT TTTGAAAATC TACAACAAGT 11220 

TCACTTACTC GTTCAATTGT AAATCAATAT TTTTTCAAAA AATTGCGAAA ACGCCTTTCT 11280 

TTTTCTACTA TAGTGAAATG AAATAAAACA TGCGCAAATC GATTAAGGAA TTTAATCTAA 11340 

TTTCTAACAA TCTCTTAGAA ATCAAAGTGT ACTATTTTAA CTCC 11384 
(2) INFORMATION FOR SEQ ID NO: 46: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7577 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNES5 : double 
( D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



TCTTGATTTG 


TTACTAGACG 


TTGACCAACG 


TCCTTCCCCT 


CCAAAAGGAA 


TTCTCCTTAG 


60 


TTTCCAACAC 


GTTTTCGCCA 


TGTTTGGTGC 


G AC C ATCTTG 


GTACCATTGA 


TTTTGGGAAT 


120 


GCCTGTATCT 


GTTGCCCTTT 


TTGCTTCAGG 


TGTTGCAACA 


CTCATCTACA 


TGATTGCTAC 


180 


TGGTTTTAAA 


GTTCCAGTTT 


ATCTAGGTTC 


TTCATTTGCC 


TTTATCACAG 


CTATGTCACT 


240 


GGCTATGAAA 


GAAATCGGGG 


CGGATCTATC 


TGCTGCCCAA 


ACAGGGCTTA 


TCTTGACTGG 


300 


TTTGGTCTAT 


GTCCTTGTTG 


CTACCAGCAT 


CCGATTTGTA 


GGAACAAAAT 


GGATTGATAA 


3 60 


ACTCTTGCCA 


CCAATCATTA 


TCGGTCCTAT 


GATCATCGTT 


ATCGGTCTTG 


GACTTGCAGG 


420 


TTCAGCTGTT 


ACCAATGCAG 


GTCTTCTAGC 


AGACGGAAAT 


TGGAAAAATG 


CTCTGGTAGC 


480 


CGTTGTTACT 


TTCCTAATTG 


CTGCCTTTAT 


CAATACAAAA 


GGAAAAGGCT 


TCCTACGAAT 


540 


CATTCCATTC 


CTCTTTGCCA 


TTATCGGTGG 


TTACCTTTTC 


GCACTAACTC 


TTGGCTTGGT 


600 


TGACTTTACA 


CCAGTTCTTA 


AAGCCAACTG 


CTTCGAAATT 


CCTGGTTTCT 


ACTTGCCATT 


660 


TAGCACAGGT 


GGTGCCTTTA 


AAGACTACAA 


TCTTTACTTT 


GGTCC AG AAG 


CCATCGCTAT 


720 
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ACACCTCAAT 


TAATACCCTC 


TAGTCCCCAG 


CTTGAACCAT 


TCCCAATATG 


GTTGTTGAGA 


9000 


GCAACTGGTT 


GATACTGTGT 


AACGACCCCA 


ACATTGTGAA 


TCCCTGACTT 


GGCACACTTT 


9060 


GATAGGGCAA 


AGTCAATGAT 


ACGGTAGCGC 


CCACCAAATT 


GCACAGCTGG 


TTTTGCGATC 


9120 


CTTTGAGTGA 


GTTTACCGAG 


ACGAGTTCCT 


TGCCCACCAG 


CAAGAATCAA 


AGCTAACATT 


9180 


TCATTTTTCA 


TTTTCTACTC 


CTTTTTGGTT 


TTTATTTCTG 


ACGGTTTTAG 


TAGATTTCAA 


9240 


GCGACGTTTG 


ATTTTCCATA 


CACTTGCTCC 


CATAGCCGGT 


AGGGTAAAGG 


TTAAGGTCTG 


9300 


CTCATAATCT 


TTCCATAGTC 


CTTCTTGCGT 


TTCAACAGTT 


TGATTATGTT 


CTTTCCAAAC 


9360 


CCCTCCCCAC 


TCTTCCAACT 


CAGTATTCCA TACTTCTTCG 


TAAATTCCTG 


CAACGGGTAG 


9420 


TCCGATTGTA 


AAATCTTTCC 


GCTCAACAGG 


TACCATATTA 


AAGATACAGA 


CTAACATTTC 


9480 


TCCCTTTTTA 


CCCTTACGAA 


TAAAGGAAAG 


AACACTCTGG 


TCTCGATTAT 


CCGCATCAAT 


9540 


GATTTCAATA 


CCATCATAGC 


TGGTATCAAT 


TTCCCACAGA 


CAGCGATGAT 


CTTTGTAAAA 


9600 


CTGGTTTAGC 


TGAGAAGCGA 


AATACTTCAT 


CTTAGCATTC 


ATTGCGTCTT 


CTAGGTTAGA 


9660 


CCATTCCAAC 


TGTTCTTCAG 


ATTTCCATTC 


TAGGAATTGA 


CCGTATTCGC 


TACCCATGAA 


9720 


GAGCAATTTC 


TTACCAGCGT 


GACAAATTTG 


GTACGTATAG 


AGATTGCCCA 


AGCCTGCGAA 


9780 


TTGATTGTAA 


CCATCTCCCC 


ACATCTTATG 


CATCATACTC 


TTCTTGCCAT 


GAACCACTTC 


9840 


ATCGTGCGAG 


AATGGCAAGA 


C AT AATTCTC 


CTTGAAAACA 


TACATAAAGC 


TGAAAGTCAC 


9900 


CAGGTTAAAG 


TCATATTTAC 


GATAGATCGG 


ATCTTCTTCG 


TAGAAACGGA 


GGATATCATT 


9960 


CATCCAGCCC 


ATGTTCCATT 


TGTAGTCAAA 


TCCTAGACCA 


CCAATCTCTT 


TCATTCCCCT 


10020 


AATCTTGATC 


GCAGACGAAC 


TTTCTTCTGC 


AATCATCATC 


ACATCTGGAT 


ATTCTAACTT 


10080 


AATAACCTCA 


TTCAAGCGCT 


GAAGGAAATA 


ATAACCTTCA 


TAGTTGAGAT 


TTCCGCCATC 


10140 


TTTATTAGGT 


GTCCATCCAG 


CATCATCATA 


GTCCAAATAG 


AGCATGTTGC 


TAACAGCATC 


10200 


CACACGAATA 


CCATCCAAAT 


GATAGACATC 


AATCCAATGC 


TTAATGCAAG 


AAATTAAGAA 


10260 


GGACTGCACT 


TCATTTTTTC 


CAAGGTCAAA 


ATTAAGGGCA 


CCCCAACCAT 


GGTTATGAGC 


10320 


CTTATTATGC 


TCTTGGTATT 


CAAAAGTCGG 


TGTCCCATCA 


TAATAGGCTA 


AGGCATCATC 


10380 


GTTGATGGTA 


AAGTGACTGG 


TACCCAGTCC 


ACAATAACCC 


CAATATTATG 


GGTATGACAC 


10440 


TCCTCGACAA 


AATCTTGAAA 


CTCCTCTCGT 


CCCCCATAAG 


CATGCTCTAA 


AGCGAAGTAA 


10500 


CCCATAAGCT 


GATACCCCCA 


ACTCAAGCCC 


AAAGGATGGG 


ACATCAAGGG 


CATAAACTCA 


10560 


ATATGAGTAT 


AGTTCATTTC 


AACGAGATAA 


GGAATGAGTT 


CATCCTTGAG 


CTGGGCAAAA 


10620 


CTATAAGGAC 


TGCCATCAGA 


ATTTCTTTTC 


CATGATCCAG 


CGTGAACTTC 


ATAAATATTG 


10680 


ACAGGACGCT 


CTTCAAAGCC 


CCAACGTTTT 


CTTCGTGCCA 


GCCAAAGTCC 


AT CCTTCC AT 


10740 
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A 



TAGCATAGTA 


AGTTGGCTCT 




rfTTTfiTlT A 


AATCTTTTGG 

/W> A V» 4 A A • 


TTTGGTGAGA 


7260 


AAAGAGAATA 


GAATTTTTGT 


GATTCAAGwa 




PfinTGATAA 

UU^ 1 A A An 


TAAGATTTAA 


7320 


CAGAGTGAAT 


ATTGGCTAGA 


TAGCCCGTGT 


AW I W^* 1 ftwW. 




A W A A A A 


7380 


CCAAATCCCG 


TAAAACATAG 


CGCAATTTCT 




4 1 1 1 A 1 /\V*W A 


TPTTCTTCCA 

A \. 4> • A A W W 


7440 


AGTGTTCAAT 


CAACCAAGGT 


GTATCAACGA 


CAAAGATATw 


*pr**r & f* & T a 

lul nunWt 1 A 


1 i*»»/V*W\J* * 1 


7500 


CAGCTGTTGA 


CTTGCT ATC A 


AAGAGTTTAT 


GAGAAAGAAC 


ATCCTCTt»TT 


1 {.A Iv. 1 ntA 1 


/ 3DU 


CCAAGATTGC 


ATTTACTTCT 


GAAATATCTT 


TCTTAGCTAG 


TTTTTTATAA 


AW a ALnu a V*A 


'OaU 


TAGGCTCTTT 


TGI TGT ACTA 


TGTAGGTGGA 


AAACTTGGTT 


CAAATCAATO 


1 i AA a AAUAA 


/ oou 


CATCGCAGTT 


GAGGGCAACC 


GTTTGGTTTG 


AGCCAGAACG 


TTTCAAATAA 


GTAACAAUCT 


«n«rt 
/ /*»u 


CTTGGTAGTA 


TTCTTTTCCA 


ACTGTACTAC 


TTTCTACACG 


GGTATTGTAA 


ATTCCTAGAT 


78U0 


AGTAATGGCT 


AAGAAGGGTT 


GATAAGCCCC 


ACTCGCGTCC 


TG AACGAATA 


TGUlCAAATA 


7obU 


CTGAGCTGAT 


ATTATCCTGC 


TGGAAAATAC 


CAAACACACT 


ACGAACACCT 


GCATTAGCAA 


7920 


GCCTTGAAAG 


TGGGAAGTCA 


ATCAAACGAT 


ATTTCCCACC 


AAATGGCAAA 


CTTGCT ACTG 


7980 


GACGGTGGTC 


CGTCAATGTC 


GACATATTGT 


GAAAACCAAC 


TGTATTTCCT 


AAAATGG C AG 


8040 


AATATTTATC 


AATCTTCATC 


TGTTGCTACC 


CCCACTACTT 


CATTATATCC 


TACAACnGT 


8 100 


ACTTCATCTG 


TTCCATCAAT 


TTCGACACCG 


TCAGAAATAA 


TCCCACCTTC 


ACCAATAATG 


8160 


CCACGTTTAA 


TCTTAGCTCC 


TTGACCAATG 


ATAGCTCCAC 


TCATGATAAC 


X » rp/"» X 

TOAATWAAUU 




ACTTCCGCTC 


CTTCGCGAAC 


TTGCGCGCCT 


GTTGAAAGGA 


TAGAATG 1 i A 


AACAGTTLwA 


Q^an 


TCAACGAAAC 


ATCCGTCTAC 


AACTAATGAG 


TCTTCCACAT 


GAGCATTTGC 


tLLuAlAiAAu 


a J4U 


TTTGGTGGTG 


AAATCAAGTT 


TCTTGAGTAA 


ATCT 7C ATT 


CACGGTTALti 


AW 1 A 1 W Uv\v 


p Ann 


CCATTTTCTG 


GAGAAATATA 


CTCCATGTTC 


GwTTCwwAAA 


V» 1 uAL 1 W/*A J 




8460 


TCTTTCCAAT 


AACCACTAAA 


TTCGTAAGCA 


TAAACACTTT 


CACCTGACTC 


AAGGTAATTT 


8520 


GGAATGACAT 


TTTTACCAAA 


GTCTGACATG 


CCAACCTTGC 


TCTTTTCAGC 


AGCGACTAAC 


8S80 


ATATTACGAA 


GGCCTTGCCA 


ATCAAAAATG 


TAGATTCCCA 


TAGAAGCTTT 


TGTAGATTTA 


8640 


GCTTGAGCTG 


GTTTTTCTTC 


AAATTCAACA 


ATACGATTGT 

n 


TAGCATCTGT 


GTTCATGATA 


8700 


CCAAAACGGC 


TTGCTTCTTT 


AAGAGGGACG 


TCTAAAACTG 


CTACTGTCAA 


GCTGGCATTA 


8760 


TTATCCTTAT 


GAGACTGGAG 


CATATCATCA 


TAGTCCATTT 


TGTAGATGTG 


ATCCCCAGAC 


8820 


AAAATCAAGA 


CATACTCAGG 


ATTGACACTG 


TCGATATAGT 


CGATATTTTG 


CTAAATAGCG 


8880 


TGACTAGTCC 


CCTCAAACCA 


ACGATTTCCT 


TCACTTCCAG 


AATAAGGTTG 


AAGAATAGAG 


8940 
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CCAAAACACA 


CAAGCCTTTT 


ACTTGAGACA 


TCAGTTCTCC 


TCTCTAAACA 


CCCTAAAAAT 


5460 


CGTATGAACT 


CATCATACGA 


TTTTATCTAT 


TAATTAACTA 


AACTATGGTA 


CAAGTCAAGG 


5520 


TATGACTTGC 


AGGCTGTATC 


CCATGAGAAG 


TCACTCTCCA 


TACCTTGTTT 


TTGTAGGTTT 


5580 


CTCCAAATGT 


CTGGATGGTT 


TCTATACAAG 


TCCAATGCTG 


TTTGGAAAGT 


CCAATTTAAC 


5640 


CAATAAGGAG 


ATAGATTCTC 


AAAGCTAAAG 


CCAGTACCGC 


TTCCTTCGAT 


TGGATTGAAA 


5700 


GCGCGAACTG 


TATCTCGCAA 


GCCTCCAACT 


TCATGGACCA 


ATGGCAAGGT 


TCCATAACGC 


5760 


ATAGCCATCA 


TTTGAGACAA 


GCCACACGGT 


TCAAAACGAC 


TTGGCATGAG 


GAAGAGGTCA 


5820 


CAAGCAGCGT 


AGATTTCCTG 


AGCAAGTTTG 


ACATCAAAAG 


TGATATTTGT 


TGATAGCTTC 


5880 


TCTGGGTAAA 


TCTGAGCAAA 


CCATGAGAAA 


CCrCCTTCAA 


ACGCTGGATC 


GCCACTTCCC 


5940 


AAAAGAACAA 


TCTGAACATC 


TTCTTCCAAG 


ATATGGTGAA 


GACTTTCGAC 


CACCACATCA 


6000 


AAACCTTTTT 


GACCTGTCAA 


ACGAGAAACA 


ATTCCCACCA 


GTGGAACGTC 


TGCTCTAACA 


6060 


GGCAAGCCAA 


CTCTTTCTTG 


CAATTTTGCC 


TTATTTTTGG 


CTTTCCCAGA 


CAAATCTTCC 


6120 


TGATTGAAAT 


GATAGTCTAA 


AAGAGCATCC 


CTCTGAGGAT 


TATAAAGATC 


AGCATCAATC 


6180 


CCATTCACGA 


TACCAGATAC 


TTTACCAGAC 


TCCATTTTAA 


GAATCTGATC 


CAAATTACAT 


6240 


CCAAACTCAC 


TACTCATAAT 


TTCATGAGCA 


TAGCTAGGTG 


AAACGGTTGA 


AACACGGTTC 


6300 


G CAT AG AG AA 


T AC CTGCCTT 


CATCCAGTTC 


AGACAGTTCT 


TCCATCGAAG 


CGTGCCATCA 


6360 


GCGTAACGTT 


CAAAGCCAAC 


TCCAAACAAA 


TCACCCAACA 


TTCCTTCTGA 


AAATTCTCCT 


6420 


TCCAATTCTA 


AATTATGAAT 


GGTTAAAACT 


GTTTCAATGT 


CCTCATAGCC 


TTGAATCCAA 


6480 


CGGTATTTTT 


CCTTCAACAA 


GAAAGGAATC 


ATACCTGTAT 


GGTAGTCATG 


AACATGGAGA 


6540 


AGATCAGGAA 


TAAACTCAAT 


CCTTTCCATA 


GCCTCAATGG 


CAGCCAGTTG 


GAAAAACGCA 


6600 


AAGCGTTCTC 


CCTCATCAAA 


ATCACCGTAA 


ACATCACCAC 


GGAACAAATA 


ATATTGATTG 


6660 


TCAATAAAGT 


AGAAGGTTAC 


ACCATTTAAT 


ACTGTTTTCT 


TAATTCCACA 


ATACTGTCTG 


6720 


CGCCAACCAA 


CCCTCACCTC 


AAAATGAAGC 


ACATCTTCAA 


TCTGATTTCC 


AAATTTAGCC 


6780 


TCTACCATAT 


CATAGTAGGG 


TAAAATCACT 


GCAACTTCCT 


GCCCACCTTT 


TACCAGTGAT 


i.840 


TTTGGAACAG 


CGCCAATGAC 


GTCTCCCAAA 


CCACCTCTTT 


TTGAAAAGGG 


TGCACCCTCT 


6900 


GCTGCTACAA 


ATAAAATTTT 


CATGAATGAA 


TATCCTCTGT 


TACTTTACCA 


CCTTTCTTAA 


6960 


CCACAACTGG 


ATGTTCTGCA 


GTTCCTCGAA 


TCACAACACC 


ATGCTCAACT 


TCAACCCCTT 


7020 


TGTCCAAGAT 


AGCATATTCG 


ACCTGAGCCC 


CTTCTCCAAT 


AACAACACGA 


GGGAAGAGCA 


7080 


GGCTATCTTT 


AACCAAGCTA 


TCCTTATGGA 


CATGAATATT 


ACGTGATAGA 


ACAGAATTAG 


7140 


CTACTTGACC 


TTCAATAATA 


CTACCAGAGG 


CAAACTGACA 


AGTGCTTACC 


TTAGATGTAT 


7200 
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GCTAAATCTT GCACATATTT AATAAGTGCC CAATATTGGC AGCCCTGCGC TCCAGATAGA 
AACTGGCATT TTTCAAACTA TCTTCTAAAG GTTCACTTTT CTCC AAAAT A GAAAAGACAG 
CTTGGATATT TTCAAATGCT AGGGGAGGTA AATCTTCAGC AAGACTACCG CAAATAGCAA 
TAACAGGAAC TCCAACAGGG GTTCTTTTTG CAACACCTAT AGGCGCTTTC CCACCAAAGC 
TTTGACTATC AAGTCTTCCT TCTCCAACAA CAACCAAGTC AGCATCTGAA ACTTTCTTAT 
CAAAGTTGAT TAAGTCCAAG CAGGTATCAA TTCCACACAC GATACTTGCC TCAGCAAAGG 
CACACAAACC ACCACCAAGG CCTCCACCTG CTCCTCCTCC TTTAATTTCT AATGTTGCAG 
GTGAGAATTT TTCATAAAAA TCTTGGATCG CCTCATCTAC GACTGCAAAC ATACTCGGAT 
GTAGACCTTT TTGATTGCCA AAAGTGTAAG TCGCACCTTG ATGACCACAT AAGGGACTCA 
CGACATCTGC TAAAATATGA ATTTGAACAC CTTCAGGAAT TTTATAGCAA TTTTCTGTTG 
AAACAGAAGC TAAGTTTAAT AAGGATTGAC CGGAAGCAGG CAAGACATTT CCATCCCTAT 
CATAAAATTG ATAACCTAAA CCAGCAGCAA TCCCCAGTCC TCCATCATTA CTGGCCGTCC 
CACCAACACC CATATAAATA TCTTTAATCC CTTTAGACAT GAGATGAAGA ATCAACTCTC 
CAATACCACA AGTTTGGATT TCAAGTGGAT TTCGTTTCTC TAGCCGAATT TTTCCAACAC 
CAACCAAGTC AGCTACTTCA AATAGTGCCA CTTCCCCTTT TTGAAAATAC CCCATGGCTT 
CTTTTTGTCC AAAAGGGTCT GTCACTTGGA TCCATTTTTC TTTTAGGTCA ACAGAATGTC 
GGATAGCATC TACAGTACCT TCTCCCCCAT CACCAACAGG GCAGAGGAGA CATTCTACAT 
CTGCTATCCA TTGTTGGAAG CCTCTTTTTA TTGCTTCACC TACCTCTTCA CCTCTCAACC 
TTTCCTTAAA CCAATCCCGT GCAATTACAA TCTTCATATT TTCCCTCATT CTAAACAGTC 
AATCAAAGGG ACAACTTCTA AAAAATCCCT CTTGTCAACA TGATGTGCTA TTTCTTTTTT 
GAGCACTTCT TTGGCACAAA AGGCGATTCC TAACTTCGCC GACTTCAACA TTAATAGATT 
ATTAACCCCA TCACCGATTG CCACCGTTCT TTCTTTAGAA AGTTTTAGTT TCTTTCTCCA 
TTTTTCCAGA GTCTCTTTTT TGACCTGGGG ACTTATAATT TGTCCAACTA ATTTTCCTGT 
TAAAAGACCT TCTTTGACTT CAAGCTAGTT GGCAGTGAAA TAGGCAATAC CAAGGGATTT 
. TG 5J^ TCTC . T 5 C ^CTATT GCTGTAAATC CACCAGACAC CAGACCAACT AGGATGCCAT 
TCTTTTGGAG AATAGAGATG AACTCTGGGA CATTTACCCA TAG AT G AATT GAGTTGAACA 
CGTTATCAAA GACCAAAATA GGAAGACCTT CCAACAAGCA CACTCTTTTT CTTAAACTGC 
TTTCAAAGAC CAACTCTCCT CGCATTGCTC GACTTGTAAT CTGCGAAATT TCCGCCTCAT 
GACCTGCCTC TCTCCCTAAA AGATCAATCA CTTCTTCTAG GATTAAGGTT CCATCTACAT 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4 500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 
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AAAGTGAACC 


AGTCTTGATT 


TCTCCTGCGT 


TAGTTGCAAC 


TGCAATATCA 


CCGATTGTTG 


1920 


AATCTTCAGT 


TTCACCTGAA 


CGCTGTGATA 


CAACAGCAGT 


GTAACCAGCT 


TCTTTAGCCA 


1980 


TTTCGATAGC 


TTCAAAAGTT 


TCAGTAACAG 


TACCGATTTC 


GTTAACTTTG 


ATAAGGATTG 


2040 


AGTTAGCACC 


ACCTTCTTGG 


ATACCACGTG 


CAAGGTAGTC 


AGTGTTTGTT 


ACGAAGAAGT 


2100 


CGTCACCAAC 


AAGTTGTACT 


TTCTTACCAA 


GACGTTCAGT 


AAGAGCTTTC 


CAACCATCCC 


2160 


AGTCGTTTTC 


ATCCAT AC C A 


TCTTCAATAG 


TCATGATTCG 


GTATTTGTTA 


ACCAATTCTT 


2220 


CAAGGTACTC 


GATTTGTTCT 


GCAGATGTAC 


GAACAGCAGC 


ACCTTCACCT 


TCAAATTTAG 


2280 


TGTAGTCGTA 


AACTTTACGT 


TCTTTATCGT 


AGAATTCTGA 


TGAAGCACAG 


TCAAATCCGA 


2340 


TAAATACCTC 


TTTACCTGGT 


ACATATCCAG 


CAGCTTCAAT 


CGCAGCAAGC 


ATAGTTTCAA 


2400 


CACCATCTTC 


ACTTCCTTCG 


AAACGAGGAG 


CGAATCCACC 


TTCGTCACCT 


ACGCCAGTTT 


2460 


CCAAACCACG 


TGATTTAAGG 


ATTTTCTTAA 


GACCCTGGAA 


GATTTCACCA 


CCGTAACGAA 


2520 


GGGCPrCTTT 


AAATGTTGGC 


CCACCAACTG 


GCAAGATCAT 


GAACTCTTGG 


AAACCGATTG 


2580 


GAGCGTCACA 


GTGAGAACCA 


CCGTTGATGA 


TGTTCATCAT 


TGGAGTTGGA 


AGAACTTTAG 


2640 


TGTTGAATCC 


ACCAAGATAG 


CTGTAAAGTG 


GGATTTCAAG 


CTAGTCAGCA 


CCAGCACGAG 


2700 


CTACAGCGAT 


AGACACACCG 


AGGATTGCAT 


TCGCACCCAA 


TTTACCTTTG 


TTAGGAGTAC 


2760 


CGTCAAGTCC 


CATCATAGCA 


CCGTCAATAG 


CTTGTTG ATC 


ACGTACATCG 


TAGCCAATGA 


2820 


TAGCTTCAGC 


AATGATGTTG 


TTTACGTTGT 


CAACAGCTTT 


TTGTGTACCA 


AGACCACCGT 


2880 


AACGAGATTT 


GTCACCGTCG 


CGAAGTTCAA 


CTGCTTCGTG 


TTCACCAGTA 


CAAGCTCCTG 


2940 


ATGGAACCAT 


ACCACGTCCC 


AAACCACCTG 


ATTCAGTGTA 


AACTTCTACT 


TCAAGTGTTG 


3000 


GGTTACCGCG 


TGAGTCTAGG 


ACTTCGCGAG 


CGTAAACATC 


AGTAATAATT 


GACATTTTTT 


3060 


ACTCTCCTTA 


TGAGTTAAAT 


TTTTTACACC 


TCTATAATAC 


CTTAAAACCC 


CTCCTTTTTC 


3120 


AAGAAAAAAC 


GTTATCTTTC 


TGCAACTTTT 


CCTTAACTTT 


ATAAAGTAAT 


CGCTTTCTTT 


3180 


TGTCTGTTTT 


ATTCTAACTT 


TTATGATATA 


CTGTTTTCAT 


GACAGATTTA 


TCAAAACAAT 


3240 


TACTTGAAAA 


AGCTCATGGT 


GGGTTAAAAA 


TAAATCCGGA 


TGACCAAAGA 


CCCTATCTTG 


3300 


GTACTTTTGA 


GGAAAGAGTT 


CTTGCATATG 


TAGATATTGA 


CACACCAAAT 


AGCCCTCAGT 


3360 


TAGAAAAAGG 


CTTTTTATTT 


ATTTTAGAAA 


ACCTTCAGGA 


AAAAGCAGAG 


CCACTATTTC 


^420 


TGAAGATTTC 


ACCAACTATC 


GAATTTGATA 


AGCAAGTTTT 


CTACTTAAAA 


GAAGCAAAAG 


3480 


AAACTGATAG 


TCAAGCCACC 


ATAGTATCTG 


AAGACCATAT 


TACTTCTCCT 


TTTGGCCTGG 


3540 


TTATTCATAG 


CAATGCACCA 


GTTCAAGTAG 


AAGAAAAAGA 


CCTTCGACTT 


CCTTTTCCAA 


3600 


AACTTTGGGA 


AGTTAAAAAG 


GAAGAACCAG 


CCAAAACATC 


CTTATGGAAG 


AAATGGTTTA 


3660 
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ACTTCTTTGG CAATGATATT CCTAATTCGT CTTTAAAAAA AATTCACTAT ATCGCACCTT 180 

CAGAAATTGT TTCATTTAGT ACGTACGTTC GACAACGTTC TAAAGTAATT CCTAAAATTT 240 

TGGAACATAT ATTAAAATCA ACTTTTTTAT TAGAGAATAT AGATGTTTCT GGTTACACTG 300 

TAAATATTTT AGAAGATCAA TTAACAAAAC ATAGAACAAT CAAAATTAGT AAAAACTAAC 360 

TGGTTGATCT CATGTATAAA TACCTAACAA AACCACGCGC CTTGCCTGCT GATCGAAAGA" 420 

AAGGTACAAA TACATGAATA TCAAAGAAAA AATCAAAAAG AATGGCCAAA GAGTTTATTA 4 80 

TGCTAGTGTT TATCTAGGCG TTGACCAACT AACGGGCAAA AAAGCCCGTA CAACTGTTAC 540 

AGCAACCACT AAAAAGGGCG TTAAAGTAAA AGCGCCTGAT GCGATCAATA CTTTTCCTGC 600 

TAATGGCTAT ACAGTTAAAG ACAAGCCGAC AATTACAACA TATAATGAGC TTGTAAAAGT 660 

TTGGTGGGAT AGTTACAAGA ATACAGTTAA GCCAAATACT CGCCAATCCA TGGAGGGATT 720 

GGTTAGAGTG CATTTATTGC CTGTATTTGG CGATTACAAG CTATCTAAAC TTACTACGCC 780 

4 TATTCTTCAA CAGCAACTAA ACAAATGGGC TGACAAGGCA AATAAAGGCG AAAAAGGGGC 840 

ATTTGCTAAC TACTCTTTCC TCCATAACAT GAATAAGCGT ATTTTGAAAT ATGGCCTAGC 900 

TATCCAGGTA AT AC AAT A C A ACCCAGCTAA TGATGTCATC GTTCCACGCA AACAGCAAAA 960 

AGAAAAGGCT CCTCTCAAAT ACTTAGACAA CAAAGAATTA AAACAGTTTC TTGATTATTT 1020 

AGATGCTCTG GATCAATCAA ATTATGAGAA CTTATTTGAT GTTGTTCTGT ATAAGACTTT 1080 

ATTGGCCACT GGTTGCCGTA TTAGTGAGGC TCTCCCTCTT GAATCGTCTG ATATTGACCT 1140 

AGAAAGCGGT CTTATCACCA TCAATAAGAC ACTAAACCGC TATCAGGAAA TAAACTCACC 1200 

TAAATCAAGC GCTGGTTATC GTGATATACC AATAGACAAA GCCACATTAC TTTTACTGAA 1260 

ACAATACAAA AACCCTCAAC AAATTCAGTC TTGGAAATTA GGCCGATCTG AAACAGTTGT 1320 

ATTCTCTGTA TTTACGGAGA AATATGCTTA TGCTTGTAAC TTACGCAAAC GCCTAAATAA 1380 

GCATTTTGAT GCTGCTGGAG TAACTAACGT ATCATTTCAT GGTTTCCGCC ATACACATAC 1440 

TACTATGATG CTCTATGCTC AGGTTAGCCC GAAAGATGTT CACTATAGAT TAGGCCACTC 1500 

TAATTTAATG ATCACTGAAA ATACTTACTG GCATACTAAC CAAGAGAATG CAAAAAAAGC 1560 

CCTCTCAAAT TATGAAACAG CTATCAACAA TTTATAAAAA ATAAGGGTGA CCCATTTCCG 1620 

GGCTACCCTC TTACTATACC AAAAATTAGT AGGGGTAGTA AAAAGGGTAT TAAATTATAA 1680 

AAAGCACTAA GGGAAAGCCC CCCAAAGTGC TTATTTCAAA GGCTTTATAG CCTATAATCA 1740 

CATAAAGAGA TTATTTTTTA AGGTTGTAGA ATGATTTCAA TCCACGATAT TCAGCTACTT 1800 

CACCAAGTTG GTCTTCGATA CGAAGCAATT GGTTGTATTT AGCGATGCGG TCTGTACGTC 1860 
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AACTGTCAAA 


CAAAAATTTA 


AGTTTATTTT 


CCACTTCGCT 


GA*PGJ^.\TGGC 


GTTTTGAATG 


7440 


CCAAGTGCTG 


AGAGAAATCG 


AGACAGAGGA 


CGAAGAACCT 


TATCTCGTAC 


GTTCGGTTGG 


7500 


AACCTCGCCA 


CAACAATATC 


CAGATTATGA 


TGGTTTTGAC 


TATCAACAAT 


CCTAAAATTG 


7560 


AAATCAGTCT 


GTGTAGGCTT 


AGTATTTCAA 


TAGACTTCCT 


GCAAAACTAG 


AATCCTAGTT 


7620 


CATGATTGAT 


AATACCAGCA 


ATCAAATTCA 


TTCGTAATCC 


GAAGCGTTTA 


CGATGATTTC 


7680 


GATAGGTTGT 


TGAAAACATT 


TTAAACGTTT 


TTACTTTGGC 


AAAGATCTTC 


TCAACCTTCC 


7740 


TTCTCTCCTT 


ACATAGCGCA 


TGGTTATAGG 


CTTTATCTTC 


AGCTGTTAGT 


GGCTTGAGTT 


7800 


TGCTGGATTT 


ACGTGAAGTT 


TGTGCTTGAG 


GACATATCTT 


CATGACCCCT 


TGATAACCAC 


7860 


TCTCACCCAA 


GATTTTACCA 


GCTTGTCCGA 


TATTTCTGCA 


ACTCATTTTG 


AACAACTTCA 


7920 


TATCATGACA 


ATACTTCACA 


GTGATATCCA 


AAGAAACAAT 


TCTCCCTTGA 


CTTGTGACAA 


7980 


TCGCTTGAGC 


CTTCATAGCG 


TGAAATTTCT 


TTTTACCAGA 


ATCATTCGCT 


AATTCTTTTT 


8040 


TTAGGGCGAT 


TGATTTTTAC 


TTCCCTCGCA 


TCAATCATTA 


CCGTGTCCTC 


AGAACTAAGA 


6100 


GGAGTTCTTG 


AAATCCTAAC 


ACCACTTTGA 


ACAAGAGTTA 


CTTCAACCCA 


TTGCCTCCGA 


8160 


CCGATTAACT 


TGCTTTCGTG 


AATACCAAAA 


TCACCCGCAA 


TTTC TTCATA 


AGTCCGGTAT 


8220 


TCTAGGCTTA 


ATTTAGGTTT 


TCGTCCACCT 


TTTGCCTCTT 


TAACTTCATA 


ACCTGTTTTT 


8280 


AATACAGCTA 


ACATCTCTTT 


AAAAGTCCTG 


CGCTGAACAC 


CAACAAGACG 


CTTAAATCGT 


8340 


GTATCAGTTA 


ATTGTTTACT 


TGCTTCATAA 


TTTCCCACGG 


AGTCTATTGA 


CTCTTTGGTA 


8400 


GCTCTCAATC 


TTTTTTTCAT 


CTATCCCGAG 


AATTATTTTC 


CCCCCATTTG 


TATTTGCAAA 


8460 


TGCTGAGTAG 


CTTTCCCAGA 


AAGACTCTGG 


AAGATTCTTT 


TTAGCTTTTT 


TCTATTCTAA 


3520 


ATCAACCCCT 


TCAAATTTTA 


ACTCCATATT 


TTTCCTTTAC 


ATCTGTTTTT 


TCTGGTTCTG 


8580 


GTATTTGTTC 


AAGTTGACTG 


ATAATATAGC 


GAATTGAATT 


TCGAGAGTTT 


TTACTCAGTT 


8640 


AATTTCTTTT 


TTAACCC 










8657 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D1 TOPOLOGY; lir.ear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TCTATTTTGG GTATAGACTT AC CT ATAAAG AAAAATATCT ATACACTGCC TTACTAGCTA 60 
TACTGAACGA GTCAACAAAA ACGATATATA TTGATGATAT AAATACAGCA AGATTTTTTA 120 



WO 98/18931 



PCT/US97/19588 



415 

CGCTACCTGA TTCTGAACGG TCTTGTGGAT CGTAGTCTCC TCATGTGCCA ACACTTGGAG 
CGCGTGCTGG ATATTGCTGA CAAATATGGT TTCCACTGCC AGATGTGGAG TGATATGTTC 
TTCAAACTCA TGTCACCGGA TGGCCAGTAC GACCGTGATC TGCAAATTCC AGAGCAAACT 
CGTGTCTACC TAGACCCTCT CAAAGACCGT GTGACTCTGG TTTACTGGGA TTATTATCAG 
GATAGCGAGG AAAAATACAA CCGTAATTTC CGCAATCATC ACAAGATTAG CCATGACCTT 
GCATTTGCAG GGGGAGCTTG GAAGTGGATT GGCTTTACAC CTCACAACCA TTTTAGCCGT 
CTAGTGGCTA TCGAGGCTAA TAAAGCCTGC CGTGCCAATC AGATTAAAGA AGTCATCCTA 
ACGGGTTCGG GAGACAATGG TGGTGAAACT GCCCAGTTCT CTATCCTACC AAGCTTGCAA 
ATCTGGGCAG AACTCAGCTA TCGCAATGAC CTAGATGGTT TGTCTGCGCA CTTCAACACC 
AATACTGCTC TAACGGTTGA GGATTTTATG CAGATTGACC TTGCCAACCT CTTACCAGAC 
CTACCAGGCA ATCTCAGCGG TATCAATCCC AACCGCTATG TTTTTTATCA CCATATTCTT 
TCTCCCATTC TTGATCAACA CATGACACCT GAACAGGACA AAC CGCACTT CCCTCAGGCT 
CCTGAGACGC TTCCTAACAT TAAAGAAAAA CCTGGAAACT ATGCCTATCT CTTTGAAACT 
CAGGCCCACT TGAATGC7AT TTTAAGTACC AAAGTAGATG TGGGACGACG CATTCGTCAG 
CCCTACCAAG CGGATGATAA ACAAAGTTTA CAACAAATCG CCACACAACA ATTACCACAA 
CTTAGAAGCC AAATTGAAGA CTTCCATGCC CTCTTTACCC ACCAATCCCT GAAAGAAAAC 
AAGGTCTTTG CTTTGGATAC AGTTGACATC CCTATGGGCC GACTCTTCCA ACGCATCAAA 
CCAGCAGAAA GCCGTATCCA GCTTTATCTG GCTGCTCACC TTGACCCCAT CGACGAGCTG 
GAAGTTGAAA TCCTACCATT TACTGACTTC TACGCAGACA AGGATTTCGC AGCAACTACA 
GCCAACCACT GGCATACCAT TCCGACAGCG TCCACGATTT ATACGACTTA ATATTCTTCG 
AAAATCTCTT CAAACCACGT CAGCTTCCAT CTGCAACCTC AAAACAGTGT TTTGAGCAAC 
CTGCAGCTAG CTTCCTACTT TGCTCTTTGA TTTTCATTCA GTATAAAAAC AAGAACACCT 
TGCTTGGCGC AGCCTCTTTC GCGTGAAACA CAACAATTAT CTGGTTTCAA ATGCTACAGT 
TAGACAAACT TATGATAAAA TAGCAGAAAG TGAATGTTTC CTAAGAGCAA TTGGAGGTAT 
TATGCTACAC TTAAAATTAG TAAAACAAGA AATAGAAGCT GAAAAGCCAG CATCTGTAGA 
AGCTTGGATC ATTTCCGTCA AATTTAAAAA AGGTTGCTAC CGACATATAT AGATTCCAAA 
AACAAAAACG TTAGCGGAAC TAGCAGATGT GATTTTATGG ACTTTTGATT TTGCAAATGA 
TCATGCTCAC GCATTTTTCA TGGATAATGT TGAGTGGAGT CATGCAGATT CTTACTTTCG 
TAGCTTTGTT AGTGACGATG TTGAAGAACG TTACACAGAA AATGTCTATC TGGATAGCCT 
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GAAGGCTATG 


TG CTTCGTT A 


CTACAATATG 


TGTAGTCAAA 


ATGTACCTGT 


GCCAGAAAGT 


3900 


CAACATCTCT 


TCCTTGACCT 


ACTTGAACGA 


CCATACCCAG 


TTCATTCAGG 


ACTATTGGCT 


3960 


CCACAAGAGA 


TTCGTACAGA 


ATTCATCAAA 


AAAGAAGAAA 


TTTAATTTCA 


AAAAGTAAAC 


4020 


ATCAAAAGAA 


AGGAGGGGCG 


AAAAAGTAAG 


AACTAACTGC 


TGATTCGCCC 


CTTTTATGGT 


4060 


AAAAACAATG 


ACCATTGCAA 


CGATTGATAT 


CGGAGGGACT 


GGGATTAAGT 


TTGCCAGTCT 


4140 


G ACT CCTG AT 


GGGAAAATAC 


TGGATAAGAC 


AAGTATTTCA 


ACCCCTGAAA 


ACTTGGAGGA 


4200 


TTTACTAGCG 


TGGCTAGATC 


AACGCTTGTC 


AGAACAGGAT 


TACAGTGGGA TTGCTATGAG 


4260 


CGTTCCAGGT 


GCAGTCAATC 


AAGAGACAGG 


TGTGATTGAT 


GGCTTCAGTG 


CGGTGCCCTA 


4320 


CATCCATGGC 


TTTTCTTGGT 


ATGAGGCGCT 


TAGCTCTTAT 


CAGCTACCTG 


TCCATTTAGA 


4380 


AAATGATGCC 


AACTGCGTTG 


GACTCAGTGA 


ACTACTAGCT 


CATCCAGAGC 


TTGAAAATGC 


4440 


AGCCTGTGTC 


GTGATTGGGA 


CAGGGATTCG 


CGGAGCCATG 


ATTATCAATG 


GTAGACTTCA 


4S00 


TCGAGGTCCC 


CACGGTCTCG 


GTGGAGAATT 


TGGCTACATG 


ACAACCCTTG 


CCCCTGCTGA 


4S60 


AAAACTTAAT 


AACTGGTCGC 


AACTAGCATC 


AACTGGGAAT 


ATGCTACGAT 


ACGTGATTGA 


4620 


AAAATCTGGT 


CATACTGATT 


GGGACGCTCG 


CAACATTTAC 


CAAGAGCCCG 


CAGCTGGTAA 


4680 


T AT CCTTTGT 


CAAGAAGCCA 


TTGAGCGCAT 


GAACCCCAAT 


CTCGCGCAAG 


GCTTCCTCAA 


4740 


TATCCAGTAT 


CTGATCGATC 


CAGGTCTCAT 


CAGTCTCGGT 


GGCTCTATCA 


CTCAAAATCC 


4 800 


AGATTTTATC 


CAAGGTGTCA 


AGAAGGCTGT 


TGAAGACTTT 


GTCGATGCCT 


ACGAACAATA 


4860 


CACGGTCCCA 


CCAGTTATCC 


AGGCCTGCAC 


CTATCACGCA 


GATGCCAATC 


TCTACGGTGC 


4920 


TCTTGTCAAC 


TGGCTACAGG 


AGGAAAAGCA 


ATGGTAAGAT 


TTACAGGACT 


TAGTCTCAAA 


4980 


CAAACGCAAG 


CTATTGAGGT 


TTTAAAAGGT 


CACATTTCTC 


TACCAGATGT 


GGAACTGGCT 


5040 


GTCACTCAGT 


CTGACCAACC 


ATCTATCTCT 


ATCGAGGGTG 


AGGAAGCTCA 


CTATCAATTG 


5100 


ACCTACCGCA 


AACCTCACCA 


ACTTTATCGT 


GCCTTGTCCT 


TGTTGGTAAC 


AGTTCTAGCA 


S160 


GAAGCTGATA 


AAGTAGAGAT 


TGAGGAACAA 


GCAGCTTACG 


AAGATTTGGC 


TTACATGGTT 


S220 


GACTGTTCTC 


GAAATGCGCT 


GCTGAATGTG 


CCTTCTGCCA 


AGCAGATCAT 


TGAGATATTG 


5280 


GCTCTCATGG 


GCTACTCAAC 


CTTTGAGCTT 


TACATGGAAG 


ACACTTACCA 


GATTGAAGGG 


5340 


CAGCCTTACT 


TTGGCTATTT 


CCGTGGAGCT 


TATTCAGCAG 


AGGAGTTCCA 


GGAAATCGAA 


5400 


GCCTATGCCC 


AACAGTTTGA 


CGTGACCTTT 


GTACCATGCA 


TCCAGACCTT 


GGCCCACTTC 


5460 


TCGGCCTTTG 


TCAAATGGGG 


TGTCAAGGAA 


GTGCAGGAGC 


TCCGTGATGT 


AGAGGACATT 


5520 


CTTCTCATTC 


GCGAAGAAAA 


GGTTTATGAC 


TTGATTGATG 


GCATGTTTGC 


CACGTTCTCT 


5580 


AAACTGAAGA 


CTCCCAACCT 


CAATATCGGG 


ATGGACGAAG 


CCCACTTGGT 


TGGTTTGGGA 


5640 
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AATGAACTCT TCCCCGATGT AATCTTTGTT CATACTTCTT TTGATGAATA TGTTCAAGCT 
GTAGAAGCTG CGCTTCCTGA ACACTTATCA ACTGTTACAG GCGAGTTGAC CACTCAGGAA 
ACAGATGGCT CCTACACACT TGCCAACACT TCTTCATCCC GCATTTACCT AAAACAAGCC 
TTCCAAGAAA ATACCAACCT CCTAGAGCAA GTGGTAGAAC CCTTGACTAT TATCACTGGT 
GGACACAACC ACAAGGACCA GTTGACCTAT GCTTGGAAAA CACTTTTGCA GAATGCCCCA 
CATGATAGTA TCTCTGCCTG TAGCGTGGAC GAAGTTCACC GCGAGATGGA AACGCGTTTT 
CCCAAGGTCA ACCAAGTAGG AAACTTTGTT AAAAGTAACT TGCTCAACGA CTCCAACGCT 
AAAATTGCTA CCGATAAGGC TCAAAGTGAC TATCTCTTTA CTGTCATTAA CACAGGCTTG 
CATGATAAGG TCGATACTGT CAGCACAGTG ATTCATCTCC CGACTTGTGA TTTCAAGGAA 
TTCCACCCAA CAGAAGGCTA CAAAAAGATG GCTGCTCTTA TCTTGCCAAG TTACCGTGTG 
GAGGACTTGG ATGGTCGTCC TGTAGAGGCT ACAATCGAAC ACCTCGGAGC TAATTTTGAG 
TATAATTTAC CAAAAGACAA GTTCCGCCAA GCTCGTATTG CTCGTCAAGT GCGCGTGACC 
ATTCCACTTC ACCTAGCGCC GCTTTCTTGG ACAACCTTCC AATTGCTGGA ACGAAAACAA 
GAACACCGTG ACGCTATTTA CCAAAACGGA GTGATTGATA CACCATTCCT AACGCTCACT 
CTGGATGACA ACATCACAGT CTATCACAAG ACAACTCACG AAGCCTATCA AGACTTTATC 
CGCTTTGAAG ACCCTGGGGA CATCGGAAAC GAGTATATCT ATTTCCAACC AAAAGGAACA 
GACCCAATCT TTGCAGAGCT TAACCGCCAC GACGTCTTGG AAAACACAGC TTGCTATGCT 
AAAATCTTGC TCAAACATGA ATTGACCGTG CCTGTCACTG CCCATGAAAA GCTAGAACAA 
GAGCAACAAG GTATCATCGA GTTTATGAAG CCTCAGCCTG GACGGTCACA AGAATTGACA 
AACATTCCTC TGCAAACTGA GTTGACTCTC TTCGTTGACA ATCCACAAAT CCGCTTCAAG 
ACTCGCTTTA CTAACACTCC CAAGGATCAC CGTATCCGTC TCTTGGTCAA GACTCATAAC 
ACGCGTCCAA CCAATCATTC TGAAAGTATC TATGAGGTGG TGACACCACC AAACAAACCA 
GCTGCTTCAT GGGAAAACCC TGAAAATCCT CAACACCAAC AAGCTTTTGT CAGTCTGTAT 
GACGATGAAA AAGGGCTGAC TGTATCCAAC AAGGG ATTGA ATGAATACGA AATCCTTGGG 
GATAACACCA TTGCCCTG AC CATTTTCCGT GCATCAGCTG AGCTAGGTGA CTGGCGCTAC 
TTCCCAACGC CAGAAGCACA ATGCTTGCGG GAGTTTGAAG TCGAGTTTGC ACTTGAATGC 
CACCAAGCCC AAGAACGCTT CTCAGCCTAT CGTCGTGCCA AAGCCTTGCA GACACCGTTT 
ACCAGCCTTC AGCTTGCTAG ACAGGAAGGA AGCGTGCTTC CGACTGGTAC CCTCTTGAGC 
CATTCTGTTC TCAGCATACC CCAAGTTTGT CCAACAGCCT TTAAGGTAGC TGAAAATGAA 
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TTCAACATTG 


AGGAGAACTG 


GAAAGGGCAC 


CACGAGACTG 


ACCACACAGA 


CCTTAACGGC 


360 


TGGATCTGGG 


AGCGCAAGTA 


TCAGGTGGAT 


TCGCTTTGCT 


ATCCTTTGCA 


GTTGCCTTAT 


420 


CTCCTCTGGA 


AAGAGACTGG 


CGAGACTAGT 


CACTTTCATC 


AGATTTTTGT 


CGCAGCCACT 


480 


AAGGAAATTC 


TCCATCTGTG 


GACGGTGGAA 


CAAGACCACA 


AGAACTCTCC 


TTATCGTTTT 


540 


GTCCGACATA 


CGGACCGTAA 


GGAAGACACC 


TTGGTAAATG 


ATGGCTTTGG 


ACCTGACTTT 


600 


GCAGTGACAG 


CTATGACTTG 


GTCAGCTTTT 


CGTCCGAGTC 


ATGACTGTTG 


CCAGTATAGT 


660 


TACTTGATTC 


CGTCAAATAT 


GTTTGCTGTA 


CTAGTCTTGG 


GTTATGTGCA 


AGAAATCTTC 


720 


GCAGCATTAA 


ACCTAGCTGA 


TAGCCAGAGT 


GTTATTGCTG 


ATGCCAACCG 


TCTTCAGGAT 


780 


GAAATCCAAG 


AAGGAATCAA 


AAACTACGCT 


TACACCACCA 


ACAGCAAGGG 


CGAAAAGATT 


840 


TACGCTTTTG 


AAGTGGATGG 


CCTAGGAAAT 


GCCAGCATCA 


TGGATGATCC 


AAATGTACCA 


900 


AGTCTACTAG 


CTGCGCCCTA 


TCTGGGCTAC 


TGTTCGGTCG 


ATGATGAACT 


GTATCAAGCT 


960 


ACTCGTCGTA 


CCATTTTGAG 


CTCTGAAAAT 


CCATACTTCT 


ACCAAGGAGA 


ATACGCAAGC 


1020 


GGTCTCGGCA 


GTTCTCATAC 


CTTCTATCGC 


TATATCTGGC 


CAATCGCCCT 


TTCTATCCAA 


1080 


GGCTTGACAA 


CAAGAGATAA 


GGCAGAGAAA 


AAATTCTTGC 


TGGATCAGCT 


GGTTGCCTGC 


1140 


CATCCTGGTA 


CAGGTCTCAT 


GCACGAAAGC 


TTTCATGTAG 


ATG ATCCG AC 


CCTCTACTCT 


1200 


CGTGAATGGT 


TCTCCTGGGC 


TAACATGATG 


TTCTGTGAGT 


TGGTCTTGCA 


TTACTTGGAT 


1260 


ATTCGCTAAG 


GGCCTCGCTT 


TAGCTCAACC 


CATTCTTATC 


AGAATCACAA 


GTTTACATTT 


1320 


AAAACGTTAA 


AATTTAAATT 


TAGAATGAGC 


TTTTACTTCA 


TGGAAAATGT 


TGTTGTACA? 


1380 


ATTATCTCAC 


ATAGTCACTG 


GGATCGTGAG 


TGCTACTTGC 


CTTTTGAAAG 


CCATCGTATG 


1440 


CAGTTGCTGG 


AATTGTTTGA 


CAATCTCTTT 


GATCTCTTTG 


AAAATGACCC 


TGACTTCAAG 


1S00 


AGTTTCCACT 


TGGATGGACA 


AACTATTGTC 


CTTGATGACT 


ACTTACAAAT 


TCGCCCTGAA 


1S60 


AATCGCGACA 


AGGTCCAACG 


CTACATTGAC 


GAGGGCAAAC 


TTAAAATTCG 


TCCCTTTTAC 


1620 


ATCTTGCAGG 


ATGACTACTT 


CATCTCCAGT 


GAAGCCAATG 


TCCCCAATAC 


CTTGATTGG7 


1680 


CAACAAGAAG 


CTGCCAAATG 


GGGTAAATCA 


ACCCAGATTG 


GCTACTTTCC 


AGATACCTTT 


1740 


GCAAATATGG 


GACAAGCGCC 


TCAAATTCTT 


CAAAAATCAG 


GCATTCACGT 


GGCGGCCTTT 


1800 


GGTCGTGCTG 


TGAAGCCGAT 


TGGATTTGAC 


AACCAAGTCC 


TTGAAGATGA 


CCAGTTTACG 


1860 


TCTCAGTTTT 


CAGAAATGTA 


CTGGCAGGGT 


GTGGATCGTA 


GTCGTGTTTT 


AGGTATTCTC 


1920 


TTTGCCAACT 


GCTACAGTAA 


CGGCAATGAA 


ATTCCAGTTG 


ACAAACATGA 


GGCCTTGACC 


1980 


TTCTGGAAAC 


AAAAATTGTC 


AGATCTGCGT 


GCCTACGCTT 


CGACCAACCA 


ATGGTTGATG 


2040 


ATGAACGGCT 


GTGACCACCA 


GCCTGTACAG 


AAAAATCTGA 


GCGAAGCCAT 


TCGTCTGGCA 


2100 
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CTCCCCATAT 


AAGCACTAAC 


ATTCCAACGT 


GCACATATTC 


GAACGACATC 


CATAACTCCA 


8700 


CACAATCTCT 


AAAGTTTACA 


ATTTAAATGA 


ATTAACAATT 


TTCCCAACTA 


AAAGCACTCC 


8760 


AGTTACCGCA 


ACGATTTGTA 


CTGAATGTAC 


TAAATCGCAT 


TC CATCAACT 


TCATCTGTTT 


8820 


CGTCAACTTG 


AACAGATACT 


AATTGAAGAT 


TTAATACTTC 


TTCTGCCATA 


GCTAGCTCCT 


8880 


CCTATTTAAA 


TTTTTGGGAT 


TAAGTACTTT 


ATCCACCCTC 


ATTATACTCT 


CTCCACCAGT 


8940 


AAAATCCAAG 


CAATTATACA 


ATGTTGTCAC 


ATAGAAAATA 


ATGTTTCCGT 


AACTTTTCAA 


9000 


AGTAACTTCC 


ATCTCTCTCC 


CAAAACTGGA 


AGTTAGTTTT 


AGAAGTTACC 


TAAAAATCAG 


9060 


GTCACCTATT 


TTAAAAAAGC 


AGCAAACTAT 


AAACTAGTAG 


GTTCCACACC 


AAATCTAGTC 


9120 


CCATACTGCC 


CCATAAGTCA 


GATTTATAGC 


GCACCATACC 


TAAAAACATC 


CCAAGTGAAA 


9180 


CAT AC AAAC A 


CCAAGCTAGA 


ATGGTTCCTG 


TATCATGTGC 


TAAGGCAAAT 


AAAACACTTG 


9240 


TCAAAGCAAC 


TCTGATATCT 


AATTTTCTGA 


CCAAATTCCA 


TAAAATTTCT 


CGATACAGAA 


9300 


ATTCTTCAAC 


CATACTCGCA 


TTGATTAACA 


ACAATAAAAA 


TGAAAACCAA 


GGAATTTGAT 


9360 


CTTGAACCCC 


AATTAACTTT 


GCTTGATTCG 


TGCTTCCTTG 


ACCATGAATC 


AGACTAAAAC 


9420 


ATAGACTTAT 


AATCAGTAGG 


CTAACAAATT 


CAACACCAAC 


CCATTTCATC 


CTAGATTTCA 


9480 


TATTGACCTT 


ATGCGCTTCT 


TTGCGTTGGC 


CATACATCCA 


TAAAAAAGAA 


ATGAGTCACG 


9540 


AACCATAGAG 


AATCTGTAGT 


ATACTTmACT 


CACCGATACA 


AAGAAATTTC 


AATAAGTATA 


9600 


GAG r T ACCAA 


TAsGACATTT 


ACTTGTTGGA 


ATATATAAAC 


TGGAATTATT 


CTTTTCATAC 


9660 


TTACCTCCGA 


AATAAATCTT 


CATAATCTAA 


ATCTAATACC 


TGCACAATCC 


TTT 


9713 


(2) INFORMATION FOR SEQ ID NO: 44: 








(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 8657 base pairs 
<B> TYPE: nucleic acid 
(C) STRAND EDNESS : double 
ID) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44; 






AAAGAAATTG 


TCAGAGAGTG 


CCTAGATGAA 


GTAGCAGAGC 


GGGCTAAGGA 


CTATCCAGAG 


60 


TGGCTGGATG 


TTTTCGAGCG 


TTGCTACACC 


GATACCTTGG 


ACAATACGGT 


TGAAATCTTA 


120 


GAAGATGGTT 


CAACTTTTGT 


CTTGACTGGC 


CATATTCCTG 


CCATGTGGCT 


TCGAGATTCG 


180 


ACAGCCCAAC 


TCAGACCCTA 


CCTTCATGTA 


GCTAAAAGAG 


ATGCCCTCCT 


GCGTCAGACC 


240 


ATTGCAGGTT 


TGGTCAAACG 


TCAGATGACC 


TTGGTACTCA 


AGGATCCCTA 


TGCTAACTCC 


300 



DMcnnnn- ^.win oqi pqti &r> i 



WO 98/18931 



PCT/US97/19588 



410 



ATTTACTGCC 


TCATTAATAA 


CCTGTGTTCG 


AATCTCATAA 


TAGTCATCAA 


ACTTGAAATT 


6900 


TTTTACTTTC 


TTAGCTAGTT 


GTTGATAACT 


CCAAAGGATA 


GCTAAATCTG 


AAAACGCAAT 


6960 


TCCTTGATTA 


AAATTCAGAC 


CATAATAATG 


AACTGGGAAG 


AATCTTGATT 


GAAATTCTTT 


7020 


ACGCCACTCT 


CCATAAGTTA 


GCGTAAACCC 


TCTCAATAAT 


TTTATAATAA 


AATCTTGTAT 


7060 


ATCTTGCTCA 


CTCTCGATAG 


TTCTAATCTC 


ATGCATGGGT 


TTTAAAACTT 


TTTTCCTGGA 


7140 


AATATTCTCA 


ATCTGTGGAC 


ATTTAGAATC TAGATATGAC 


#W\ 1 /w»w * * * 


CTACATAATC 


7200 


TATATGTTCT 


CTTGTATAAC 


CCAAAGACTC AAATAGTTTT 


ill V>\p 1 


TCCTCGTTTG 

ft ft* W ft • A * 


7260 


ACTTACATAG 


TTGTATGTCA 


AATCCGATGT 


AGTTACTAGT 


fW.CA.TGT AT A 


AATAATGAGC 


7320 


TATTTGTCTA 


ATACCATACC 


AATCTATCTC 


ACTGGCAAGT 


(ill 1W1V,ULV( 


ATCCTTTAAA 

i\ ft WW ft W * 


7380 


ACCAGGGGCT 


GCAACTTTAT 


GTACAACTTT 


TT CATCAl 11 




WW » * * w w w>#% 


7440 


CTCTATTATA 


CTAATCTCAT 


CTTCATCCTT 


m ft /••/■» ft ft ^ ft M» % 

AACCAAGATA 




RTAAATt?TTG 


7500 


ATGATATACA 


TTTTCAGAAT 


GAAACTTATT 


< won ft k R fWt^f* 

CCTTAAATCG 


ATGAGTTTTT 


W 1 #%W ft *V ft W) * 1 


7560 


TGAAACTCTC 


AATAGATAAT 


CTTTGGTCTT 


ATCAACAAC r 


It HI ft It! ft It ». ^ 

TCATATAAAG 


f* A & A ATT A TT 


7620 


GGTAACCCAT 


CTATTTAGTG 


GAACGCCCTT 


CATATGTTLA 


ATTCCTAAGA 


nwlu I V»C J w 


7680 


CCAGATCTTA 


CCCTGCCAGT 


ATATTTTAGC 


CGvt.TtAV.1 w 




f* A ATTTTT Af^ 


7740 


TGCTTTGCAC 


TCCGAAGCTA 


ATTT CTCTG A 




/~f* K T^* & A A Tf" 


^_ 1 AUMw W 1 V> i 


7800 


ATACCGTCTA 


GCCTCTTTTA 


AAATTATTTT 


TTTCCCATCT 


TL il I 1 nuw w 


TACCATTATA 


7860 


TATCCCACCA 


CTGTTTGAAA 


ATCTAATTGC 


ATTATCTATA 




#*U ft W 4 W W W 4 » 


7920 


TTTTTTATCT 


TTCTTGTCAA 


CC C ATTT ATT 


CAAAAAGTCA 




* **. W W A 4 ft 4 W 


7980 


AATTTTAAAT 


ACTGGTAAAC 


GTTCATCTTT 


AACAACTTCA 


1 WWW W#%/%Wi , l* , \ 


TTAATTCATC 


8040 


AAT AGCAACC 


WJ TITCAT 


CATCCCTTGA 


CGGCCTAAAC 


/%W#\l- W^\ A /WW 


TCAGATATAT 


8100 


TGCTGCTTCA 


TCCCAACGTT 


TATCGCTTAA 


AATATATGCC 


f*C ATT AT ATT 


GCTTT AAGGC 


8160 


ACTTTCTAAC 


CTTTGCAAAA 


CCGACTCTAA 


TTCATTTTGA 


TTTCGATAAC 


ATGTAATAAA 


8220 


TTTACCAGAA 


AATCCTCGAC 


TAACCAATTT 


CCCGTTTCGC 


ATGATAAATT 


TGTCTTCTGT 


at on 


ACTAAGATGT 


TTAAATGGAA 


TTCGCATTTC 


ATCGCAAATT 


TTTGCTACAT 


CTTGTAACAA 


8340 


TTCATCTCAA 


CTGTTATACT 


CTGAACTAAT 


CTGTATTTTC 


CACCCTTGTC 


TTTCAACAAA 


8400 


TTTTCCAATA 


GGGTATTGAT 


AAACCCACTC 


ATCATTATTC 


ATTACTTCGT 


GCCAATTAAA 


8460 


AGGCAGACTT 


ACTTGGTACT 


TTATGCTAGT 


ATCTGTACTA 


TAATCATTAT 


TAGTGAAAAA 


BS20 


GAAAGGATGC 


TCCAAATTGA 


AATTATAATC 


CATAACAAAA 


TCTCCAAGAA 


ATTTTATCAA 


8580 


ACTTAATATA 


TCTATAGCTA 


GACAGACTTA 


TTTAAATAAA 


AAGGGAGAAT 


CCTTTGGATT 


8640 
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CTCCCGAAGA 


AGTTTCCCTT 


AGAATTAGTT 


G ATCTTT CTT 


T TrTTCAGTT 




5160 


CAAGAAAGTA 


CTGTGCTTTT 


TCTGTACTAA 








5220 


CCGTTTCTGT 


CTAAGAGTGT 


ft ft ^ ft. ft. A A ^ m 

CTAACAAAAL 








5280 


TGCCTTTCTG 


AAAGTCTACT 


^» ft MfMWP ft ^+ ft ft 

GACTTTACAA 


AALAA1 iAl X 






5340 


TTAATCGAAA 


AGAGCATTCG 


TTTTCTTCAA 


ALAuTTuvI (_ 


T^f^TVTT A A A/"* 


^ in 1 ww\Au 


5400 


ATTTATAGAA 


TAACTTACTT 


GG CCTCCCGT 


ACTCTTTGGA 






5460 


ATTTACCCAA 


ATAGAACGAA 


CTTTCTACTG 


ft ft ft ft *i*^**f**t*^* 

AAATATCTTC 


AATGATAAA1 




39«V 


TAGTATATTT 


TTTTATTCCA 


ATTAAATTAG 


TCGTACGCAG 


TGAGGATAuA 


a f***"* & a a a a 
ALCAAAACT A 




TATAACTCTC 


ATCAGATGAA 


ATCCTAACAT 


CCTGTAAGAT 


ACTATCATCT 


GGCAAAGTAT 




ATTTTTCCAC 


ATCAAAGACA 


ATTTTAAGTC 


AATTTGAATT 


GTCTAAACTG 


ft ft ft ft ^fft ft ft 

GAAGAACTAA 




CCTTAGGAAT 


CCAGTCATTA 


TCTTCGACAT 


ACCATTCCTT 


TATTACACCA 


GTATTGGGTA 


C*?£A 

3 'DO 


TACTCCAATT 


ATCAAATTGG 


TACCAATATC 


GCCCTCTCCT 


AAATATCAAA 


GAATTCCATT 


5820 


TTTTTAATTC 


CTCAAATGAT 


GAAGAGATAG 


ACCTCTTATA 


GTGTGTTTTT 


TCCTGTATTG 


5880 


TATTTAAAAA 


TATTTCATTA 


CTCTGATTCA 


CAAGTATGAC 


CCCTTAATAA 


TGGT ATCT AA 


594 0 


AT ATT AT ATT 


TGAGGAAGAA 


TCCTCAATTT 


ATTATCCATT 


ATTGATACCA 


ATCCAATTGC 


6000 


AACACCCGCA 


AATCCCGAAG 


CAATATCTGT 


TGTTATCTTT 


AAACCATTAT 


ft ft Mi 

CTCCCGCAAT 


6060 


AACAAATCCT 


TCTTCAATTA 


CACACAAATA 


TCTATAAAGT 


TGTTC AATT A 


ATTTCTTTTG 




TCCTGAAAAG 


TTATCATCGA 


TATCACTATA 


TATATTATTA 


G C AACTTC AA 


^ ft ft ft ft ft ft 

GACCACAAAA 


c i on 
o low 


TCCGTTAAAT 


AAACCTGGTA 


ATACACAAAA 


AACTACATCA 


CTTGC CCTCT 


CTAAAuAAOl 


A 


TAAATATTTT 


AAGTATTTGC 


TTGACAAGAT 


TTCTTTATTT 


CTAll AATAA 


PTi a. a A cf* 


O J UU 


GCCAGCACTT 


CCAGTTGCTA 


CATATGGTAG 


TAATCTATGA 


CCTTGG CTGT 


AL i vj^-/\>V 1\»A 


O JOV 


ATTATTACTA 


TCTACTTTAT 


AAGCAACTAA 


TTCTTTATCT 


ACAGCCAATT 


CTAGACCATT 


6420 


TTTATACATA 


CTTTCACCAG 


TTAATTTATA 


ACCTTCACCG 


AAGAGCCAAG 


CTACCCCTGC 


6480 


GTGACCATAT 


AGTAATCCAC 


CAAAATTCTC 


ATAAGGATCG 


TTACTCTGAA 


CATCACTAGC 


6540 


GCCAACTTT A 


CAAAAACTTT 


CTGCATTTTC 


TATATAATTT 


AAAGTATATT 


CTCTAAGCCT 


6600 


AATTAGTATT 


TCTTCTCCTA 


GTTTATTATC 


AATTCCCCCT 


TTACTAAGAA 


AATACAGTCC 


6660 


AACCAGTAAA 


ATTCCAGCCT 


GCCCACTATA 


TAAATTTTTA 


TTTTGTGAAT 


TCTCAAATAT 


6720 


CTCTATAAAA 


TGAGTTGTAA 


AAAGTTCAAC 


TGCCCGATCT 


ATCTCCCCAA 


ATTCATAAAT 


6780 


GAGCCAGATT 


GTACCAATTT 


TACCATCAAA 


AAGACCAGAA 


AGGGACGATT 


TCTTAAAATT 


6640 
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AAATTAAAAG 


TAGAATCTTC 


TAGTTTATCC 


AACTTTTTAT 


CCGACAAACT 


AATTATTTCT 


3360 


TTAGTAACAG 


AATAAGATTT 


TAATGTCTTA 


AAACCATTAA 


AAATTTCTTT 


TATTATGTGA 


3420 


CTATACTCTG 


CATTGCTGTT 


AGACTACTCA 


TTAGCTGAAT 


TAGACAACAT 


CTTCTTCATA 


3480 


AAGACACCTA 


CTATAATCGG 


CAATGCTGAT 


AATACAATAA 


ATATTATTGA 


nACTAGGAAG 


3540 


TTTAAATAAA 


GCATAAAACT 


TAGAGAGACG 


ATGAACAACA 


ATATTGAAGA 


AATTATTTCA 


3600 


AAAATTTGTC 


TAAAATAGTT 


TTCTTCGATT 


AATCTCAAAT 


CATTTCACAA 


AACTGAAATA 


3660 


ATAGATCAGT 


AATCTTTAAC 


CATTTCAGAA 


GAAAGATACT 


CTTCTCTAAA 


ATATCCTTGT 


3720 


TTAATTTTTA 


CATTTATATC 


TTTAGTTATT 


GATGCTTCCG 


TTACTTCTAA 


ATAGTAATTT 


3780 


GATATATAGA 


TTGCTGACCA 


ACCCAGAATA 


CTTATAGCAC 


CAAATCTTAG 


AACGTCAGAA 


3840 


AATGAGGAAG 


TCTGATTTAA 


ACTACCTGCA 


TATACAATAA 


TTCCTGAGAG 


CAAGACACCA 


3900 


TTAAACGAAG 


ATAGAAATAT 


TAAAATCCCC 


ATTAATATAA 


GTTTAGTCTT 


TTTTATAAAT 


3960 


TTTAAATAAT 


TCATAAGTTA 


TTCCTTCCCA 


CTTCTTCAAA 


GAAATAATTT 


AAAGTATCAA 


4020 


TCATTAAGAG 


AACATCTGAT 


GGAGTAAAAC 


CTCCATCACC 


AGCTGCTTTC 


TTTAAATACA 


4080 


ACAAACTTTT 


AACTCCAATA 


GAATTTAATT 


TCTTTGACCA 


CTCTATCACT 


TCCTTATTAT 


4140 


TAATATATCG 


CTCTTTCTCA 


CCCAAAATAT 


TAACTATAAC 


AGTATTTGAG 


TCTCGTGCCT 


4200 


TTTCAATATT 


TTGCATAGGC 


GAATATGACT 


TTATATAACC 


CTTTACTTCA 


GGGTCTCTAA 


4260 


TATCTCCCCA 


CTCTGCTATT 


TCGGTCTTAG 


AAAGAGGATC 


ATTTCGATTC 


TGAAGTGTAT 


4320 


CATAAGGATT 


TATAAATCGC 


GAAAATAAGA 


GAATGCTTTG 


CAATAAATTT 


TTTTCCTCGT 


4380 


TCAACACCGC 


ACCAGCAATT 


ATTCCACCTG 


C ACT AG AAGT 


TATTAAACCT 


AATCGCTTAC 


4440 


TGTCAATTAC 


ATCATTTTCC 


CTTAAATAAT 


TTACTCCCTC 


AATAAAATCT 


CTGATAGAAT 


4500 


TCCATTTGTT 


TAACGCCTTT 


CCTGACCGAT 


ACCATTCACC 


ACCCAAATAC 


CCTCCACCTC 


4560 


TTACATGAAC 


TATAGCATAA 


ATAAAACCTG 


CATCTATTAT 


AGATAACATA 


ATTTCATCTA 


4620 


AATCAGAATT 


AT CATTCTT A 


CCATAAGCCC 


CATAGACACT 


TAGAATACAT 


TTTTTTCTI'C 


4680 


TTGGGAGCTC 


ATCCGTATCT 


TCACTTTTCC 


AAAATAAAGA 


AATCCGTATG 


CTTACATCAT 


4740 


AACTGTCTTT 


TTTACTCCAA 


ATCACCTTAG 


AAAAATATTT 


ACTATTATTC 


GATTTTATGA 


4800 


TGGCTCTTTC 


AAATTCAGTT 


TTTAATGTAT 


TTTCTATTAA 


AT C AAAACT A 


ACTATTTTTT 


4860 


CGTAAAAAGT 


TCTCCTCTCT 


AAAAACAGAA 


GAACACGATC 


AGAAAATGAA 


TTTTCATAAA 


4920 


GTGTTGTCTT 


TTCATCAAAT 


GTTATCTTAT 


TAACACTCAA 


CTCCCTCAAA 


CTATTATTTT 


4960 


TAAATGTAGC 


AAGATAAAAG 


ACCGAATTCG 


CTGCGTTTGA 


ACACTCTAAA 


AGGATATAAC 


5040 


GTCCTATACA 


CTGAACTCTT 


CTAGCCCTAT 


CTTGATATGG 


TATACTAATA 


CAAACTCTGT 


5100 
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ACCATTGTTT CTTATTATGT AAAAATTGAT AGTAAACTAA TCTGAGCATC TCTATTCCTC 
CGTAGTTGAT TGTACCTCTA AGATTTTATA CAACTCTTCC CCGCTAGGTC TATGAAGTTC 
TTTGAAAATT TTTCCATCTT TCAATATTAA TGCACGATCA CTTTTCGAGG CCAATTCTAT 
ATCGTCCGTT ACCATAATTA CACACTTACC CGCCCCTACT AACTCTCTCA ATAATTCAAA 
AATTACTTCA CGAGAAACGC TGTCTAAAGC CCCAGTTGGC TCATCAGCAA ATATTATATC 
ACTATCAGCA ATAACCGCTC TAGCTATAGC AACCTTCTGT TGTTCTCCAC CAGACAGAGT 
TCCAACAAAA TCGTTTAAGC CAGCATTAAA CTTCATTCTT TTGAGTAAGT TTTCTACATT 
TTTAATAGTT AATTTTTTTT GTGATAATCG CAAAGGAACT GCTATATTTT CTATTACCGG 
CAGGGAAGGT ATTAAATTGT ATGCTTGAAA TATAAAAGAT ACTTCGTTAC CTCTTATACT 
TGACAATTTT GCATTTCTGA TTTTATAGGG CTTGATTCCA TTTAAAATTA CTTCCCCACT 
TGTTGGTTCA AGCAAACTAG AAATACATTT TAATAAAGTT GACTTTCCAG AACCACTAAT 
TCCTAGAATA CTTATAAATT CTCCTCTCGA AGCAGAAAGA GAAACATTTT TCACCACTTG 
CAACGTTTTA TTATTTCCTA GTAAAAATTG ATGATACAGC CCTTTCACTT TTAATATATA 
ATCTTTATCC ATATTCTTGC CTCCAATCAC TTAATTTTCA AAAGTCTTCC ATTTTCCAAT 
TTATATATAT CACTGTATCT CTTCTCATTT AACTCATAAT CATCTCAAAC TTCAATAAAT 
GAAATACCTA AATTGAACAG AATATCATCT ATGGAATTTG AATTATCATT ATCTAAATTA 
GCTGATATTT CCTCAAATAA CTACACTTTA TTATTTCTAA TCAGACCTCT AGCTAAACCT 
ATTTTTTGTT TTTGACCTCC AGACAAATTA CTACCATTTT CACCACATTC ATAATTTAGT 
ATATCTATCT TTTCTAATTC TTCATATAGA TTTACCTTTT TTAACACCTC AATTATCTGA 
TCATCTGAAA AATATTCATT TTGAAATAAA CTTACGTTCT CACGAATAGT AGTGTCAAAA 
ATATATGGTG TCTGATCAAC TGTTGGTATT GAATCTGAAC TCTTTTTCCC ATGTGATAAC 
AAATTTACAT AACCTTTTTG TGGCTTTAAA GAACCATTAA TTAAATTTAA AATCGTTGTT 
TTCCCACTAC CAGAAGTTCC TGTTAATAAT ACCCTAAATG GTGACTTAAA TCAGAAGTCA 
ATACTTAATT TATTTTCTCG TGTAATAGAA TATACAACAT CTTTCATCTG TATCTCATCT 
ATTGATGAAG TATACAGTCC GTTATTATCA TGTTCAGCGT CTATAAAATT CTTCTCTCCA 
CTTAACTATT TTAAAAACGG TTTCCTTAAA TCTTTGGTTG TATTTATCTT ATTTAATGAA 
TACGCAATTG ATTGTATCGG CCCTAAAACT TTATCGTTTG CTAAGAAAAT ACCTATCAGT 
TCACTAAAAG AAACGCTTTT ATGATAAATT ACAAAATAAC ATCCTACAAC CAAGGGAACT 
AGAAAGCAAA AACCTGAAAT TAGTACTGCA ACCAATTTTG AAAGAACCTC TGATCGTTTC 



1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3160 

3240 

3300 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

AAGTTTACAA TTTAAATCAA TTAACAATTT TCCCAACTAA AAGCACTCCA CTTACCGCAA 60 

CGTTTCTACT GAATGTACTA AATCGCATTC CATCAACTTC ATCTGTTTCG TCAACTTGAA 120 

CAGATACTAA TTGAAGATTT AATACTTCTG CTCCCATAGC TAGCTCCTCC TATTTAAATT 180 

TTTGGGATTA AGTACTTTAT CCACCCTCAT ATACTCTCTC CACCAGTAAA ATGCAAGCAA 240 

TGATACAAAA TAGATTTAAC TATTTTATAT AGCGAAAACT TACAAATTTT TAACAAATAA 300 

TTTTTGCATT CTTAAAGATA AAATAGGAAC TTTTAGTAAT AAATATTAAA ATAAATAAAA 360 

TAATAGATAC TATAAAATTT GGAAGTATTA ACCCCAAAAG ATTCATATCA TCTATTAAAA 420 

TATCCTCTAA AGAGTAGTAT ATTAAACCCA TAATTTTAAT CTTAAGTAAA AATGCAATTA 480 

ATGAAGTAAC AAATGTCAAA AATATAGCCT CACCAA C TTT AATCTTAACC ATCTGGTAAT 540 

TAGAAGTTCC TAAAATTTCA AATTGCTGAA TCTCAATCCT TTCTTGATGC CATGACAAAA 600 

ATGCAATTGA AATAATATTT GCAAGTACTA TCAAAATTGC TGCTCCTACA TAGACAATAA 660 

ATGCTACTTT TAGCTCTAAA TCACTGTCAT CTTGAAATTG AGATACTATA TTCTGACAAA 7 20 

TCATTTGAAA ACTAGAAATT AGTAATATAG CTCCTGTAAT TGCAGCACTG ATAGATTTTA 780 

TATAAGACTT ACAATATAGT AAATTCCACT TCGAAACAAT GAACATAAAA TTATTTCTAA 840 

ATATAATTAT AGAAAGTAGT TTGATAAAAC ATGACTGTAT AAAAGGAGAT AATTGATAAA 900 

TAATCACAAT ATCTAAGATT ACAAT ATTG A ATATTATCTG GGCCTTCGCT AAAATTGTCC 960 

TATCTTGGAA AATTTGTTGC AAAGAAAGCA ACCAGATAAC ACTAAAACCA GCCAATAGCA 1020 

GTATTCTTTT TACT ATTG AA AGAACATGCC TTATTTTAGA ACTCTTCCTA TTTCTAATCT 1080 

TCTTGAACGT ATAAAAGCAA CCACTTAGAA AGGCTAAAAA TGAAATCAAC ACTACTGTAA 1140 

TGATACATCC AACAGCACTC CTTTGAAATT GGATATCAGG TAATATATTT TCCCCGAAAA 1200 

AGTATTGTAA AAAATAATAA TAATTTGACG TAACAAATAT A GAG CAT AG A TATGCAATAA 1260 

AACTAATAAT CGAGGAAATG ATAAAAATCT GTCCCCCCAC AAGAAATGAT AGTTGAAGGC 1320 

GACTTGCTCC CAACACCTCC ACAAGTTCGT AATCATCTCT AAAAATTTCA ACCAACATAT 1380 

TTATTATGTT AGAGAGCACA AAGAATAATG TTACTCCTCC GAATACTATC GCAAACATAA 1440 

AAATTGGTTT AGGATCTGGA AGTCCGACAA ATACTTGCGA ATTATTCTCA ACATTAATTA 1500 

CCCCATTAAC AGCCAATCCC ATAACTAAAC TCGAAACAAA AATT A CTGGT GAAACGCCTA 1560 
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AACTTGGAGC AAATTTATCC TGACTAGTGA TGAAGCTTTT GAAAGCAAGT ATGGTAGCCA 
AGCAGATAAG AAGCGTAAGT TATACAACGG AACCTTGAAA GTGGATCTAT ATCAATATTT 
TGGTCAGCGT GTCAAACGGC AAGAGGTAAA ATAGAAAGGG ATACTCATGA GTAAAAAAAG 
ACGAAATCGT CATAAAAAAG AAGGTCAAGA ACCGCAATTT GATTTTGATG AAGCAAAAGA 
GCTAACAGTT GGTCAAGCTA TTCGTAAAAA TCAAGAAGTG GAATCAGGAG TCTTGCCTGA 
GGATTCCATT TTGGACAAGT ATGTTAAGCA ACACAGAGAT CAAATTGAGG CGGATAAGTT 
TGCGACTCGT CAATACAAAA AAGAGGAGTT CGTTGAAACT CAGAGTCTGG ATCATTTAAT 
TCAAGAGATG CGTGAGGCTG TAGAGAAGTC AGAAGCTTCT TCGGAGGAAC TTCCATCTTC 
TGAAGACATC TTACTACCCT TGCCTCTGGA CCATGAGGAC CAAGGCTTGG ATCCTCTATT 
GCTAGATGAT GAAAATCCAA CACAAATGAC TCAAGAAGTG GAAGAGGAGC AAAACCTTTC 
TCGTCTGGAT CAACAGCACT CAGAAAAGAA AAGTAAAAAA GGCTTTATTT TGACCGTTTT 
CCCGCTTGTA TCAGTAATTA TTTGTGTCAG TGCTTATTAT GTCTACCGTC AAGTCCCTCG 
TTCGACTAAC GAAATTGAAA CTTCTCAATC AACTACAGCC AATCAATCGG ATGTGGATGA 
TTTTAATACA CTTTATCACG CCTTTTACAC AGATAGCAAT AAAACGCCTT TGAAAAATAC 
CCAGTTTGAT AAACTCAGTC AACTCAAGAC TTTACTTGAT AAGCTGGAAG GTAGTCGTGA 
ACATACCCTT GCCAAATCTA AATATCATAG TCTAGCAACG CAAATCAAGG CTATTCAAGA 
TGTCAATGCT CAATTTGACA AACCAGCTAT TGTGGATCGT CTGTTGGATA CCAATGCCAA 
AGCCAAATCG GATCCTAAAT TTACGCATAT TAAAACTGGA AATACGGAGC TTGATAAACT 
GCTAGATAAG GCTATCAGTC TTGGTAAGAG CCAGCAAACA AGTACTTCTA GCTCAAGTTC 
AAGTCAAACT AGCAGCTCAA CTTCAACTCA ACCAAGTTCA AATACCACTA GTCAGCCAAA 
ACCAAGTAGT TCAAATGAGA CTAGAAGTAG TCGCAGTGAA GTCAATATGG GTCTCTCGAG 
TGCAGGGGTT GCTGTTCAAA CAACTGCCAG TCGTGTTGCC TATAATCAGT CTGCTATTGA 
TGATAGTAAT AACTCTGCCT GGGATTTTGC GGATGGTGTC TTGGAACAAA TTCTACCGAC 
TTCACGTTCA CGTGGCTATA TCACTGGAGA CCAATATATC CTTGAACGTG TCAATATCGT 
TAACGCCAAT GGTTATTACA ACCTCTACAA GCCAGATGGA ACCTATCTCT TTACCCTTAA 
CTGTAAGACA GGCTACTTTG TCGGAAATGG CGCTGGTCAT GCGGATGACT TAGATTACTA 
ACCAGTCGG 

(2) INFORMATION FOR SEQ ID NO: 43: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9713 base pairs 



I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3369 
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CCGCGAAAGA TATTTTTGAA CAAGACTTTG GACGTGAGGT Ce©9G* • TT AT AATAAAGTAG 60 

AAGTTGACGA GTTTTTAGAC GATCTCATCA AGGACTATCA AACCTATGCT GCCTTGGTCA 120 

AGTCACTTCG TCAGGAAATT GCGGATTTGA AGGAAGAATT AACTCGTAAA CCGAAACCTT 180 

CACCAGTTCA ACCAGAACCC CTTGAAGCGG CAATTACAAG TTCTATCACG AATTTTGATA 240 

TTTTCAAACG CCTGAATAGA TTGGAAAAAC AAGTTTTTGG TAAACAAATT TT AG AT AACT 300 

CACATTTTTA AGTAGTTATT TGAGATGTGC AATTTTTCGA TAATCGCGTG AGG AG AATTG 360 

TTTCTCATGA GGAAAGTCCA TGCTAGCACA CGCTGTCATG CCTGTAGTGT TTCTGCTAGG 420 

CGAAACCATA AGCCTAGGGA CGAGAAATCG TTACGGCAGT TGAAATGGCT AAGTCCTTGG 480 

ATAGGCCAGA GTAGGCTTGA AAGTGCCACA GTGACGGAGT CTTTCTGGAA ACAGAGAGAG 540 

TGGAACGCGG TAAACCCCTC AAGCTAGCAA CCCAAATTTT GGTCCGGGCA TGGAGTACGC 600 

GGAAACGAAC GTAGTATTCT GACTCCTATC AGCTAGAGCT GTTAGTGGTA GACAGATGAT 660 

TATCGAAGGA AGTGGTCCTA GTCACTTCTG GAACAAAACA TCGCTTATAG AAAATTGCAT 720 

ATAGGTTGGG GCTGAGAAAT TTTCTCAACC TCATTTTTTA AAGTGGACAT ATAGAAAGGT 780 

CTTGCAAGAC TGTAACATGA AAAAACAATT TAATTTAATT GCAACTGTGG CAGCAGGCCT 840 

TGAGGCTCTC GTTGCTCGTG AAGTCCGAGA GTTGCGCTAC GATTGTCACC TTGAAAATGG 900 

ACCTGTTCGT TTTCAAGGAG ACGTG AC AG C TATTATCGAA ACCAACCTTT GGCTTCGGGC 960 

ACCAGATCCT ATCAAAATTA TCGTAGGAAC GTTCCCAGCT AAGACTTTTC AAGAGCTATT 1020 

TCAGGGAGTT TTCGCTTTGG ATTGCGAAAA TTATTTACCA CTTGGACCTC GCTTCCCCAT 1080 

TTCAAAAGCT AAATCTCTTA ACTCCAAACT TCACAATGAG CCCAGTGTTC ACGCTATTTC 1140 

TAAGAAAGCT GTTCTCAAGA AATTCCAGAA ACACTATGCT CGCCCAGAAC GGCTTCCTCT 1200 

GATGGAGAAT GGCCCAGACT TTAAGATTGA CCTCTCTATT CTCAAAGATG TGGCAACTGT 1260 

CATGATTGAT ACGACCGGCT CTAGCCTCTT TAAACGTGGT TATCGTACCC AAAAAGGTGG 1320 

CGCTCCTATC AAGGAAAATA TGGCAGCAGC CATTTTACAA CTTTCTAACT GGTATCCAGA 1380 

CAAGCCTTTG ATTGATCCGA CCTGTCGTTC GGGGACTTTC TGTATTGAGG CAGTTATGAT 1440 

TGCTAGAAAG ATGGCGCCAG CTCTTCGTCG CTCTTTTGCA TTTGAGGAAT GGAACTGGAT 1500 

CAGCCATCCC TTGATTCAAG AAGTGCGCAC AGAAGCGGCT AAAAAAGTAG ACCGTGAGCT 1560 

TG AG CTGG AT ATCATGGGCT GTGATATTGA TGCTCGCATG GTGGAAATTC CTAAGGCCAA 1620 

TGCTCAGGTA GCTCGTCTTG CAGGAGACAT TACTTTTAAG CAGATGCGCG TGCAGCATTT 1680 

ACGTTCCGAT AAAATCAATG GAGTAATCAT TTCCAATCCG CCTTATGGTG AACGTTTGTC 17 40 

AGATGATGCA GGCGTGACCA ACCTCTATGC TCAGATGGGG CAAGTATTTG CACCGCTGAA 1800 
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AAATCCTCCT CATACTCCTT TGCTCAGGCT AGCGTCTTTC AACATTCAAC ACTCAACTGG 

TTGATCTACT TGATAGGAAG GGAGTTACTA TAAAATACTC AGGCTTCCAT CATATTTTTT 

GAAACGATTG TGTAATCAAA ATGTACCAAT ATTGTAGTAT TCCTACAGAA GATCTTGTCA 

ATGGATAAAT ATATCATAAC TCCTATCTCA AAAAGATTTC ATATCTCTGT GCATATATAA 

TAGACTTCCT GCAAAACTAG AATCCTAGTT CATGATTGAT AATACCAGCA ATCAAATTCA 

TTCGTAATCC AAAGCGTTTA CGATGATTTC GATAGGTTGT TGAAAACATT TTAAACGTTT 

CTACTTTGGC AAAGATGTTC TCAACCTTGC TTCTCTCCTT AGATAGCCCA TCCTTATAGC 

CTTTATCTTC AGCTGTTAGC GGCTTGACTT TGCTGGATTT ACCTCCACTT TGTGCTTGAG 

GACATATCTT CATGAGCCCT TGATAACCAC TCTCAGCCAA GATTTTACCA GCTTGTCCGA 

TATTTCTGCA ACTCATTTTG AACAACTTCA TATCATGACA ATACTTCACA GTGATATCCA 

AAGAAACAAT TCTCCCTTGA CTTGTGACAA TCGCTTCAGC CTTCATAGCC TGAAATTTCT 

TTTTACCAGA ATCATTCCCT AATTCTTTTT TTAGGGCGAT TGATTTTTAC TTCCCTCGCA 

TCAATCATTA CCGTGTCCTC AGAACTAAGA GGAGTTCTTG AAATCGTAAC ACCACTTTGA 

ACAAGACTTA CTTCAACCCA TTGGCTCCGA CGGATTAACT TGCTTTCGTG AATACCAAAA 

TCACCCGCAA TTTCTTCATA ACTGCGGTAT TCTAGGCTTA ATTTAGGTTT TCGTCCACCT 

TTTGCGTGTT TAAGTTCATA AGCTGTTTTT AATACACCTA ACATCTCTTT AAAAGTCGTG 

CCCTGAACAC CAACAACACG CTTAAATCGT GTATCAGTTA ATTGTTTACT TGCTTCATAA 

TTTCGCACGG AGTCTATTGA CTCTTTGGTA GGTCTCAATC TTTTTTTCAT CTATCCCGAG 

AATTATTTTC CCGCCATTTG TATTTGCAAA TGCTGAGTAG GTTTCCCAGA AAGACTCTGC 

AAGATTGTTT TTAGCTTTTT TCTATTCTAA ATCAACCCCT TCAAATTTTA ACTCCATATT 

TTTCCTTTAC ATCTGTTTTT TGTGGTTCTG GTATTTGTTC AAGTTGAGTC ATAATATAGC 

GAATTCAATT TCGAGACTTT TTACTCAGTT AATTTCTTTT TTAACCCACT TTAATTGCTT 

TTTTAACACG GGTTAAAAAA GAAATTAAAG TGGGTTAATT TTTCTTGA 

(2) INFORMATION FOR SEQ ID NO; 42; 

( i ) SEQUENCE CHARACTERISTICS : 
^ (A) LENGTH: 3369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



6S20 

8S80 

6640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9S40 

9600 

9660 

9720 

9780 

9828 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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TACACATATA 


CGGTCTCAAA 


CAAACAATTA 


TTCTTAAAAT 


CCGTCTCATG 


TTTATTTAAT 


6720 


CCCATCAAAA 


AACGGATTTT 


ATTGCCAGAT 


GGCGATATAA 


AGCAGATAAA 


ATCTGTTTCT 


67B0 


AACCTTCTAT 


ATGCTGCTGC 


TTATGCTTTC 


TCCCATTCAA 


CTTTTTTAGC 


TGAAACGGAT 


6840 


GAAAATATTA 


GCTATGTCAT 


ATGGCAGAGA 


AAATTCAATC 


AGTTATCCTT 


GC C ACTTTCT 


6900 


CAGCATCTTT 


TAAAATGAAA 


GATGCTAGAG 


ATATGGAAAT 


ATCGTCCTTT 


TGTATCTGAG 


6960 


TTTTGGAATG 


ATTTTAAAAA 


TAATCATGAT 


AAACAATTTG 


TAGATCCGAT 


TTCTCTTTAT 


7020 


TTCACCTTAA 


AAGATGATGA 


TGACCCACGT 


ATAGAGGAAG 


AGAGTGAAGC 


ACTAGAAAAT 


7080 


ATGATATTAC 


AGTATCTGGG 


AGAAGATCAT 


GCCAGCTAAT 


AGGAAACTTA 


TTTTTCAAGA 


7140 


AATCTTTGCG 


CATTTTCACA 


ACTATTATGT 


TCTGATTGGG 


GGAACTCCTA 


CCTCTATCGT 


7200 


ATTGGATTCG 


CAACGATTTA 


AAAGTCGCAC 


AACAAAAGAT 


TATGATATGG 


TCATCATTGA 


7260 


TGAAGTAAAA 


AATAAGGAAT 


TTTATACTAC 


CTTGAATCAT 


TTTTTAGAAT 


TGGGAGAGTA 


7320 


TCAAGGAAGT 


CAGAAAGATG 


AGAAAGCGCA 


GCTTTTTCGA 


TTTACAACAA 


CTAATCCTGA 


7380 


GTTTCCTTCT 


ATGATTGAAC 


TATTTAGTAT 


CTTACCAGAA 


TATCCATTAA 


AGAAGCACGG 


7440 


TCCAGAAATT 


CCCTTACATT 


TTGACCAAGA 


TGCTAGTTTA 


TCAGCCTTAT 


TATTGGATGA 


7500 


AG ATT ATT AT 


AATATATTGG 


TGCATGAAAA 


AGAAACCATT 


CAGGGCTATT 


CGGTATTGAG 


7560 


TAATTGTGGT 


TT AT ACTCTT 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCC 


ATCTACAACC 


7620 


TCAAAACAGT 


GTTTTGAGCA 


GCCTCCAGCT 


AGCTTC CT AG 


TTTCCTCTTT 


GATTTTCATT 


7680 


G ACT ATTAAT 


TATTTTTAAG 


CCTAAAGCTT 


GCCTGGATAT 


GAGGGAGCGC 


TCTGCCACAG 


7740 


GTGCTCAAGC 


TTTAAGTAAG 


TCCATTAAAA 


AGCATTTGAA 


TGACCTTACC 


CGTTTGACAG 


7800 


CTTCCTTGCT 


AGGAGATGAA 


AAGTTATCGG 


CTATAACATC 


AAGTAGTGCG 


GTAAAAGCAG 


7860 


ACATGCACCG 


CTTTCTGATA 


GAATTAGAGC 


CTCTGAACTC 


AACTATTCTT 


CAAAATAATG 


7920 


ACATTTCATT 


GGATCAAAAT 


GAAATTTTTG 


AAATTCTGAA 


AAATTTTCTC 


GATGGTTAAA 


7980 


ATAATTGTAG 


CGAGATGGCT 


ATATTGAATT 


CGTCTAtATC 


TGGAAACTAG 


AAAAAACTTC 


8040 


AATTTCAGGA 


GAAAATGAAC 


TCAATCTTCC 


CACAATCAAA 


CGTATAGTAT 


CAAGGTTTTT 


8100 


CAAGACCTGA 


TATTATGCGT 


TTTTTGCTTT 


TCAAAACTTT 


TTGCCCAGTC 


TTCGTTTTTA 


8160 


TCCTCTAGTC 


ACTTGATTTG 


TTTCACGTGG 


TTTTTTAGTA 


TAGTAGAATG 


AAACGAGAAC 


822C 


AGGACAAATT 


GATCAGGACA 


GTCAAATCGA 


TTTCTAACAA 


TGTTTTAGAA 


GC AG AAGTGT 


8280 


ACTATTCTAG 


TTTCAATCTA 


CTATAGTTAA 


ATCTGCGGTC 


AAGTCTACTG 


; GTCAATCTAT 


8340 


GATTGTAATA 


CTCTTCCAAA 


ATCTCATCAA 


CCACGTCAGT 


CTTCCCTTGC 


: AGTCTGTATC 


8400 


TT ACT C ACC A 


AGCT AGTGAT 


GGATTTAGAA 


TAGGTGATTT 


GGAGCGTCCT 


' ATT AGCT AGG 


8460 



WO 98/18931 



PCT/US97/19588 



401 

AGGAAATCAG AGAAAATGCC AATAGAACTC ATCTTGAAGT AGAAGATATT ATAAAATTGG 4 980 

GTCAGTTTTA TGGTATCAGT CATAAAGCTA TGTTATATAG ATTGAGGAAT CATGGATACC 5040 

TTGATGCAGA AGAAATTAAA AATATGGATA TTAGTCTTAT AGAGACAGCT TCAAGATTAG 5100 

GCTATGATAC AAGTTTATAT CGTCCTTTGT CAGAAAGTAA AAAAGAAATG GCATTAGGAT 5160 

AATATATTAA TTCAACTGAA CAACTTTTAC AAAATAACAG AATTTCGCAA GGCAAGTATG 5220 

AGGAACTGTT ACTAGATGCT TTCAGATATC ATATTGTATA TGGGCTAGAT GAACAGGGGG 5280 

GACTTGTCGT TTGACTAGTC GTGTATTTAT TGATGCAGAT TGTATTTCAG TATTTTTATG 5340 

GGTTGGCACT CAACATCTTT TAGAAAAGCT CTATTTGGGT AAAATTGTTA TTCCACAAGA 5400 

GGTCTATGAT GAAATCAATA TACCTACAAT TCCCCATTTA AAATCTAGCA TAGATCAGTT 5460 

GCTAGCTAAG GGTTCAGCTC AGATTGTGAG CATACACATT GGAACTCAAG AATACGCATT 5520 

ATATAGAGAT TTAACAAGAA ATCATGATAG TAACAAGATT ATTGGTAAGG GAGAAGGGGC 5580 

ATCTATTTCC TTAGCGAAAA AGCATAATGG GATATTAGGA ACTAATAACC TAAGAGATGT 5640 

TAAATCATAT GTAGAAGAAT TTTCTTTAGA ATATATGACA ACAGCACATA TACTGATTCA 5700 

ACCGTTTAAA G CGT AATTT A TTACTGAATA AGAGGGCAAT C AT ATCTGG A ATAATATGCT 57 60 

TAAAAAGAGA AGGAAAATTG GTGCAAATTC ATTTTCAGAC TATCTTCGTG GAAGTATTCA 5820 

TCAAAATAGA CAAAAATAAA TTTGGATAAA TCGAACTCAC TATTCACGAG GCATATGAGC 5880 

AATTCGAAAA AGAAAAGTGT CAAATTGAGC CTATACGAGT AGAAGTGAAA TAGTAAGTCC 594 0 

TGCATACTGG ATGAGAGAAA AGTTCTCCTT GAA G TTTTCC TGAACTATCA CTCCCATCTC 6000 

AAACGATATG TAGCGTAATG TGAGAGGGGA TAGCGAGTAG TTTTTGGTTA TTTTATCAAA 6060 

AAAC7TATAT TTTATTATAC CCAATCAT.XA AAT.VTAATAA AAATGATA3A ATAACGAAAA 612C 

AACATGAATG TCAAAAAGAT AATGTCAATT TTTCAATCCT TTTATGTTCA TGTCAGTATT 6180 

GAGCAACTGA CTTTGACTTT ACCAATCACT TTTGTAAAAA GCTTTGAGTA TACTCAAATG 6240 

A C T TTT CATA AGGAATCATT TTTATTGATT AAAGAAAAGA GAAGGGGGAG TTTGAGTTCA 6300 

TTTGTTACTC AGGCTCGCAC TATGGGTCAA AAAGCCAATA TGGATGTTCT TTTGGTGTTT 6360 

TCCAAGTTAT CAGACAGTGA AAAAAAGCAA TTACTTCAAG CTAGAGTTCC GTTTGTAGAC 6420 

TTTAAGGGAA ACCTCTTCTT CCCTCCATTG GGACTAGTAC TCAATGCGAA TCATACTGAA 6480 

GTCCCTAAGG AATTAACACC TAGCGAACAA TTAACGTGGA TTGCCTTTTT ATTGACAAAA 6540 

CCTCAAAAAG TACTACATGT TGATTTGCTT TCACAAGTCA CTGGACTTCC AAACTCAACA 6600 

ATTTATAGGT GTTTGAGGAC TTTTAAAGCT TTATATTGGT TAAACAAGCA AAATAAGCTT 6660 
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TTTGCCTCAA 


GTCGGCGTGC 


AACCATATAG 


GCAAGTACAC 


CCACACTCAA 


CTGGGAATCC 


3180 


CCATTCAACC 


GTACAGAATG 


AAGCGGATTA 


TCACTGCCCG 


AAAGACCCAG 


AATTAGCTTT 


3240 


TTTCTCGCAC 


ATTGTTGGGA 


ACGGTTGCAT 


CATGCAGGTA 


GGACCTGTTG 


ATAATGGTGC 


3300 


CTCGGACGTT 


GGGGGCGGTT 


GGAATGCTGA 


GACCTATGCA 


GCGGTTGAAC 


TGATTGAAAG 


3360 


CCATTCAACC 


AAAGAAGAGT 


TCATGACGGA 


CTACCGCCTT 


TATATCGAAC 


TCTTACGCAA 


3420 


TCTAGCAGAT 


GAAGCAGCTT 


TGCCC AAAAC 


GCTTGATACA 


GGGAGTTTAG 


CTGGAATTAA 


3480 


AACGCACGAG 


TATTGCACGA 


ATAACCAACC 


AAACAACCAC 


TCACACCACC 


TTGACCCTTA 


3540 


TCCATATCTT 


GCTAAATGGG 


GCATTAGCCG 


TGAGCAGTTT 


AAGCATGATA 


TTGAGAACCG 


3600 


CTTGACGATT 


GAAACAGCCT 


GGCAGAAGAA 


TGACACTGGC 


TACTGGTACG 


TACATTCAGA 


3660 


CGGCTCTTAT 


CCAAAAGACA 


AGTTTGAGAA 


AATCAATGGC 


ACTTGGTACT 


ACTTTGACAG 


3720 


TTCACGCTAT 


ATGCTTGCAG 


ACCGCTGGAG 


CAAGCACACA 


GACGGCAACT 


GGTACTGGTT 


3780 


CGACAACTCA 


GGCGAAATGG 


CTACAGGCTG 


GAAGAAAATC 


GCTGATAAGT 


GGTACTATTT 


3840 


CAACGAAGAA 


GGTGCCATGA 


AGACAGGCTG 


CGTCAAGTAC 


AAGGACACTT 


GCTACTACTT 


3900 


AGACGCTAAA 


GAAGGCGCCA 


TCGTATCAAA 


TGCCTTTATC 


CAGTCAGCGG 


ACGGAACAGG 


3960 


CTGGTACTAC 


CTCAAACCAG 


ACGGAACACT 


GCCAGACAAC 


CCAGAATTCA 


CAGTAGAGCC 


4020 


AGATGGCTTG 


ATTACAGTAA 


AATAATAATG 


GAATGTCTTT 


CAAATCAGAA 


CAGCGCATAT 


4060 


TATTAGCTCT 


TGAAAAAGCT 


TAATACTATG 


CGTTTTCTTG 


TGGAGATATT 


TCCTTCAATT 


4140 


TTGCTACTAT 


ATTAAACAAA 


AATCAAAAAG 


CAAACTAGAA 


ACTTATGCTC 


AAATAAAATC 


4200 


TAAATTTGAC 


AATCTAAACC 


GAGTCGGATA 


GCTTTAAGTA 


CTGTTTTCAG 


GTTGAAGATA 


4260 


CGATTTTTGA 


TAGGAACTCA 


TCAATTTTAG 


ATTTTTAACC 


AGCATCAATA 


AATTGCTTCC 


4320 


TTCTTTTGTC 


ATAATTTTTT 


TATTTAAAAA 


ATTATCACma 


GAGTGTGCTA 


TTCTTTTTAT 


4380 


GAGAGCTGTA 


TGAATATGAT 


AAATGTATGT 


GATAAATGTA 


TGTGATGTTG 


GAAAAACAAT 


4440 


AAAAGAACTT 


AGAATATCTT 


CAAATCTTAC 


TCAAGATAAG 


ATTCCTGACT 


ATTTGTC TIT 


4500 


GAATCAAAGC 


ATGATTGCCA 


AAATGGAAAA 


AGGTGAAAGG 


AATATCACGA 


ATGGATTTAA 


4560 


CTAATAAAGC 


TTCAAATCTT 


AGAAAAAAGT 


TGGGAGCTGA 


TGGTGAATCG 


CCGATAGATA 


4620 


TTTTTAAATT 


GCTACAAAAG 


ATACAAAATT 


TGACGCTGGT 


ATTTTATGGA 


CTCGGAAAGA 


4680 


ATATTAGCGG 


ACTCTGTTAT 


AAACGAACTC 


AGTTCAGTCT 


CATTGCAGTC 


AATTCAGACA 


4740 


TGCCATTAGG 


AAGGTAAAGA 


TTTTCTTTAG 


CACATGGACT 


GTATCATCTT 


TATTATGATG 


4800 


AGGTGAAGAA 


GAGTTCACTC 


AGTCTTATCT 


TGATTGGTGA 


AGGAGATGAA 


ACTGAAAGAA 


4860 


AAGCGGATCA 


CTTTGCTTCT 


TATTTTTTAA 


TTTTCCCATC 


TTCACTGTAT 


AGGATGGTTG 


4920 
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1 LuL 1 v. i i. e\n 




TCCAATTTTG 


CTATCAAATA 


TTTTTCAACA 


1440 


GCTCTATAAU 


AL 1 v>V- lun i Vj 


^"**T*^^^"^ & TfT*fi.T* 


TGGACCATTT 


CTTGGTCAAG 


AATCCTTGGC 


1500 


TCCAGTAGGA 


GCGAWOA(.Avj 


UuA ill! L\*M 


rCTGATTGTA 


GGTTTTACAC 


TTGGTGTTGG 


1S60 


CAATGGCATG 


GCCAHVILA 


TTGCTCGTTA 


TTATGGCCCT 


CGGAATTTCA 


CTAAAATCAA 


1620 


GGAAGCAGTA 


GCAGCCACCT 


bViAl i I 1AW» 




AGCATTCTAG 


TTATGTTGCT 


1680 


GGGCTTTCTT 


GGCTTGTATC 


i_ i v. itrnucA 


A I AL 1 1 HwA I 






1740 


TCAATCTTAT 


CAATATATTT 


CTATGATTGT 




WO iu i ^nljW 1 




1800 


TCTTTTTCCA 


GCCTTGTTGC 


GGTCTATTGG 


TbALAblV. 1 A 


Ol-/\va\_l_l_ 1 Oo 


u/\ 1 I i v» i \»n i 


1860 


TTTCTCTGCC 


TTGGTTAATG 


jn_— l _r~ jFirnJ^lTiJ — J~" ft. 

TGGTTCTGGA 


TCTCTATTTT 


AllACGCAAT 




1920 


AGTTCAATCC 


GCAGGACTTG 


CTACCATTAT 


I [CGCAACKii 


i l A rwAVjv>uu 


TTCTCTGCTT 


1980 


TT ATT AT ATT 


CGTAAAACTG 


TGCCAGAACT 


CTTGCCACAG 


TTTAAACATT 


TLAAATuIjUA 




CAAAAGCTTG 


TACGCGGATC 


TCTTGGAGCA 


AGCTTTGGCT 


ATGGGCTTGA 


TuAu l IA.AA 1 


? i fin 


TGTATCTATC 


GGCAGTGTGA 


TTTTACAGTT 


TTCTGTTAAT 


ACAi I iwiu 


AO 1 t»M 1 I A 1 


2160 


TAGTGCCCAG 


ACGGCAGCTC 


GACGCATTAT 


GACCn i otC 


CTTCTTCCTA 


Tr.arrrrTAT 


2220 


TTCTGCATCA 


ATGACGACCT 


TTGCTTCTCA 


CAATCTAGGA 


gl iaaccgal 


CTI t»ACt.t* I AT 


4 A O U 


TGTTCAAGGT 


CTTCGAATCG 


GCAGTCGTTT 


ft ft i^#f* ft #¥* Ik VT*^*^* 

AAGTATATCC 


TGGGCAGTTT 


TTGTTTGTAT 


2340 


TTTCCTCTTT 


TTTGCCAGTC 


CAGCTTTGGT 




GCTAGTTCGA 


LAuATUui 1A 


54 Art 


CTTGATAGAA 


AATGGAAGTC 


T CT ATCTGC A 


AATC AGTT tA 


ft #«MM*MM« ft^»/"» 


CCATTTTGAG 


2460 


CCTCTTGTTG 


ATTTATCGCA 


ATTGCTTGCA 






1 i V»V, 1U1 AO 1 


2520 


TTCTAGCTTT 


ft WWU^ ft ft ft ft 

ATTGAACTAA 


TCuGAAAAA > 


CGTTTTTGTG 




TTCCTTGGGC 


2580 


AGGATATAAG 


GGTGTTATCC 


iimiwi'/ '»T»f»a i/T 


T*f^T*T* » TV^T*^^** 


Oil OV.*»-r% 1 Ort 


CAGTTCAACT 


2640 


GTACTTCTCA 


TTATTCCGTC 


ATCCCTTGAT 


AAAAGAAGGC 


AAGGCAATCT 


TGGCAACCAA 


2700 


ACTCCAATCC 


TAGTTGGATT 


TACTGAATAA 


AATCCATTTC 


CTCTAGTGAA 


AATCCAAAAA 


2760 




TCTTCTTTAG 


TTTGGTCTTG 


AAAATAGTTT 


AACAGACTTT 


TCACTTCTTT 


2820 


TATATGATAT 


AATAAAGTAT 


AGTATTTATG 


AAAAGGACAT 


ATAGAGACTG 


TAAAAATATA 


2880 


CTTTTGAAAA 


TCTTTTTAGT 


CTGGGGTGTT 


ATTGTAGATA 


GAATGCAGAC 


CTTGTCAGTC 


2940 


CTATTTACAG 


TCTCAAAATA 


GTGCGTTTTG 


AAGTTCTATC 


TACAAGCCTA 


ATCCTGACTA 


3000 


AGATTGTCTT 


CTTTG T AAGG 


TAGAAATAAA 


GG AGTTTCTG 


GTTCTGG ATT 


GTAAAAAATG 


3060 


AGTTGTTTTA 


ATTGATAAGG 


AGTAGAATAT 


GGAAATTAAT 


GTGAGTAAAT 


TAAGAACAGA 


3120 



/ 
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CTTTTAACGA ATGGGGCAAA AAGATGCGAT TGGTCCCAGA TCCTTCAGAC GTT 
(2) INFORMATION FOR SEQ ID NO: 41: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9828 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



GTGAAGTGCG 


GCAAAAGCTG 


CAAGTGATGA 


GCTCAGGTTC 


TTTAGCTCTT 


GACATTGCCC 


60 


TTGGCTCAGG 


TGGTTATCCT 


AAGGGACGTA 


TCATCGAAAT 


CTATGGCCCA 


GAGTCATCTG 


120 


GTAAGACAAC 


GGTTGCCCTT 


CATGCAGTTG 


CACAAGCGCA 


AAAAGAAGGT 


GGGATTGCTG 


180 


CCTTTATCGA 


TGCGGAACAT 


GCCCTTGATC 


CAGCTTATGC 


TGCGGCCCTT 


GCTCTCAATA 


240 


TTGACGAATT 


GCTCTTGTCT 


CAACCAGACT 


CAGGAGAGCA 


ACCTCTTGAC 


ATTGCCGGAA 


300 


AATTGATTGA 


CTCAGGTCCA 


GTTGATCTTG 


TCGTACTCGA 


CTCAGTTGCT 


GCCCTTCTTC 


360 


CTCGTGCGGA 


AATTGATGGA 


GATATCGGAG 


ATAGCCATCT 


TGCTTTGCAG 


GCTCGTATGA 


420 


TGAGCCAGGC 


CATGCCTAAA 


CTTGGCGCCT 


CTATCAATAA 


AACCAAAACA 


ATTGCCATTT 


480 


TTATCAACCA 


ATTGCGTGAA 


AAAGTTGGAG 


TCATGTTTGG 


AAATCCAGAA 


ACAACACCGC 


540 


CCGGACGTCC 


TTTGAAATTC 


TATGCTTCAG 


TCCCCTTGGA 


TGTTCGTGCT 


AATACACAAA 


600 


TTAAGGGAAC 


TGGTGACCAA 


AAAGAAACCA 


ATGTCCCTAA 


AGAAACTAAG 


ATTAAGGTTG 


660 


TAAAAAATAA 


GGTAGCTCCA 


CCGTTTAAGG 


AACCCGTACT 


TGAAATTATG 


TACCGAGAAG 


720 


CAATTTCTAA 


GACTGGTGAG 


CTTTTGAAGA 


TTGCAAGCGA 


TTTGGATATT 


ATCAAAAAAG 


780 


CAGGGGCTTG 


GTATTCTTAC 


AAAGATCAAA 


AAATTGGGCA 


ACGTTCTGAC 


AATGCTAAGA 


843 


AATACTTGGC 


AGAGCACCCA 


GAAATCTTTG 


ATGAAATTGA 


TAAGCAAGTC 


CGTTCTAAAT 


900 


TTGGCTTGAT 


TGATGGAGAA 


GAAGTTTCAG 


AACAAGATAC 


TGAAAACAAA 


AAAGATGAGC 


960 


CAAACAAAGA 


AGAAGCAGTG 


AATGAAGAAG 


TTCCGCTTGA 


CTTAGGCGAT 


GAACTTGAAA 


1020 


TCGAAATTGA 


ACAATAAGCT 


GTTAAAGCAG 


TGCAGAAATC 


CGCTACTTTT 


TCGATTTTTG 


1080 


ATTCAAGTTT 


TTAGATTATA 


TATAGTAGCT 


1GAAATAAGA 


TATGAACAAC 


TCTATTAGGA 


1140 


AAGTCAAATT 


AATTTCTAGA 


AATGTTTTAG 


CAGCTACAGC 


GTACTATTCC 


AAACTCAACC 


1200 


AACTATAATA 


GATCGAAACT 


AGAATAGTAC 


ATATCTACTT 


CTAAAACATT 


GTTAAAAATC 


1260 


GATTTGACTT 


TCCTTATTTC 


ATTCCGCTAT 


ATATAGTTTG 


CTCTTTCTTC 


TCCCTCCTCT 


1320 


GGAAAGCTGA 


TATAATACCT 


TTATGAATAA 


AAAACGAACA 


GTGGACCTGA 


TACATGGTCC 


1380 
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CCACGAGAAT GATTCTGAGT GGAAAGGTCT TAAATCATGT CTAACAAAAA TAAGGAAATT 12540 

CTGATTTTTG CGATTCTCTA TACAGTCCTC TTTAT G TTTG ATGGCGTTAA ATTGCTGCCT 12600 

TCTTTAATGC CATCTCCCAT TGCAAATTAT CTTGTTTATG TAGT TT TAGC TCTATATGGC 12660 

TCCTTCTTGT TCAAGGATAG ATTGATC C AA CAATGGAAGG AGATTAGAAA CACTAAAAGA 12720 

AAATTCTTCT TTGGACTCTT AACAGGATGG CTCTTTCTCA TTCTGATGAC TGTTGTCTTT 12780 

GAATTTGTAT CACACATCTT GAAGCAGTTT GTGGCACTAG ATGGACAAGG TCTAAATCAG 12840 

TCTAATATTC AAAGTACCTT TCAAGAACAA CCACTACTGA TAGCT G TTTT TGCTTGTGTC 12900 

ATTGGACCTC TGGTACAAGA ATTATTTTTC CGTCAGGTCT TATTGCATTA CTTGCAGGAA 12960 

CGGTTGTCAG GTTTACTAAG CATTATTCTG GTAGGACTTG TTTTTGCTCT GACTCATATG 13020 

CACAGTTTGG CTCTATCAGA GTGGATTGCT GCAGTTGGTT ACTTAGGTGG ACGCCTTGCC 13080 

TTTTCTATTA TTTATGTGAA AGAAAAAGAG AATATCTACT ATCCCCTACT TGTTCACATG 13140 

TTAAG CAAC A GCCTCTCCTT AATCATTTTA GCTATCAGTA TAGTAAAATC AAATGAGAAC 13200 

AGGACAAATC GATTTCTAAC AATGTTTTAG AAGTAGAGGT GTACTATTCT AGTTTCAATA 13 260 

TACTCTAATA TCTGATCAAA ATGCCACTAA TGATACCCAC AAAAAAGCTG AGAAACTTTT 13 320 

CCCAGCTTTA TTTGTTATAC TCAAAGAGAA TGACTTGTTC CTGTCCATCT ACATGAGCAT 13 380 

GGACCCCAAA GGGTACAATT GCTCTTGGAG TTGCGTGGCC GACATTCAGA TTATAGACAA 13 440 

TCGGGATATT GCTGTCAATG ATATCCAATA CTCCCTCTTT ATAGTCGTCA TGGAAAGTTT 13 500 

CATCCATAGG TTTTCCGACC AAGAGTCCAT TGATCACCCC GAATATGCCA GTGTCCTTTA 13560 

AAGTTACCAA CATCTTTTTG AAGTCTTCTG GCTTAGGCTT TTCTTCGCTT GTTTCGAGCA 13620 

AGAGGATTTT CCCTTCCCAC TCTCACX\CT CACGGAAAAG TTT^TAfTTT TCGCACACTT 13 580 

CCGTGCTATC TGCGTATCGA GAGTTGTCAA AGATATCGTA GAGGCATTCG ACGCAACCAC 13 740 

CCAGGATTTT CCCCTCGAAC TGGGCACTTC CTTCCAACAA CTCAAAACCT GTATTTCTAT 13800 

GACTGACACG ACCTGTTCCC AGGCCCGTGG GACTAAAATC AGTTCGTTCC TCATACCAAA 13860 

CGTCACTAGG GCGGATTTCT GAAATTCTTC CCGTCTCAAT CAATTCTTTA AAGTAGTGAA 13920 

GGCTATAGGC TAGCATTTCT TTGTCTAATT CAC AAATGTC TGCTAAAAAG GATTGACCAT 13980 

AAAAACTCTT GATTCCTAAT TTATGCAACA TGAGGTGGTT CATGGTTGTA TCCGAGAAGC 14040 

CAAGAAAAAT TTTTTGCTTG ATAACCTTTT GGAGTTGGTC ATTTTCAAAA AGATAAGGTA 14100 

GCAAGCGATA GGTATCGTCT CCACCGATGG CACATAGGAT CATGTCGATG CTATCATCAG 14160 

AAAAGGCATG AATCAAATCC TCTGCACGAG CTTCAGCATG CTCCTTGATA AAGTCTAATC 14220 
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ATTATAGCAT CTATACAGGG AGGTGTTTTC CCGAAGTGAC AACAGGGTTC AACTGTTACA 1074 0 

TAAAGCGTCG CTCCGACAGG GCATTCTCTA CA G TTTTTAA CACCATTTCT CTCAGCATGT 10800 

GGGCCACCAA AAAACTCATG ATAACCTTGT CCGATAATGT GATTATCTTT TACAATAACT 10860 

CCCCCCACCA TAGGATTGGG ATTGACGTAA CCAGCCCCTT TTTGTGCCAG TTTTATTGCT 10920 

AATTTCATAT ATTTTGAATC GCTCATCTCG CTACCTCCAA AAAAATATAC CTTGAATAGG 10980 

GG ACT ACT C A AGGCATACAA AAGAAAACTT ATGCGATTAA CAAAAATGCT CTGAAATGAC 11040 

AAGTAATCAT TTCAGAGCAC GCAAAAAGCA CAAATATACT TTTATCTTCT TTCATCCAGA 11100 

CTATACTCTC GCCTTTGCAA TTTCACCAAA TCATGCCTTT CGGCTCGTGG GCTATACCAC 11160 

CGGTAGGGAA TTTCACCCTG CCCTGAAGAT AGTTATTCAA TTACAGATGA TTATAGTACT 11220 

TAATTTTGAA TATGTCAACA GATAAATACC GATTGTTTTT CATATACTGT ATTTGTGATA 11280 

ATCGATTCTC GCTCCTCGGA TAAAGAAAAT ATGATATACT AGATAAACGA AATAAGAGAG 11340 

AAGGAATACT ATGTACGCAT ATTTAAAAGG AATCATTACC AAAATTACTG CCAAATACAT 11400 

TGTTCTTGAA ACCAATGGTA TTGGTTATAT CCTGCATGTG GCCAATCCTT ATGCCTATTC 114 60 

AGGTCAGGTT AATCAGGAGG CTCAGATTTA TGTGCATCAG G TTGTCCCTC AGGACGCCCA 11520 

TTTG CTTT AT GGATTTCGCT CACAGGATGA GAAAAACCTC TTTCTTAGTC TGATTTCGCT 11580 

CTCTGGGATT GGTCCTGTAT CAGCTCTTGC TATTATCGCT GCTGATGACA ATGCTGGCTT 11640 

CGTTCAAGCC ATT C AAA CCA AGAACATCAC CTACTTGACC AAGTTCCCTA AAATTGGCAA 11700 

GAAAACAGCC CAGCAGATGG TGCTGGACTT GGAACCCAAC GTAGTAGTTG CAGGAGATCA 11760 

CCTTCCTGCC AAGCTCGCAG TGCAAGCAAC TGCTGAAAAC CAACAATTGG AACAACCTAT 11820 

GGAAGCCATG TTGGCTCTCG GCTACAACGC AACAGAGCTC AAGAAAATCA AGAAATTCTT 11880 

TGAAGGAACG ACAGATACAG CTGAGAACTA TATCAACTCG GCCCTTAAAA TCTTGGTCAA 11940 

ATAGGAGCAG AGAATGACAA AACGTTGTTC GTGGGTCAAG ATCACCAACC CGCTCTACAT 12000 

CGCCTATCAT GATGAGGAGT GGGGCCAGCC CCTCCATGAT CACCAAGTAT TGTTTGAGTT 12060 

GTTGTGTATG CAAACCTATC AGGCAGGCCT CTCTTGCGAA ACGGTACTCA ACAAACGCCA l2120 

AGCTTTCCGA GAACTCTTTC ATAGCTATCA AATTCACTCA GTCGCAGAGA TGACTGACAC 12180 

TGAATTGGAA GCCATGCTGG AGAATCCAGC TATCATTCGA AATAGAGCCA ACCTTTTTGC 12240 

TACACGCGCT AACGCCCAAG CCTTTCTACA GTTACAGGCA GAGTACCGCT CTTTTGATGC 12300 

CTATCTTTGG TCTTTTCTTG AGGGGAAAAC TGTCGTTAAC GATGTTCCTG ATTATCGCCA 12 360 

AGCGCCACCT AAAACACCCT TATCTGAGAA ATTAGCCAAA GATCTCAAAA AACGAGGCTT 124 20 

CAAGTTCACA GGCCCAGTCG CCGTATTGTC TTTTCTACAG GCTGCAGGGC TACTTGATGA 124 80 
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ATTCAAACTC 


TAAAAGCACT 


CTTATGATTA 


TCTGTATTCT 


TTTCAACCAT 


AGGTGAAAGC 


9000 


ATTAATTCAT 


TACCTAAACT 


TTCGCTCATA 


GGCATACAAA 


TTAATCCTTT 


GGCATAAGTA 


9060 


GCCATAAAAT 


TAACATTTTC 


TGTTGTAGCT 


GCTTCTGCAG 


AACAAATTAA 


GTCTCCTTCA 


9120 


TTTTCTCT AT 


CCTTCTCGTC 


TATAACAAGA 


ACAAGTCGTC 


CCTTCTGCAA 


TGCTTCTAAT 


9180 


GCTTCTTGTA 


TTTTTCGATA 


TTCCATTGAC 


TGATTATCCT 


TTCTGCTAAA 


ATCCATTTTG 


9240 


ATATAATAGT 


TCCTTAGATA 


TTTCTGATTT 


TGGAGAGTTA 


TCCATCACTT 


TTTGCACATA 


9300 


TTTACCTAAG 


ATATCATTTT 


CAAGATTTAC 


TCTACTCCCG 


ACTTGTTTAC 


TCTTAAGAAT 


9360 


GGTTTGTTCC 


AAGGTATGAG 


GGATAACAGA 


TACTGAAAAG 


TTTACTTTGG 


AG ACTTT AG C 


9420 


GACAGTCAGA 


CTAATGCCGT 


CAATTGTAAT 


AGATCCTTTT 


TCAACTATTA 


AATCTAAAAT 


9480 


TTCTTTTTGT 


GTGTTGATTT 


GATACCATAC 


AGCATTATCA 


TCTTTTTTTA 


TTGACGAGAT 


9540 


TTTTCCTGTA 


CCATCAATGT 


GTCCTGTAAC 


GACGTGACCC 


CCAAGTCCAC 


CGTTGACAGA 


9600 


TAAGGCTCTT 


TCTAGATTCA 


CCTCACTTCC 


ATCTTTTAAT 


AGAGTAAGAG 


CTGTTCGACT 


9660 


CCATGTTTCA 


TTCATTACAT 


CAACTGTAAA 


GGATTGATGA 


TTGAAATGAC 


TAACTCTAAG 


9720 


ACAGATACCA 


TTTACTGCTA 


TACTATCGCC 


TAAATGGATA 


TCCCTTAATA 


TTTTTGAGGC 


9780 


TT7AATTGAT 


AGTTTACAAT 


TACGACAGTC 


TTTCTCTATT 


CTTTCAACTT 


TTCCGATTTC 


9840 


TTCAATTATT 


CCTGTGAACA 


TGGATAAATC 


ACTTCACTTT 


CTATGAGATA 


GTCATTTCCT 


9900 


ATTTGAGAAA 


ATGCATAAGG 


TTTCAATCTA 


ATACCGTCAT 


TTGGCAAAGA 


AATACCTTCA 


9960 


CCTCCGACAG 


CAAACTTCGC 


ACTACCTCCA 


AAAACTTTTG 


GTCCAATATA 


TATTTTCAGC 


10020 


TCATCAACAA 


TTTGTTCTTC 


CAAAGCACTC 


CAATTCATTA 


GACTGCCCCC 


TTCTAGAACT 


10080 


AGGCTATCAA 


TCTGCATCTT 


TCCTACATGT 


TGCATT AAAC 


TCGATAAGTC 


TATATCATTC 


10140 


CCTTTTTTCT 


TTATGGAAAG 


TATTTCACAG 


CCATGATTTT 


GATATAGCTT 


CATTTTATTT 


10200 


TTGTCTTCAG 


AGGAAGTGGC 


AATGTAAGTT 


TTAATATCAT 


TTGCTGTTTT 


TACGATTTTA 


10260 


GAGGTAAGAG 


GAGTTCGTAA 


ATCTCTATCG 


CATATGATAC 


GGATAGCATT 


TTTCCCTTCC 


10320 


TCCAATCTAC 


ATGTCAGCAA 


AGGATCGTCT 


TGAATAACAG 


TATTGACTCC 


CACCATAATT 


10380 


CCACTAACAT 


GCTGTCGTAA 


CTGATGCACA 


TGCTTTCTTG 


CTTCTTCTTC 


AGTAATCCAT 


10440 


TTGGATTGAT 


■nvrTTTAGT 


GGCTATTTTT 


CCATCCATTG 


ACATTGCATA 


TTTCATAAAA 


10500 


ACATAGGGTA 


CATGCTCGGT 


AATATACTTT 


CTAAAACTTT 


TTATTAAGTT 


AAGACACTCA 


10560 


TTTTCTAAAA 


TTCCAACAGT 


AACTTGAAGA 


TTATTTTCCT 


CAAGTATCTT 


TACTCCTTTT 


10620 


CCAGATACAA 


TAGGATTACA 


GTCTAGGCTT 


CCAATGACTA 


CTCTTGTAAT 


ACCACTATCC 


10680 
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ATATGGAAAA 


GATGTTCCCA 


CGT ATTCAGG 


AAAATCACAC 


CAGTCTCCCT 


GAGTTGGGGA 


7200 


AATATTAAAA 


GTATAAAAAA 


CTCTGGGAAA 


AATTTTCAAA 


ATCAAAAAAA 


CGCATAAAAT 


7260 


CAGGTGTTCA 


AAAACCTTGA 


TTTTATGCGT 


TTTATCATGG 


AAATACTTAC 


TTCATTTTTT 


7320 


CCTAATTCTT 


TTCGAAACTC 


TTTTTAAACG 


ACGTCAGTTT 


TATCAGTAAT 


CTC AAAACAG 


73B0 


TCTTTTGAGC 


TAATTTTGCC 


AGT TTTCTCT 


GTAACATCGA 


AGTTGTGTTT 


TACCACTCTG 


7440 


CGACTGGTTT 


CCTAGTTTGC 


TCTATGATTT 


TCACAGAGCA 


TTAAATTGCG 


ATTTTGCCAA 


7500 


GTTTCTTTAT 


TCGTCTAAAA 


CTAGAGTCTG 


TTCTATGCGT 


CTAATGTACG 


AATCAGGTTG 


7560 


ACCATTTCAA 


TAGCTCCTTG 


TGCACACTCA 


GAACCCTTAT 


TTCCTGCTTT 


AGTACCAGCT 


7620 


CGTTCTATGG 


CTTGTTCAAT 


TGTATCTGTC 


GTTAGCACAC 


CAAACATAAC 


AGGAATTTCC 


7680 


CTATTTAAAC 


TGATTTGGGC 


GATTCCCTTA 


GATACCTCGC 


TACATACATA 


ATCATAATGA 


7740 


CTTGTATTCC 


CTCTAATGAC 


AGCTCCCAAG 


CAGATAATTG 


CATCATATTT 


TTTACTTTTT 


7800 


GCCATTTTTG 


ATGCAATCAG 


TGGTATTTCA 


AAAGCTCCTG 


GAACCCAGGC 


TACCTCTATA 


7860 


TCTTTCTCGT 


TTACATTCTC 


TCTTTTGAGA 


TTATCTAGTG 


CTCCACATAA 


TAATTTTGAA 


7920 


GTTATAAATT 


CATTAAATCT 


CGCTACAACA 


ATACCTATTT 


TAATATTCTT 


TGCTACTAAA 


7980 


TTACCTTCAT 


AAGTCTTCAT 


TTATTTTTCC 


TCCATATTTA 


AAATGTGACC 


CATTCCATTT 


8040 


TTCTTTCTTT 


CTAAATAAAA 


ACTATCGTAA 


CGATTGCCTT 


CTATTTCGAT 


TCATATTCTA 


8100 


CTGGAAATGG 


TAATTCCATA 


TTTTTCTAAC 


TGTTCAACCT 


TGTCAGGATT 


ATTTGTCAGT 


8160 


AAATGAACTG 


ACTGAACTCC 


CAGATCTTTA 


AGCATTTTTG 


CTCCAATATG 


ATATTCTCTT 


8220 


AAATCACCTT 


CAAAGCCTAA 


TGCAAGATTG 


GCATCAAGCG 


TATCCATCCC 


TTCATCTTGT 


8280 


AAATGATAGG 


CTTTTAATTT 


ATTGATAAGT 


CCAATTCCTC 


GTCCCTCCTC 


TCGCAAGTAA 


3340 


AGTAAGACAC 


CCCAACCATT 


CTCAACAATC 


ATTTTCATAG 


CCTTATCGAA 


TTGCTGTCCA 


8400 


CAATCGCAAC 


GTAAAGAGCC 


TAAAACATCT 


CCTGTTAAAC 


ATTCGGAGTC 


GACCCCACAT 


8460 


AATACATTGG 


CTTCATCCTC 


TATATTTCCC 


ATAATAAGAG 


CAACATGATG 


TTCCCCATTT 


8520 


ACTTTATCTA 


TATAGCTAAT 


TGCTTTGAAA 


TTACCGTATC 


TAGTAGGCAT 


ATTGACAGTT 


8580 


GAAACTCGTT 


CTACCAGCTG 


ATCATATACT 


TTTCTATATT 


CTTCTAATTC 


TTTGATGGTA 


8640 


ATTAGTGGAA 


TGTTGTCTTT 


TTTCGAGAAC 


TGAATTAAAT 


CATCTGTTCT 


CATCATTTTG 


8700 


CCATCATGAT 


TCATTATTTC 


ACAACATAGG 


CCACACTCTT 


TTAGTCCACC 


TAATTTTAAT 


8760 


AAATCAACAG 


TTGCTTCTCT 


GTGTCCATTT 


CTTTCTAGGA 


CACCACCTTT 


TTTTGCAATT 


8820 


AAAGGAAACA 


TGTGTCCTGG 


CCTGCGAAAA 


TCAGAGGGTG 


TTATATCTTC 


AGCTACACAC 


8880 


ATACGTGCGG 


TCAGTCCTCT 


TTCCTCGGCA 


GAAATACCTG 


TGCTCGTTTC 


TTTATAATCA 


8940 



WO 98/18931 



PCT/US97/19588 



393 



AGGAAGCTGG 


TCTCAAGAAG 


GTTCAAATCA 


CGGATAACGG 


TCATGGAATT 


CCCCACCATG 


S460 


AGGTGGAGTT 


GGCCCTGCCT 


CGCCATGCGA 


CCAGTAAGAT 


AAAAAATCAA CCAGATCTCT 


5S20 


TTCGGATTCG 


GACGCTTGGT 


TTTCGTGCTC 


AAGCCTTGCC 




X V 1 V* 1 1 /\U 1 VJ 


SS80 


TCTTGACTCT 


GTTAACGGCG 


GTGGATGGTG 


CTAGTCATGG 






5640 


GGGGTGAAGT 


TGAGGAACTC 


ATCCCAGCGA 


CTAGTCCTGT 


GGCAACCAAG 


GTTTGTGTGG * 


5700 


AGGATCTCTT 


TTTCAACACG 


CCTGCCCGTC 


TCAAGTATAT 






S760 


TGTCTCATAT 


CATTGATATT 


GTCAACCGTC 


TGGGCTTGGC 


\m.\-J\ I V.V- l\**\Kj 


nl 1 IL1 1 1 1 A 


5820 


GCTTGATTAG 


TGATGGCAAG 


GAAATGACGC 


GGACAGCAGG 


wnt 1 LAA 


TTCv.GL.CAAG 


5880 


CAATCGCAGG 


GATTTACGGT 


TTGCTCACTG 


CCAAGAAGAT 


vJAi iXjAAA I I 


^ ft ft ft ^fti^fm^v 

GAGAACrTCTG 


S940 


ACCTAGATTT 


CGAAATTTCA 


GCTTTTGTGT 


CCTTGCCTGA 


GTTGACTCGG 


GCTAACCGCA 


6000 


ATTATATCAG 


CCTCTTCATC 


AATGCCCGTT 


ATATTAAGAA 


CTTCCTGCTC 


AATCGTGCTA 


6060 


TTTTGGATGG 


TTTTCCAAGC 


AAGCTTATCG 


TTGGACGTTT 


TCCACTGGCT 


GTCATTCACA 


6120 


TCCATATCGA 


CCCTTATCTA 


GCCGATGTCA 


ATGTGCATCC 


lk X ft ft ft Ik 

AAL 1 AAGCAA 


GAGGTGCGGA 


6180 


TTTCCAAGGA 


AAAACAACTC 


ATGACTCTGG 


TTTCAGAAGC 


TATTGCAAAT 


AGTCTCAAGG 


6240 


AACAAACCTT 


GATTCCAGAT 


GCCTTGGAAA 


ATCTTGCCAA 


ATL7GACCGTG 


CGCAATCCTG 


6300 


AGAAGGTGGA 


GCAAACTATT 


CTCCCACTCA 


AAGAAAATAC 


GCTCTACTAT 


» ^ » ft ft K mmu* 

GAGAAAACTG 


6360 


AGCCGTCAAG 


ACCTACTCAA 


ACTGAAGTAG 


CTGATTATCA 




A ft ft ft M 

ACTGATGAAG 


6420 


GGCAGGATTT 


CACCCTGTTT 


GCCAAGGAAA 


CCTTGGACCG 






6480 


TGCATTTTGC 


AGAGAGAAAG 


CCTGCTAACT 


ACGACCAGCT 


AGAPrATCrA 


u i 1 Av*A 4 <■» 


6540 


TTGCTAGCAT 


CGATAAGGCT 


TATGACAAAC 


TGGAGCGAGA 


AGAAGCATCC 


AGCTTCCCAG 


6600 


AGTTGCAGTT 


TTTCGGACAA 


ATGCACGGGA 


CTTATCTCTT 


TGCCC AAGGG 


CGAGATGGAC 


6660 


TTTACATCAT 


AGATCAGCAC 


G CTGCTC AGG 


AACGGGTCAA 


GTACGAGGAC 


TACCCTGAAA 


6720 


GCATTGGCAA 


TGTTGACCAA 


AGCCAGCAGC 


AACTCCTAGT 


GCCCTATATC 


TTTGAATTTC 


6780 


CTGCGGATGA 


TGCCCTGCGT 


CTCAAGGAAA 


GAATGCCTCT 


CTTAGAGGAA 


GTGGGCGTCT 


6840 


TTCTAGCAGA 


GTACGCAGAA 


AATCAATTTA 


TTCTACGTGA 


ACATCCTATT 


TGGATGGCAC 


6900 


AAGAAGAGAT 


TGAATCAGGC 


ATCTATGAGA 


TGTGCGACAT 


GCTCCTTTTG 


ACCAAGGAAG 


6960 


TTTCTATCAA 


GAAATACCGA 


GCAGAGCTGG 


CTATCATGAT 


GTCTTGCAAG 


CGATCTATCA 


7020 


AGGCCAATCA 


TCGTATTGAT 


GATCATTCAG 


CTAGACAACT 


CCTCTATCAG 


CTTTCTCAAT 


7080 


GTCACAATCC 


CTATAACTGT 


CCTCACGGAC 


GTCCTGTTTT 


CCTGCATTTT 


ACCAAGTCGG 


7140 
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AATCAATTCT CCATCGGAAT AATTTCCCTA GAGTTCGGCT AATTTTTCCT TATTTCTACC 1620 

AAGCAAGATG AGTTGGTCAT TGGGCAGCAC TTTGACCATT TCTTCACCTA GACCAOCGCT 1680 

ACCTCCGGTA ATGACAATAG TAGCCATACT TATCCTTTCT GTGACTGCTA GATTTCCACT 1740 

TCTTCCAACT CTTTGACCAC ATGGACATTT TCAAAAATTC TGGCAGCGTC TTTCTTGAGT 1800 

TTGCTAATAT CTTTTGAGAG GAAACGGGCA CTGATATGGT TGAGTACGAG GCGTTTGGCA 1860 

CCTGCTTCTA CCGCTACTTG TCCAGCTTGC ATATTAGTTG ACTGACCATG GTTACGAGCA 1920 

ATTTTTTCAT CACCCTTGCC ATAAGTGGAC TCATCAACTA GGACATCTGC ATTGACAGCC 1980 

AGACGCACAC TGGCACCCGT TTTTCGAGTG TCTCCTAAAA TAGTGATAAT CTTACCTGGA 2040 

CGTGGCGCTG AGATATAGTC TGCTGCCTTG ATTTCAGTTC CCTCTTCCAA AACAAGATCC 2100 

TGGCCGTTTT TGATTTTACC AAAAAGCGCG CCGAACGGAA CACCAGCACC CTTGAGTTTT 2160 

TCAGCATCCA GCCTCCCTTC TAGATCCTTT TGCATGACAC GATAGCCAAC ACAGAAAATA 2220 

GTCTGGTCCA GCTCCTCTGC ATACACAGTG AATTTATCCG TTTCAAGAAT TTTACCCAGA 2280 

GAATCTTGGT CAAACTCATG GAAATGAATG CCGTAGGGCA GACCAGAACC TGACACACGA 2340 

A CGCTCCTT A AGACAAATGA CTTGATTCCT TGAGGTCCGT AGATTTCCAA ATCTGTCTGC 2 400 

TCTTCATTGG CCTGAAAGGC ACGGCT AG AA AGG AAACCTG GCAAACCAAA AATGTGCTCT 2 460 

CCATGCACAT GCGTAATAAA GATTTT C CTG ACCTTACCTG GTCGAATTCT GGTTTCCAGA 2520 

ATGCGATTTT GCGTACCTTC TCCACAGTCA AAGAGCCAAA CTTCGTTAAT CTCATCCAAA 2580 

AGTTTCACGG CGAGACTTCA AACGTTCCCG GCTTTAGAGG GCTCACCAGC CCCCGTTCCT 2640 

AAAAATTGAA TATCCATTCG ATACTTTCTA ATTAATCAAT ATATAACATG GCTGTGCCGT 2700 

TTTCCCATCC CAAATAGCGT TTGCCACAAA AAGCAGCAGC TTCTTGCAAT AAATCCTCTT 2760 

GGCTCTAGCC TTTGAGACGT TTTCGACCAT CAGCCAATCT TTCCAAATCA GTCAAAGCTC 2820 

TGACACTTTC TACGCTGATA ACTTCCTCCT CCTCGACAGG CTTCATGTAA ATCTT ACC AG 2880 

ACTCTTCAAA GACTAATTGA TGGGGGAAAA TTTGCGCAAT TTCAAAGAGC AACTCATCCG 2940 

AGATTTTCTC CTCATTTTCA AAGAAAATCC GACCAAGGCC GTCACTCTCA TAACAAAAAC 3000 

CAAAGGATTT ACCAGACAGA TTAAGCCGAA TAAAAGGCTT ATTTTCTAGG GTGAAACTTG 3060 

GCTCAGTATT CTAAAGATTC AGTTCCTGAC TGAGTTCTGC AAAATAATCC CTCGCAGCCT 3120 

GAGGACTCTT TTTCTGATAG AGTTCTGCAA AGTAGGCATT AACAACACTT GGCGGAGGTG 3180 

TAATAACTGT TAACTCCTCC TGATCTGTTT TACCAGCTAG AAGCTGATCC AGATAGACCT 3240 

TGTCCAGACT TGTATAACCT CCATACTTTA GAGCCAAAGT TTTAATATCA GTCATAAAAT 3 300 
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TCTTCTAACC 


TCCATTTATT 


TTTCTCGGAA 


ATGTACCCTG 


TAATCT.^TTC 


GCCGTCTTCC 


3360 


TGATAATCAC 


GTTCTTCCAG 


AATTCCAACA 


CTCTCTAAAT 


CATGAATCTT 


GTAGGACTTT 


3420 


GAAAAAGGCA 


CTCGCAGGGT 


AAATGCTTCA 


AAAATTTCCT 


TAATCTTATC 


TAGCAATAAT 


3480 


GCTTGCAACT 


TTTCACGACT 


GTCCTCAGAC 


TTGGCAGAAA 


TGAGGGTATA 


TGGCCTTTGG 


3S40 


GTAGGCCTGA 


AATCCTCCAC 


CAAATCCGCT 


TT ATT AT AAA 


GCGTCAAGTG 


AGGAATATCT 


3600 


TCCATCTC C A 


CGTCTTTCAT 


GATGGAGAGA 


ACCGTTTTTT 


CATGCTCCTC 


GTGGTAAGGA 


3660 


TTGCTAGCAT 


CGATAACATG 


AACCAGAAGG 


TCCACATGCT 


TGCTTTCTTC 


CAAGGTTGAC 


3720 


TTGAAACTGG 


ACACCAACTC 


TGTCGGCAAA 


TCTTGGATAA 


AGCCAACGGT 


ATCTGTCAAA 


3780 


GTTACTTGGA 


GATTGCCTCC 


CAGATGAATA 


CTCTTGGTTG 


TCCCATCCAC 


ACTCGCAAAG 


3840 


AGCTCATCTG 


CTTCATACTG 


GGTCTTACTG 


GTCAAGATGT 


TCATGATACT 


TGATTTCCCA 


3900 


GCATTAGTAT 


AACCAATCAA 


ACCAATCTTA 


AAAGTGCTAG 


ACTCCAAACG 


TTTTTCTCTG 


3960 


ACACTCGCAC 


GATTTTTCTC 


AACCACCTTG 


AGCTGCCGCT 


CGATATCCGT 


GATTTGATTG 


4020 


CGAACGCTAC 


GACGCTTCAG 


CTCCAGCTGG 


CTTTCACCAG 


CACCACGGGA 


ACCAATTCCC 


4080 


CCTgCCTCAC 


GCCTGAGCAT 


AATC CCCTG A 


CCAACCAAGC 


GAGGCAAAAG 


GTATTTGACT 


4140 


TGGGCTAGGT 


GGACTTGGAG 


CTTCCCTTCA 


TGGCTTCGAG 


CCCGCATGGC 


AAAGATATCC 


4200 


AAAATCAACT 


CCATACGCTC 


AATGACCTTA 


ACACCGAGAA 


CTTCCTCTAG 


ATTG AC ATTC 


4260 


TGCCTTGGGG 


TCAGACGATT 


GTTGACGATG 


ACAGTAGTGA 


TTTCTTCTGC 


ATCCACCATA 


4320 


AGCGCAATCT 


CTTCCAACTT 


ACCAGAGCCG 


ACGAAGGTCT 


TGGAATCATA 


TTTTTCACGT 


4380 


TTTTGTCTGT 


AGCTATCTAC 


AACCACTGCC 


CCTGCCGTTT 


TCGCTAAACT 


AGCCAATTCT 


4440 


TCCATGGAGA 


GGTCAAAACT 


CTCCATACCC 


TGCAATTCCA 


CACCAATCAG 


CAGCACTCGC 


4500 


TCCTCTTTTT 


TCTCCGTTTC 


AATCATCTAA 


AAACTCCTCT 


ATCTGGCTTA 


AAATGCGGTC 


4560 


TTGTACACCA 


GATTCTCCAA 


TCTGATAAAA 


GGTGACCTGC 


ATGCGATTAC 


GGAACCAGGT 


4620 


CAGCTGACGC 


TTGGCAAAAC 


GACGAGTCGC 


CTGTTTAAGA 


CTCTCACTAG 


CTTCCTCCAA 


4680 


GGTCTGCTCT 


CCACGGAAAT 


AAGGAAAGAG 


TTCCTTATAG 


CCAATTCCTT 


TAGCAGCCTG 


4740 


TACATTAGGG 


GAATGGTCAA 


ACAGCCACTT 


GGCCTCATCC 


AAAAGCCCAG 


CCTCAAACAT 


4800 


CAAATCCACT 


CGGTGGTTCA 


TACGCTCATA 


AAGTTGACTA 


CGTTCATCAT 


CCAAGCAGAT 


4860 


AATCAGCGGT 


TCATACAAGG 


TCTCTTGATT 


TTCCAAATCC 


TGACCAAAAT 


CGGCAATTTC 


4920 


TAAGGCACGC 


ATAGCACGAC 


GACCATTAAA 


CTGGGGAATC 


TCAAGGCCTG 


CTTGATCCAC 


4980 


CAAATGGGCT 


AATTCCTCAT 


CTGAATATGG 


CTCCAAACTA 


GCTCGATAAG 


CTAAAATCTC 


5040 


CTCATGAGGA 


GTCTCCCCAC 


CTAGGTGGTA 


ACCTTCTAGC 


AAGCTCTGGA 


TATAAAGTCC 


5100 
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AGTCCCACCG 


CCGATAATGG CTAGCTTGCC 


ACGCTTGTGA 


ATACCCTCAA 


TACTCATCTT 


S160 


AGCTTCTGAA 


ACAAAATCAA AAGCCGAGTA 


AGACTCGGTT 


ATCTCTCTAA 


CATCCATTAA 


5220 


ATGATGAGGA 


ACAGCTGCCT GCTCTTCTGG 


ACTAGCCTTG 


GCCCTCCCAA 




3«flU 


TCGATAGACT 


TGCTGGCTAT CTCCACTAAC 


CACTTCGCCA 


TTAAAACGCT 


TTGCOGGG 


3 J JO 


(2) INFORMATION FOR SEQ ID NO: 51: 








<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19446 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 








(xi) ; 


SEQUENCE DESCRIPTION : SEQ ID NO: SI: 






CGGAAACCCA 


TCTAGTerrcc atcgtttgcg 


AGACCAAGCA 


ACACGAATCT 


TAGATGCTTC 


60 


TCGCCAACAG 


attccacatt taatcggtaa 


GAAAAGCGAT 


GAAATCTTCT 


TTACCTCGGG 


120 


TCGAACAGAA 


ggggataact ggcttatcaa 


GGGTGTGCCC 


TTTCAAAAAG 


CTCAGTTTGG 


180 


CAAGCACATC 


attgtttcag ccattgaaca 


TCCAGCAGTC 


AAAGACTCAG 


CCCTCTGGTT 


240 


GAAAAGTCAA 


ggatttgaag tggattttgc 


TCCAGTTGAT 


AAGAAAGGCT 


TGCTCGATGT 


300 


TGAGCCCTTA 


caggtttgat acgccatgat 


ACAATCCTCG 


TTTCCATCAT 


GGCTGTGAAC 


360 


AATGAAATCG 


CCTCTATCCA ACCTATTGAG 


CCTATTTCAG 


AATTCTTGGC 


AGACAAGCCG 


420 


ACTATTTCCT 


TCCACGTTGA TGCGGTTCAC 


GCGCTTCCCA 


AAATTCCCAC 


TGAAAACTAT 


460 


CTGACAGAAC 


GCGTGGATTG CCCGACTTTC 


TCTAGTCACA 


AGTTCCACGG 


GGTTCGAGGT 


540 


GTTGGCTTTG 


TCTATATCAA ATCTGGC AAC 


AAGATTACAC 


CTCTTCTTAC 


AGGTGGTGGC 


600 


CAGGAGCGAG 


ATTATCGTTC GACAACTGAA 


AATGTGGCAG 


GGATTGCAGC 


GACAuCCAAVt 


660 


GCCCTCCCTT 


TGTCTATGGA AAAGCT AG AT 


ATCTTTAGGA 


CCAAGACTGG 


GCAGATGAAG 


720 


CCAGTGATTC 


GCCAAGCTCT TCTGAACTAT 


CCGGATATTT 


TTGTCTTTTC 


AGATGAGGAA 


780 


AACTTTGCAC 


CTCATATTCT GACTTTTGGA 


ATCAAAGGTG 


TTCGAGGTGA 


AGTCATCGTT 


840 


CACGCCTTTG 


AAGACTATGA TATTTTCATC 


TCAACAACCT 


CAGCTTGTTC 


ATCTAAGGCA 


900 


GGAAAACCAG 


CCGCTACCTT GATTGCCATG 


GGAGTGGACA 


AAGATAAGGC 


CAAGTCAGCT 


960 


GTG CCTCTTA 


GCCTAGACTT GGAAAATGAT 


ATGAGTCAGG 


TCGAGCAGTT 


TTTGACCAAG 


1020 


TTAAAATTGA 


TTTACAATCA AACTAGAAAA 


GTAAGATAGG 


AGCATTCATG 


CACTATTCAC 


1080 


AAATTATGAT 


TCGCTACGGA GAGTTGTCAA 


CCAAGGGTAA 


AAACCCTATG 


CGTTTCATCA 


1140 



onetwin. not ooo i a rj i ^ 
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ATAAACTTCG 


TAATAATATT 


TCGGACGTTT 


TGTCTATCTA 


TACCCAAGTT 


AAGGTAACAG 


1200 


CAGATCGCGA 


CCGTGCCCAC 


GCTTACCTCA 


ATGGAGCTGA 


TTACACAGCA 


CTTGCAGAAT 


1260 


CTCTCAAACA 


AGTTTTTGGA 


ATTCAAAACT 


TTTCTCCTGT 


TTATAAGGTT 


GAAAAATCTG 


1320 


TAGAAGTTTT 


GAAGTCTTCT 


GTCCAAGAGA 


TTATGCGGGA 


CATCTACAAC 


GAAGGTATGA 


1380 


CCTTTAAGAT 


TTCTAGCAAG 


CGTAGCGACC 


ACAACTTTGA 


ACTTGATAGT 


CGTGAACTCA 


1440 


ACCAAACACT 


TGCAGCCGCT 


GTATTCGAAG 


CCATTCCAAA 


TCTGCAAGTT 


CAAATGAAAA 


1500 


GTCCTGACAT 


CAATCTTCAG 


GTGGAGATTC 


GTGAAGAAGC 


AGCCTATCTT 


TCTTATGAAA 


1560 


CCATTCGTGG 


GGCTGGTGGT 


TTGCCAGTTG 


GAACTTCAGG 


TAAAGGGATC 


CTCATGTTCT 


1620 


CACGAGGGAT 


TGACTCACCT 


GTAGCAGGTT 


ATCTTGCTCT 


TAACCGTGGC 


GTGGATATCG 


1680 


AGGCAGTTCA 


CTTTGCTAGT 


CCACCATATA 


CTACTCCTGG 


TGCCCTCAAG 


AAAGCGCAGG 


1740 


ACTTCACCCG 


TAAATTGACC 


AACTTTGGCG 


GAAATATCCA 


CTTTATAGAG 


GTGCCTTTCA 


1800 


CAGAGATTCA 


AGAGGAAATC 


AAAGCCAAAG 


CGCCAGAAGC 


TTATTTGATG 


ACTCTAACTC 


1860 


GTCGCTTTAT 


GATGCGGATT 


ACTGACCGTA 


TTCCTGAGGT 


ACGAAATGCT 


TTGGTTATCA 


1920 


TCAATGGGGA 


AAGTCTACGT 


CAAGTAGCCA 


GCCAAACCCT 


TGAAAGTATG 


AAGCCTATCA 


1980 


ATGCTGTTAC 


CAACACTCCC 


ATCATTCGTC 


CTGTGGTTAC 


CATGCACAAG 


TTGGAAATCA 


2040 


TTGACATCCC 


CCAGGAAATC 


GATACCTTTG 


ACATTTCAAT 


CCAACCGTTT 


GAACACTCTT 


2100 


GTACCATTTT 


TGCACCAGAT 


CGTCCAAAAA 


CAAATCCTAA 


AATTAAGAAT 


CCGGAGCAGT 


2160 


ACGAAGCGCG 


TATGGATGTT 


GAAGCCTTGG 


TTCACCGAGC 


AGTGGCTGGA 


ATCATCATTA 


2220 


CTGAAATCAC 


ACCTCAAGCC 


GAAAAAGATG 


AAGTTGATCA 


CTTGATTGAC 


AATCTCCTCT 


2280 


AATTCAGAAA 


ATCCAAAAGA 


ATAGCGAAAA 


TCAGTAAAAA 


AACTTAGTTT 


TTTCTCTAAA 


2340 


AACAGGTAAA 


AAACTAACT7 


TTTTTATTTT 


TATGATATAA 


TGATATAAAA 


TTTTGAATAT 


2400 


AGAGAGTTTT 


CTGACAATGA 


ATCAATCCTA 


CTTTTATCTA 


AAAATGAAAG 


AACACAAACT 


2460 


CAACCTTCCT 


TATACAGCTA 


AGGAGCGCCG 


TGTACGTATT 


CTTCTTCCTA 


AAGATTATGA 


2520 


GAAAGATACA 


GACCGTTCCT 


ATCCTGTTGT 


ATACTTTCAT 


GACCGGCAAA 


ATG TTT TTAA 


2580 


TAGCAAAGAG 


TCTTTCATTG 


GACATTCATG 


GAAGATTATC 


CCAGCTATCA 


AACGAAATCC 


2640 


GGATATCAGT 


CGCATGATTG 


TCGTTGCTAT 


TGACAATGAT 


GGTATGGGGC 


GGATGAATGA 


2700 


GTATGCGGCT 


TGGAAGTTCC 


AAGAATCTCC 


TATCCCAGGG 


CAGCAGTTTG 


GTGGT AAGGG 


2760 


TGTGGAGTAT 


GCTCACTTTG 


TCATGGAGGT 


GGTCAAGCCT 


TTTATCGATG 


AGACCTATCG 


2820 


TACAAAAGCA 


GACTGCCAGC 


ATACGGCTAT 


GATTGCTTCC 


TCACTAGGAG 


GCAATATTAC 


2880 


CCAGTTTATC 


GCTTTGGAAT 


ACCAAGACCA 


AATTGGTTCC 


TTGGGCGTTT 


TTTCATCTGC 


2940 
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AAACTGGCTC 


CACCAAGAAG 


CCTTTAACCG 


CTATTTCGAG 


TGCCAGAAAC 


TATCGCCTGA 


3000 


OfAfiCCCATC 

W LAU^WWA I W 


TTCATCTATG 


TAGGAACAGA 


AGAAGCAGAT 


GATACAGACA 


AGACCTTGAT 


3060 




ATCAAACAAG 


CCTATATCGA 


CTCCTCGCTT 


TGCTATTACC 


ATGATTTGAT 


3120 




CTACATCTGG 


ATAATCTTGT GCTAAAAGTT 


CACTCTGGTC 


CCATCCATAG 


3180 


I u#vW% 1 www 1 


TfiCTCAGAAA 


ATCTACCAGA 


TTGTCTGAGA 


TTTTTTGCAG 


AAAAATGGTA 


3240 


AGTTAAbAAA 


/v» x x * x x Af*fl 


AAATGCATAT 


TGAACATCTT 


AGCCACTGCA 


CTGGTCATCT 


3300 


i» x t^rpTf & » 
TAALLW 1 unn 


#V iu 1 *tw in 


ACCGTTATGC 


ACATGGTGGG 


ATTCCAGTTC 




3360 


TTCATCAGGT 




ACGAATACTA TGATTTTGGC 


ATGATTGATG 


CCTGTGCTTC 


3420 


Li iTATCGAG 


(»AAuU.L 1 Ivi 


TCCAGTTCTT 


TACCCTATCT 


AGTTTGCATA 


CTGAGAGCTG 


3480 


GTTGGCTACT 


x x X x x 

TGCAAAAATu 


CTCATGACCA 


ACCGGAAATG 


WnWWVJ 1 VJWW k 


ACGAACGTTA 


3540 


TGTCATTGAG 


GAGGCCATTw 


TTTTATCAAG 


CACAAGACAG 


k k w«J k k k *J*» 


TGGCATGATG 


3600 


ACGACAGGTT 


CCTCTA ioOU 


AGCCTATCAT 


CCACTCAATT 


TCTTCCTCCA 

i w * * ww • ww^^ 


GCATCCAGAT 


3660 


GTCTTTACCA 


x » t"*T , f x ^*mf* 
AACTCAI iGC 


TCTCAGTGCT 


GTTTACGACG 


ws^w w * * * w k * 


TGTCGGTGAT 


3720 


TACTACAACG 


ATGATvjCTAT 


TTACCAAAAC 


i CGCCAGT AG 


ATTATATTTG 


GAACCAAAAC 


37B0 


GACGGCTGGT 


nwri X «r»W* X /■* f f~ 

TTATTunLLU 


TTACCCTCAC 


GCAGACATTG 


TGCTGTGTAC 


GGGGCTTGGA 


3640 


^ ^ J"MTW i ft ft ^■fc 

GCCTGGGAAC 


AAGATCCTTT 


GCCATCCTTT 


TACAACCTCA 


AAGAAGCCTT 


TGACAAGAAA 


3900 


CAAATTCCAG 


w w ruu x i iwv. 


TGAATGCGGA 


CATGATGTCG 


CCCATGACTG 


GGAATGGTGG 


3960 


w t»T AAAv. A/\A 


T GC C'l u r AXTT 


CCTCGGTAAT 


CTCTATTTAT 


AAAAGGACTT 


ACCTATCAAT 


4020 


TACCTTGTTA 


1 1 1L1 LV>(. 1 « 


CTATCCACAA 


AACTTTCAAC 


AGTTTACCAT 


CCAACTAGCT 


4080 






GGGAATTGGT 


CAACAGTCTT 


ACGAGCAATT 


GGATGAGCCC 


4140 






GTATTTTCGT 


GTTCATAATC 


TTGAGAACAT 


AGATGAACTC 


4200 


AAACGTGCAG 


TTGCTTTTCT 


CTTTTATAAA 


CATGCTCCAA 


TTGGCCGCAT 


CG AGTCTC \C 


A *1 £ A 

42 bO 


AATGAATACT 


GGCTTGAGCT 


AGACGCAACA 


CTCAGAGAAC 


AATTCAATCT 


TTTTGGTGCC 


4320 


AAACCAGAGG 


ATCTCAAAAA 


. GACGAAATAT 


AAGTCTGAAA 


TGAAGAAACT 


TTTCAAAAAA 


4380 


GCAGGTGTTC 


CTCTGGTACC 


TGGAGCTGTT 


ATCAAGACGG 


AAGCAGATGT 


TGATCAAGCA 


4440 


GTCAAAGAAA 


TCGGTCTTCC 


AATGATTGCC 


AAACCTGATA 


ATGGAGTGGG 


AGCAGCCGCA 


4SO0 


ACCTTTAAAC 


TTGACACAGA 


AGACGATATC 


AATCACTTCA 


AGCAAGAATG 


GGACCATTCA 


4560 


ACCCTTTATT 


TCTTTGAAAA 


ATTTGTCACT 


TCCACCGAAA 


TCTGTACCTT 


TGACGGGCTC 


4620 


GTGGACAAGG 


. ATGGAAAGAT 


TGTCTTCTCA 


ACAACCTTTG 


ACTACGCCTA 


TACACCGCTT 


4680 
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CACCTCATGA TTTATAAGAT GGACAATTCT TATTATGTGC TCAAGCATAT GGATCCTAAA 
CTGCGCAAGT ATGGGGAAGC AATTGTCAAA GAATTTCGTA TGAAAGAACG CTTTTTCCAT 
ATTGAGTTCT TCCGTGAGGG GGACGATTAT ATTACCATCC AGTACAATAA CCGCCCTGCA 
GGTGGTTTTA CCATTGATGT TTATAACTTT GCTCATTCCT TGGACCTTTA TCCTGGCTAT 
GCAGCTATTG TCGCAGGAGA GGAGTTCCCG GCGTCACACT TTGAAACTCA CTATTGTTTC 
GCTACTTCTC GCCGTGCAAA TGCTCACTAT GTTTATTCAG AAGAGGATTT GCTTGCCAAA 
TATAGCCACC AGTTCAAGCT TAAAAAAGTC ATGCCACCTG CCTTCGCGGA ACTTCAAGGA 
GATTACCTGT ATATGCTGAC CACTCCGAGT CGACAAGAAA TGGAGCAGAT CATTGCAGAT 
TTCGGACAAC CTCAAGAATA AGAACTATCG GATTAAGGAA ATTAACTCCC TTAATCCTTT 
TGTTTTGTCT GATAAAAAAT AAGAGCATCC CAACAAGGTA GCTATCATAA AACTTGTTCG 
ATAACTATTT GAACCAGGAT TAGCTGGTCA GAAATTAAAT TTTAATATTT CAATTGAGTC 
ATAGTATTGT CTTTGCCTAT CCTTAAATCA GCTAAAAGGA TCCATGACGA CACCTATACC 
ATATACTTTT CAAGATACCA AACAACTCTA TTAATATTCA ATGAAAATCA AAGAGCAAAC 
TACGAAGCTA GCCGCAGGTT TCTCAAAACA CTCTTTTGAC GTTGTGGATA GAACTCACAC 
AGTCAGTATC ATATACTACC GCAAGGTGAA GCTGACGTGG TTTGAAGACA TTTTCGAAGA 
GTATAAAATA TTCAGGTCAC GCATAGATAT AGTTAATTGA AC CTTTGTTT GAAATCTGAT 
AAAATAATGA TATTACTAAG TTTTAAAAAC TAAAGAAAAG GGAAGATATG ATTACAGGCG 
AATTAAAAAA TAAAATCGAT CAGCTGTGCC AAATTCTTTG CACAGAACGA AACCCAAATC 
CTTTAACAAA TATTCAACAG TTGACTTATC TCTTATTTAT CAAAGATTTC GATAGTGTCG 
AGCTTGGACG TGAAAGTGAT GCTGAATTTC TAGCGATTCC TTATGAGCGA CTTTTTCCAA 
AAGATAAACC TCAATACCGT TGGTCAACTT TTAAAAATAT ACGACATGCT CACGAAGTTT 
ATCGTTTAAT CACTCAGGAG ATTTTTCCCT TTATTAAAAA TCTCAACCCG GATACAGATG 
ATACAGCCTT TTCACGATAT ATGCCAGAAG CTATTTTTCA AATAAATAAA CCTGCTACGC 
TTCAAAAGGC AATTTCTATC TTAGATGTTT TTCCAACTAC GGGATTAGAT GTAGATTTTG 
ATAATGACAA ACAAAGTATT ACTGATATCG GAGATATCTA TGAATATCTG TTATCAAAAT 
TGTCGACCCC AGGTAAAAAT GGACAGTTCC GTACACCTCG TCACATCATC GATATGATGG 
TTGAGTTGAT GCAACCCACT ATCAAAGATA TCATCTCAGA TCCCCCTATG GGTTCTGCTG 
GCTTCTTAGT ATCTGCTAGC CGTTACTTAA AGCGTAAGAA AGATGAATGG GAAACCAATA 
CAGATAATAT CAATCATTTT CATAATCAGA TGTTTCATCG AAATGATACG GATACGACTA 
TGTTGAGACT TGGGGCGATG AACATGATGC TACATGGAGT AGAAAATCCA CAAATCAGTT 



4740 

4600 

4860 

4920 

4980 

5040 

S100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 
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ACCTTGACTC 


GCTGTCTLAA 


l»A 1 AA 1 w/v\Vj 


AAGCCCATAA 


ATATACTTTG 


GTTTTAGCAA 


6540 


ATCCTCCTTT 


TAAGGGrCTCA 


U l 1\*AV 1 Ht« 


ATTCAACCTC 


TAATGACCTT 


CTTGCAACCG 


6600 


TAAAAACCAA 


tt ft * ft. * * Ik » 

AAAAACAGAA 


TTACTCTTTC 


TTTCTCTTTT 


fTTCCGAACT 


TTAAAACCAG 


6660 


GTCGACGAGC 


AGCAGTTATw 


CTACCTU A l\» 


W 1 w 1 »— ^- £111 


A nJ*J 1 A V- W J "k- 1 


AAAGCTCATA 


6720 


AACGAATTCG 


TCACGAAATT 


GTAGAGAATC 




4 wU i \J 1 AA I V. 


TCAATGCCTA 


5790 


GTGGTGTGTT 


CAAGCCTTAT 


GCTGGAG 1 l T 


I. AAV. luV.UA 1 


TCTCATCTTT 




6840 


CTAATGGTGG 


TACTGACAAA 


GTCTGGTTTT 


ACOATATGAA 




1 i AAw 1 1 1 V»VJ 


69O0 


ATGATAAGCG 


ACAACCGATT 


ACCGACAATG 


* ft *T**TV**"* ft. r* & 

ATATTCCAGn 


TA I 1AL LuAA 


CG CTTTCAT C 


V y « w 


ATCTTGAAAA 


AGAAGCAGAA 


CGTCAGAGAA 


CGGATCAATC 


TTTCTTTGTT 




7020 


AGATAAAGGA 


AAATGATTAT 


GATTTCTCTA 


*M^*ft 1km ft. * ft. IV Ik 

TCAATAAATA 


tftlk Ik ft ft. r~ 1MR 

TAAAGAGAi I 


VjAOXA IIjAAA 


7080 


AAGTTGACTA 


TGAACCAACA 


GAAGTCATAT 


TAAAGAAAAT 


^*lk * M*^* ft: Ik 

CAATGATTTA 


/*kKkikr&n 
GAAAAAGAAA 


1 1 Art 


TTCAAGCTGG 


CTTGGCTGAA 


TTGGAAAAAT 


TACTCAAGTA 


****** ik ^^mw^^ 

GGGAGGTGGC 


TGTATGAAAA 


/ fiUU 


AAGTGAAGTT 


GGCGGAAGTC 


TTATCTCTAA 


ft fc ft ■ ft A ft 4k 

AAAAAGGCAA 


GAAAGCCACT 


GTACTTGC7G 


^5 Art 


AACAAACAAC 


TCTAAGCCAA 


CGTTATATTC 


AAA T AG ATG A 


FVk^^«*k m « « A A Ik 

11 1 AAGAAAT 


AA I AA I AArl 


* } £.\J 


TAAAATTCAC 


TGAAACTTTA 


AATATGACTG 


AAG C ACTCCC 


AG AT GAT ATT 


LlGATAGt. AT 




GGGATGGAGC 


TAATGCAGGA 


ACAGTTGGTT 


ATGGATTATC 


OkjbAVfU I L> 1 X 


UU i AO 1 ALAA 


7440 


TTACGGTCTT 


AAAAAAGAAT 


GAGCGATACA 


AAGAAAAAAT 


fk n Ik f^/"* Tk |k M 

TATATCAGAT 


a AL 11 GUGnu 




TCTTTTTGGA 


AAGTAAATCG 


CAGTATTTAC 


GAGATCA. TC 


aacaggtgla 


AV, AA 1 1 v. 


7560 


ATTTAAACAA 


GAATATATTA 


CTTGATTTAC 


AAnAGAAn 


Ov- 1 AUG I A 1 v- 


uAAV^/V^wMnU 


7620 


ACAACATTA? 


CTGTATTCTT 


AATACuA* */* 


AAAl»\^. i . A * 




*W% * * • ^»*%VJ • 


7630 


TAGATGAACT 


AAACTTGCTC 


GTCAAATt-t-V- 


UAi 1 1 AAV- OA 


\ff\ i w ill vvO 


r.AAAATAAAA 


7740 


TATTTGAAAG 


CATTGATAAC 


TTATTTGATA 


TTATAGATGC 


TGATACGGGC 


AAAAATTATC 


7800 


CTAAATCAGA 


TGAGTTGTTT 


AGTGAGGAGT 


ACTGTTTATT 


TTTAAATACA 


AAGAATGTTA 


7860 


CTAAAAACCG 


ATTTTCATTC 


GATACAAAGC 


AATTTATCAC 


TAAAACAAAG 


GATAAATTAC 


7920 


TTCGAAAAGG 


CAAACTTGAG 


CGTTATGATA 


TAGTCTTCAC 


AACAAGAGGT 


ACTGTTGGAA 


7980 


ATGTAGCCTA 


CTACGATGAA 


TTAATAAAAT 


ATAAACATTT 


ACGTATAAAT 


TCAGGTATGG 


8040 


TAATATTACG 


TCCCAAGACA 


CCAAATCTAA 


ATCAGAAATT 


TATTATCCAT 


GTTTTAAGGA 


8100 


ATAATAATTA 


TAGTCGAGTG 


ATATCAGGAA 


GTGCTCAGCC 


TCAGTTACCA 


ATTACAAAAT 


8160 


TAAAAAAAAT 


ACTTCTCCCC 


CTCCCCCCAC 


TAGCCCTCCA 


AAATGAGTTC 


GCAGACTTTG 


8220 
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TAGTCCAGGT CGACAAATCA CAATTTGCTT CTCAGATAGC TATAAAAGTC TGGAGAAATA 8280 

CCTTGAAATT TAGTATAATA TAG CTAAACT ATTTGTTTAA AGTGAGAAAA AAATCGGAAA 8340 

TTTTAGCTTT CTTTTAAAAA ATGACGAATA TGAATCTTTT TCAAAACCTT GCATTGAAGC 8400 

TCAGAATATG ATTGCTACAT CAACTGTGGC TACTGCCTTT ATGGCGCGTC GTGCTTTAGA 8460 

GCAGGCTGTC CATTGGATAT ATAGTCACGA TTCATATTTA GAAGCTCCCT ATCGTGCTAC 8520 

TCTATCTTCT TTACTATGGG ATGATGATTT TAGGGATATC GTAGATTCTG AACTCCACAA 8S80 

CCAGATAGTT CTGTTGATTC GGTGGGGAAA CCATGCTGCT CATGGTGGTG AAATTAAGGA 8640 

ACGAGAAGCG ATTTTAGCTT TGCATCATTT GTATCA G TTT GTTAATTTTA TCGATTATTG 8700 

TTACAGCAAT GAGTTTGTGC AGCCTTATTT TGATGAGAAG TGCTTACCAC TTTCAGCAAA 8760 

CATCAAATAC CGAGAAACTC CACAATCTAT GATAAAGTTA CAAGACAGTT TACCAGAACT 8820 

GCCTGATTTT CATGAACAGA TGGCTGCTCA GTCCGTAGAA GTTCAAGAGA CTTATACTGA 8880 

AAAACCTGAG ACTGCAGCGC AACGGCAAGA TCTGCCTTTC CATATTGATC AATTATCTGA 8940 

GGCAGACACA AGAAAGCTCT TTATTGATAT CGATCTCCCT TTAGCAGGAT GGATATTTGA 9000 

AGAAAACTGT CGTGTTGAGA TAGCCGTTGA TGGTCTCAAC CACGGTTCAG GAATTGGTTA 9060 

CTGTGACTAT CTACTTTATG GTAAAAATGG GAAAATTTTA GCCATTGTCG AGGCTAAAAA 9120 

AGCCTCTGTC AATCCAGAAG TAGGGGAAGT ACAGGTCAAA GAATATCCTG AAGCTTTGGA 9180 

GAAACATATC GGCTATCAGC CAATTTGCTT TATTACAAAT GGCTTGAAGC ACT AT AT ACT 9240 

TGATGGTCCG AACCCCCCCC AGATTGCAGG CTTTTACTCT CAACAAGAAT TGCAATTAGT 9300 

GATGGATAGA CCTCATCTTC AAAAACCGCT TGAGGATATT TCTAGTAAAA TTAGGGACGA 9360 

TATTTCCCGG CGTCACTACC AAAAACATGC CATTGCAAGC GTTTGTGAAG C TTTCTCTGA 9420 

TCATCGTAGA CACGCACTTT TGGTTATGGC AACTGGGGCG GGGAAAACTC GTACAGCAGT 9480 

TTCTCTAGTT GATATCTTAT CACGTCATAA CTGGGTAAAA AACCTTCTCT TCTTAGCCGA 9540 

TAG AACTT CC TTGGTTAAGC AAGCATATGA TTCGTTTAGA AAATTACTCC CAGATCTTTC 9600 

CCTTTGTAAC TTCTTAGAAG ATAAAGAAGG AGCTCAATCA AGTCGCATGG TCTTTTCAAC 9660 

TTATCCGACC ATGATTGGAG CGATTAGTGG TCAAGAAGAA GTAAATCAAC GCCCTTTCAC 9720 

TGTTGGGCAT TTTGACCTTA TCATAATTGA CGAATCTCAC CGTTCTATTT ATCAGAAATA 9780 

CAAGTCCATT TTTGATTATT TTGATGCAAG AATTGTAGGC TTAACAGCTA CTCCGCGTCA 9840 

AGATTTAGAT AAAAACACCT ATGGATTCTT T AATTTGG AG AATGGGGTTC CAACATATGC 9900 

ATATGATTTG GAAGAGGCTG TTAAAGACGG ATATTTAGTA GCCTATCATT CTATCGAAAC 9960 

CAAACTGAAA CTACCTACGG ATCGTCTACA TTATGATGAT TTGTCCGAAG AAGAAAAGGA 10020 
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ACATTTTGAT AGCAAATTTG AAGACAATAG CTGTGAAAAA CATATTGATG GGACTGTATT 10080 

TAATTCCTTT ATTTTCAATA AAAGTACAGT AGAAATTGTT TTAAATGAAC TCATGACAAG 10140 

AGGAATTCAG ACAGCCTCGG GTGATGAAAT TCGTAAAACT ATTATTTTTG CTAAAAATCA 10200 

TGATCATGCG GAATATATCA CAGGTATTTT TAACAACCGC TATCCTGAAA AAGGGAGCGA 10260 

CTATGCTCAG CTCATTGATT ATAGTATTAA CCATTATCAG ACCTTGATTG ATGATTTTAA 10320 

AATTAAGGAG AAGTATCCTC AAATTGCGAT TTCTGTCGAT ATGTTAGATA CAGCTATTGA 103 BO 

TCTACCAGAG GTTCTTAATT TAGTCTTCTT CAAGAAAGTA CGCTCTAAAA CTAAGTTTTG 104 40 

GCAGATGATT GGTCGAGGAA CCCGTCTATC TAAAGATTTA TTTGGACCTG AGCAGGATAA 10S00 

GCAAAACTTC TTGGTATTTG ATTATGGGGA CAATTTTGAT TATTTTCGTG CAGATCCAAG 10560 

AGATGGAGAG GGTCGTCACA TTGTTTCGCT GACTCAGCCT TTATTTAATA TCAAAGTGGA 10620 

CTTGATTCGA GAACTTCACG GACTCCAATA CCAAGAAGAT CACTTTGCGA CAGCATACCC 10680 

TCAGCAGCTT GTCTCGGAAC TTCAAGGTCG TATAGAGAGC TTAAATGAGT TCGACTTCAG 10740 

CGTTCGTATG GTTTTAGATA CAGTTTATAC CTATAGGAAA TTGGAAAGTT CGCAGAATCT 10800 

AACTGCTGTT ACAAGTCAAA CCATTCAAAA AAATCTCTCT C CGCTTTT AT TTCATCAAGA 10860 

TAAAGAAGAT GAGATGGCGA GGAGATTTGA TTTGTGCTTC CTTCATATTC AGTTGGGGCA 10920 

ACTGACAGCT AAATCTTCCA CTCTTCATAT TTCCCAACTG ATGAACACGG CTAGACCTCT 10980 

TTCTCCTATT GGCAATATCC CGCACCTTTT TCAGCAGCCT GAAATTATCA GGAAAGTACA 11040 

GGAGCCTGAA TTTTGGAAAG AAGTTAACTT GTCTGATTTC GAAAAAATTC CTCTTGCTAT 11100 

TCGACATTTA TTACAGTTTT TGGATAAAAC AGACCGTAAA CCCTACTATG TTAACTTTGA 11160 

AGATCGTATA CTCTCCACTG TTCACGAGAC CACAGCATTT TTGCAGGTCA ACGATCTTCG 11220 

GTCTTACAAT GAAAAAGTTG AGCATTATTT GAAAACTCAT CTGGATGAGG AGTCCATTTC 11280 

TAAGCTATAC CATAATAAAA AGTTGACATC TGATGATATG CTTCCACTTG AAAAATTGCT 11340 

TTGGGAAAAA TTAGCTAGTA AACCAGACTA CCAAAGTCAT TATGAAAATA AGGCAATTCC 11400 

GAGATTGGTT CGTCAGATTA TTGGCTTAGA TAGAGAGTCT GCCAATCGTA TTTTTTCTAA 11460 

ATTTTTGTCG GATGAGAATC TTAATGCCAG GCAGATTTCA TTTGTAAAAT TGATTGTAGA 11520 

CTACATTCTA GAAAATGGTT TTTTACAGAC CAAAGTGTTA ACGCAACAGC CCTTTAAATC 11580 

TTATGGTTCT GTTCAACTAC TCTTCC AAC A CCAACTACCA GTACTTCGTA ATATTGTTCA 11640 

AATCATTGAA CTTATCAATA ATCGAGCTGG ACAAGCGGCT TAAATTCTAA AGTCATTGCC 11700 

ATGCTGAGAC TCATTTAAAA TTAAAAAGAC TAGAAATTTA TGCTATATAT GAGAAGTTTT 11760 
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CGCACAACTA 
GTATGTCACA 
TCTTTATGCC 
AGGGCTTGAA 
ATCTACTCTG 
AACTTGGTCA 
TAGAGTTGAG 
AGCAAGCGAA 
GGATAGAGAA 
TCCCTTCCGT 
AGAGCGTTTT 
CTTGGATACT 
GAGAGAGCGT 
TAGACTAGAC 
TGTTTGGGAT 
TTCTGCAGTA 
GAAAAATCTG 
TATTGATATC 
TAGTAAACAT 
ACATGTCTTG 
CAGTTTTCCT 
CAATAGTAAG 
AAGGCAAACC 
CATTCCTCTA 
GGCTAGCGGA 
GATGCAAGAG 
AGAAGACAAA 
TGAGCAACCA 
CGTCGCCACT 
AAGAAAGCAT 



GAT AC AAAAA CACTCTATAG 
GCAGCTAAAG AATACGGCTA 
GCTTTCGACT TTCTAGAGAT 
ATGACACTGT TTGTAGATGA 
GGCTATCAGC AGTTGATGAA 
GTCCTGTCCC AGTACCTGGA 
TCGTTAGAAC TAGGCTGTGA 
TTTCATCATC CTATCTTACC 
GTTCTTCAAG TTTTAACACC 
TCGAGACAAG ATGTCTTTAT 
CCGCAACCTT TGGACAATTT 
AGTCTGAAAC TCCCTCGTTT 
GCTGAACTGG GCCTTGTTCA 
CAAGAATTGT CTGTT ATTC A 
TTGTTGCCTT TTGGACAATC 
GGCACTTTCG TTTCTTATGC 
ATTTTTGAAC GCTTTCTTAA 
CCAGATATTT ATCGTCCACA 
GCGGCACAAA TCGTTACTTT 
AAACGCTTTG GTGTGCCAGA 
GACAATCTTA AGTCGGCCTA 
TTAGAATACC AAAAAGCTTT 
TCTGTCCATG CGGCTGGTGT 
AAGTATCGTG ATGAAATTCC 
CTTTTGAAGA TGGACTTTCT 
TTGCTTGCTC AAACAGAAGG 
GAAACGTTAG CTTTATTTGC 
GGTGCCATTC GTCTGCTTAA 
ACTTCTCTAA ATCGACCGGG 
GGGCAGGAAG AAGTGACTCT 
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CACTCATTTG 
TACAAAAAAA 
TCAGGGAGTG 
GCTTTCGACA 
GGATATCGCG 
TTACTATATA 
TCTTTATCGG 
GATTAAAGAA 
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GGAGCCAAGC 
TCTGCAATTA 
CTCC AGTTTC 
TGCAAGATAG 
GACCAAGATT 
TATGATGCTC 
AATTTGACCT 
AAAATTGAAG 
AC AAAAGGT A 
CCAGTCTGTT 
TATATCAATA 
GTACTGGAGG 



CGATCGAAAA 
ATATTGACAA 
ATCCTTTGCT 
TTTTAGCTCT 
AGGGGGAGAA 
CTTATTTTGA 
CAGAAACACT 
TTGAAAGCAG 
TCAGAGAAGT 
AACTATTCCA 
TTTCTTACGA 
TAGAGGACTT 
AATATCAAGA 
ATTTCTTGGT 
TGGGAAGGGG 
ACCCAGTAGA 
CTGATATTGA 
ATAAATATGG 
AAGCTCTTCG 
CTAAGAAAAT 
GTCACCAAAT 
AGGGCTATCC 
TAACCAACTA 
ATGGAGTTGA 
TTGTCCAGAA 
AAATCGATTT 
TCTTTCAATT 
TTGAAGATGT 
ATTTT GTGGC 
ATATTTTGGC 
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TCTTCCAGGT ACAATCCATA ACGATATCGT TGGTACTGAC 
AGCGGTTACT ACTGCCATGG ACGCTATCGA TAAGATTCGT 
TCGTACTTTT GTAATCG AAG TTATGGGACG TAACGCTGCT 
TATTGCAACT GGTGCTGATG AAATCATCAT CCCTGAAGCA 
CGTAGCAAGC ATCAAAGCTG CTTATGAATC TGGTAAAAAA 
TGAAGGTGTG ATGTCAGCGC CTGAATTTGG TCAAAAACTT 
CGACCTTCGT GTAACAGAAC TTGGACATAT TCAACGTGGT 
CCGTCTTTTG GCGTCACGTA TGGGTGCACA TGCTGTTAAA 
TGCTGTTGCG GTTGGTATTC GTAACGAAAA AATGCTTGAA 
AGAAGAAGGG GCATTCTTTA CCCTTACTGC AGAACGTAAG 
TACAAA 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

|A) LENGTH: 16593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY : linear 



TTTAf \ TCG 
GATACATCAT 
GATATCGCTC 
GGCTTCAAGA 
CACAATATTA 
AAAGAAGCTG 
GGTTCTCCAA 
CTTCTTAAAG 
AATCCAATTC 
ATTGTGGTTA 



GTTTTGACAC 
CAAGTCACCG 
TTTGGGCTGG 
TGCAAG AT AT 
TCGTCTTAGC 
GAGATACAAG 
CTGCGCGTGA 
AAGGTATCGG 
TTGGTACTGC 
ACAACCCAGC 



16900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19446 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TCCTAAATAT CCTCTCTTTT TGGATTTTGT TTCTTAATCT CTTTCG CAAG 
ATAGAAATAG GACCACACAT ATAGACGGTT GCATCTTCCG GCACTTCTTT 
TTAAGATACC CGTCTTTCCT ACTGTCGATT AGATGGAGTT CAAAATTAGG 
GCATAGTTAC GGAGTAAATC TAGCTAGACT CCATTTTCAT CTCCACGGAA 
AAGTGAACCT GTTTATCTAA AATAGGATGT TCACGGATCT AAGAGATGAA 
CCAATACCTC CAGCAATCCA AACCTGATTT TCTCCTCCTT CTTCTATGAT 
TAAGCTCTGT CTAGGCTTAC TTTGCTGCCG GCTTGAAGAT TATCATAGAT 
TGGTCCCCTG AAGTTTTAAC AGTAAACTAA AGAGTTTGAC CATGACCTCC 
AAGGGATGCG GAGCACTTTC AAAGCCTTCT TGGAAAATCT TTAGAAAGGC 
GATTGATACT TGAAAGGTCT GCTAAGATGG ATTTGAATTT CTCTAGTATC 
CCTTTGAGAT GGGTAATTTT CCCTAGATAG GGGAAGGAAA TCTTTTGATA 
ATATAAAAAC CAGCTAGTAA GCCTAAAAGG GCATAGCTAC CAACAAGAAA 
TTAAATGTAA GGAGACGATT GCCCATTATC ATGTAGATCT GAAAGAGTCC 



TGCCTTCATC 

TTGTTCAAAA 

ATTTTTCTGA 

GCTATAGTAC 

GGGCGTGATC 

CATGTGTCCG 

ATTCTTGGTA 

TGAGATAGAA 

AAATTGTCCT 

GTGATTTAAG 

TACAAAAATG 

ACTTAGAAGA 

TAAAATATAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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GCTAGGTAAA 


CCAGGCGGTG 


AATCCATCGC 


CAAGCTTCGT 


ATTGGATGTA 


TTTCCCTAAA 


840 


TACGCGACAA 


GGATGATGCT 


GGCAAAGATA 


TAGATGCCAA 


GATTGCCAAA 


CTGAGCAGCT 


900 


AAGCGACAGC 


CCCACAAACC 


GCCCATACTA 


AAGTTATGAA 


AG ATT AGT AG 


GATGATTGAG 


960 


AGAAAGGCTG 


TGAATTTGTG 


GACGGTGTAG 


ACCTTCTCCA 


AACTGTGAAA 


CCACCTTTCT 


1020 


AG7AGTGGGA 


CACGAGTGCC 


TAGGATAAAA 


GTCACAGATA 


GCCTTGTTAA 


AGCTACTCCT 


1080 


GGAATCATGA 


ATTGGGGAGA 


AGTCTTCATC 


CAAGTCAAAA 


GAGTCAAGAT 


AAAACTAGCT 


1140 


ATGATAAAGA 


GTAGTCCTTT 


GACTGATTTC 


ATAGAAAATT 


CCATTTCATT 


TAGATTTCGA 


1200 


TTTGTTGTAA 


ATAAATTTGT 


TACATTTTAT 


CATAGAAAAT 


GTATGGTGTC 


AAATTGAGGT 


1260 


CTATAAATAT 


CTACTCTCAT 


CAAAAAACTC 


TCCAATTGAA 


CTGGAGAGTG 


GCTGTTTATA 


1320 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


GCTAGCCGCA 


ACTTGCTCAA 


AACACTGTTT 


1380 


TGAGGTTGCA 


GATAGAGCTG 


ACGTGGTTTG 


AAGAGATTTT 


CGAAGAGTGT 


TATTCTGCAG 


1440 


CTTGTTGCCA 


ACGTTTGGCT 


AGCATATGAG 


ACAGGCTAGA 


AATTGCTAGG 


TTAAAGCTGA 


1500 


AGTACATGAG 


GGCAATCAGG 


ATGTAAAGAC 


TGAAGACCTG 


CTCTGGTTCG 


AAATAACGGC 


1S60 


CCATGAGAAT 


TTGGCTGGCT 


CCAAAGAGTT 


CTTGTAGGGC 


GATAACAGAG 


TAGAGGAGAC 


1620 


TGGTATCCTT 


AATCACGGTA 


ACAAACTGAG 


AAATGATGGC 


TGGTAGCATT 


TTGCGGATGG 


1680 


CTTGTGGGAG 


AATGATGTAG 


TAGAGGATTT 


GCGCTGAGGT 


GAAGCCTTGT 


GACATTCCTG 


1740 


CTTCCTACTG 


TCCCTTGTCT 


ACGGCATTGA 


GACCGCCTCG 


AATAATCTCA 


GCCAACGCTG 


1800 


CTGATGTAAA 


GAGAGTAAAG 


GCTGTAATAC 


CTGCTGGTCT 


GGATTTCATT 


TTGAACACCA 


1860 


AAAAGATACT 


AAAAATCCAG 


AGAAGGTTGG 


GAACGTTGCG 


CACAAACTCC 


ATATAAATAC 


1920 


TGGAAATAAT 


CCGTAAGACA 


GGATTTTTGC 


CATTTCTCGT 


GACAGCTAGC 


ACCGTACCGA 


1980 


TGATAGTAGA 


GAGGATGATG 


GCAATCAGAG 


AAATATAGAG 


GGTCAAGCCA 


AATCCTTTAA 


2040 


AGATAAAGAC 


TAGGTTATCT 


rrrrtT & & & a 

(aiuu 1 1 JyJ\J\J\ 




AGA.TTCCATJV 


GTAACCTCCT 


2100 


AAAGTCAATA 


GGCTTTTTTG 


TTGGCTTGCT 


CCATCTTGCG 


ACCAAACTGG 


GCAACAGGGA 


2160 


ACCATAGAGC 


AAAGTAGAGA 


AGAGCAGCAC 


CTAAAAAGGC 


TGGTATATAG 


TTTCCGTTGA 


2220 


GAGCCGACCA 


AGACTTAGTC 


ACAAACATCA 


ACTCTACTCC 


AGAGATGATA 


GCTACAGTAG 


2280 


AGCTCTTCTT 


GATGAGGTTA 


ACAATTTGGT 


TGGTC AATGG 


AGGGAGAATG 


ATGCGGAAGG 


2340 


CCTGAGGCAA 


GATAATCAAG 


CGCATGGCAC 


TGATATAGGT 


AAAACCTTGC 


GACAAGGCGG 


2400 


CCTCCATCTG 


ACCACTAGGA 


ATAGACTGAA 


TCCCTGAACG 


AATAACCTCA 


GCGATATAAG 


2460 


CCCCGTGATA 


GAGTCCCACG 


CAGAGAACGG 


CTGTCCAATA 


AATTGGAATC 


ATCATGATAT 


2520 
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GCTCACTGAT 


AAGAGGTAGG 


CCATAAAAAA 


CAATAACAAA 


CTGCACCAAG 


AGGGGAGTAT 


2SB0 


TTTGGTAAAA 


TTCAACAAAG 


ATGCGAGCTA 


AAATGCGTAA 


AATTGGACGT 


TTACTGGTTG 


2640 


ACATGGCACC 


AAAGAAGATG 


CCCAAAACCA 


TAGCGAGGAT 


AAAGGAACCA 


ACCGCTAGGG 


2700 


CAAGGGTCAA 


GAGGAAACCA 


TTGAAAAATT 


GTCCAAAATC 


CTGAAAATAG 


GCTCTCCAAG 


2760 


ATGATAAATC 


TGTCATGGGG 


TGTCCTCCTT 


AATCTGCAGT 


ATGGCTAGAT 


GGTTTGAGCT 


2820 


TGTAACGGTC 


ATAAAGTTTC 


TGCAAACTAC 


CATCCTTGCT 


CCATTTAGTA 


ACCAACTTAT 


2880 


CAAGATAGTC 


GTTGAGCTCT 


GTATTTGATT 


TCTTGGTAAC 


AATACCGTAG 


TCAGATGGCT 


2940 


TGAAACTATC 


AT CT ACT ACT 


GCTGTCCGTT 


TACTAGTGTA 


GCCAGATAGA 


ATAGAGCGGT 


3000 


CAACGGAAAA 


GGTATCGATA 


CGATGAGCGT 


GCAGGGAAGT 


AATCAATTCT 


GGGTAGGAAC 


3060 


CAACTTCGAC 


GAATTTAAAC 


TTCAGACCTT 


TCTTTTTACC 


CAGTTCACTA 


ATCAGGCGTT 


3120 


GGGTGATAGA 


ACCTTGGGCG 


ACTCCGATGC 


TTTTGCCGTT 


TAGGTCCTCA 


ATCTTTTTGA 


3180 


TTTTGGCAGA 


TTTATTGACC 


AAAAATCCAG 


AAGCGTCTCT 


GTAGTAGGGA 


CTGGTAAAGT 


3240 


TGTAGAGTTT 


TTTGCGTTCG 


TCCGTGATGG 


TAAACCTCCC 


CATATCCATA 


TCGACCTGTT 


3300 


CATTGTCTAG 


AAGGGGGCCG 


CGGGTTTGTG 


CTGTAACCGC 


CACATAGCGA 


ATCTTCACCT 


3360 


TGAGTTCATC 


AGCTACCATC 


TTGGCCAAGT 


CGGTTTCGAT 


ACCACAATAA 


GTACCGGTCT 


3420 


TGGGATCTTT 


GTAACCAAAA 


TTCGGAACGT 


CTTCTTTGAC 


ACCGACAACC 


ACTTCGCCTC 


3480 


TTTTTTGAAT 


GTCTCCGATA 


CTTGTATCAG 


CCTCGACTGG 


TTTGGCAGCA 


GCAAGGCCGA 


3540 


AAAGGCTAAT 


CAATAATGCT 


GATAAAAAGA 


ATTTTTTTTC 


ATACGCGCCT 


CCTTATTTGA 


3600 


CTTTCTCACT 


TTCGTGGTTG 


ATAATTTTGC 


TGAGGAATTG 


TTGGGCACGA 


GGTTCCCTTG 


3660 


GATTGTCAAA 


AAAGTTATCG 


ACATCTGTCG 


TATCTACTAA 


AACTTCTCCG 


TCGGCCATAA 


3720 


AGATAATGCG 


GTCCGCAACC 


TCTCGAGCAA 


ACCCCATTTC 


CTGGGTAACC 


ATGATCATCT 


3780 


TCATCCCATC 


ATGCGCCACT 


TTCTGCATAA 


CTGCTAGAAC 


ATCTCCCATA 


GTCTCAGGAT 


3S40 


CAAGAGCAGA 


TGTTGGTTCA 


TCAAAGAGGA 


GGAGTTCCGG 


ATGCATAGCA 


AGACCACGAG 


3900 


CGATGGCGAT 


CCCCTGTTTT 


TGTCCACCAG 


ATAGCATGGC 


GGGATAGGAA 


TCTTTCTTGT 


3960 


CCCACATATT 


TACAAATTCC 


AGATATTTTT 


GGCCGGTTTT 


TTCAGCTTCT 


TTTTTATCAA 


4020 


TTCCTAGAAC 


TTCAATGGCT 


CCAACCCTTA 


CGTTTTCTAA 


C AC AGCTTTG 


TGTGGATAAA 


4080 


GCTTAAAATG 


TTGAAAAACC 


ATGCCGACTT 


CCTTGCGAAG 


AGGTACCAAA 


TCTTTCTGGC 


4140 


TGGCACCAGC 


AACTTGGTGC 


CCATTGACTA 


GGAGACTTCC 


TTTGTCAACA 


GTCTCTAAAC 


4200 


CATTGATCGT 


ACGGATAAGA 


GTGGACTTCC 


CAGAGCCAGA 


AGGTCCAAGC 


AGGACAACAA 


4260 


CTTGTCCTTT 


TTCAAAACGG 


AGATTGATGT 


TGCGGAATGC 


GTGGTAGTCT 


CCGTAATATT 


4320 
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TTTCCACCTT TTTAAATTCT ACTAAAGCCA TGAGAGATCT CTATTGTGTT ATATTTTATA 4360 

ACACCGTTCT ACAATAAAAG AATGTTCTTG TCAAATCATA TCTGAAAAAA TTCACTATAC 4440 

TGAAATAAGA ACAGGAAAAA TCGATCCCCA CAGTCAAATC GATTTCTAAC AATATTTTAG 4500 

AACTACACCT GTACTATTCT AGTTTCAATA TACTATAAAA TGTTATAAAA AAGCAATCTC 4560 

GATAGAGAAA ACGTCTAAAT CATGTTATAA TGAACCAATA GAATTCTTAG AAAGACTGGA 4 620 

TGTCTTTTTG ATAACACCTA CTTATGAATG GCAGTTTGCC CTGCAGCTAG AAGATGCGGA 4 680 

TTTTACAAAG ATAGCCAAGA AGGCTGCACT GGGTCCTGAG CTGGCTCCGT TATTGTTTGA 4740 

GAGAGGGATT CAGAACCAAG AAAGTCTGAA GAAGTTTTTA CAACCTTCCT TGCAGGACTT 4800 

ACATGATGCT TATCTGCTCC ATGATATGGA CAAGGCACTG GAGCGGATTC GTCAGGCTAT 4860 

TGAAGAACGG GAAAATATTC TTGTTTATGG AG ACT ATGAT GCGGATGCCA TGACTTCGGC 4 920 

TTCTATTGTG AAGGAAACTT TGGAACAACT TGCTGCTGAG TGCCGAGTTT ACCTGCCAAA 4980 

TCGTTTTACC „ GATGGCT ATG GCCCTAATGC TAGTGTTTAT AAATACTTTA TCCAGCAAGA 5040 

AGGGATTTCC TTGATTGTGA CGGTCGACAA TGGGGTTGCT GGTCATGAGG CTATTGCATT 5100 

GGCTCAGTCT ATGGGAGTAG ATCTCATTGT GACAGACCAT CATTCCATGC CTGAAACCCT 5160 

GCCACATGCT TATCCTATTC TCCATCCTGA AC ATCC AG A T GCGGATTATC CTTTTAAATA 52 20 

TTTGGCTGGT TGTGGAGTTC CTTTCAAGTT GCCTTCTGCC CTGTTAGAAG AAGTGCAAGT 5280 

GGAATTGCTT GATTTGCTCG CTATTGGAAC TATTGCAGAT ATGCTGACTC TGACCGATGA 5340 

AAATCGTATC TTAGTTCAAT ATGCTCTGGA AATGTTG GGT CATACCCAGC GCATTGGTCT 5400 

GCAAGAAATG CTGGACATGG CTGGGATTGC TGCCAACGAA CTAACAGAAG AAACGGTTGG 5460 

TTTCCAGATT GCTC CTCGTT TGAATGCCTT GGGTCGCTTG G ATGAT C CCA ATCCTGCCAT 5520 

TGATTTGTTG ACTGGATTTG ATGATGAGGA AGCGCATGAG ATTGCCCTTA TGATTCACCA 5580 

GAAAAACGAA GAGCGCAAGG AAATCCTTCA GTCTATCTAT GAAGAAGCCA AGACCATCGT 5640 

GGATCCTGAG AACAAGGTTC AGGTCTTGGC CAAGGAAGGC TGGAATCCTG GGGTTCTAGG 5700 

AATCGTGGCT GGTCCTTTAT TGGAAGAATT GGGACAGACA GTCATTGTTC TTAATATAGA 5760 

AGACGCTCGT GCCAACGGCA GTGCTCGTAG TGTGGAAGCG GTCGATATTT TTGAAGCTCT 5820 

GCATCCCCAT CGAGACCTCT TCATCGCCTT TGGAGGTCAT GCAGGTGCAG CGGGTATGAC 5880 

G CTGG AAGTT GAG C AACTCT CAGATTTATC TCAGGTTTTC GAAGATTATG TTCGTGAAAA 5940 

AGGTGCAGAT GCTGGTGGCA AG AAT AAGTT AAACCTAGAT GAAGAGTTGG ATTTGGAGGC 6000 

ACTTAGCTTG GAAACGGTCA AAAGTTTTGA ACGTTTAGCT CCTTTTGGAA TGGATAATCA 6060 
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GAAACCTATT TTTTATATCA AGAATTTTCA GCTCGAAAGT GCTCCTACTA TGGGCGCAGC 6X20 

t 

TAATGCCCAT CTAAAGCTGA AAATTTCCAA GGGTGAGGCG AGTTTTGAAG TGGTAGCCTT 6180 
TGGTCAAGCC AGATGGGCGA CAGAGTTTTC TCAAACCAAG AATCTAGAGT TAGCGGTTAA 6240 
ATTGTCTGTC AACCAATGCA ATGGCCAAAC TGCCCTCCAG TTGATGATGG TGGATGCGCG 6300 
AGTGGAAGGT CTTCAACTTT TTAACATTCG TGGAAAAAAT GCAGTCTTGC CAGAAGGTGT 6360 
TCCAGTCTTG CATTTTCCTG GAGAACTGCC AAATCTTGCG GCTAGTGAAG CTCTTGTCGT 6420 
AAAAAACATT CCAGAGGATA TTACTCAGCT GAAGACCATT TTTCAGGAAC AGCATTTCTC 6480 
TGCTGTCTAT TTCAAAAATG ATATTCACAA GGCTTATTAT CTGACAGGTT ATCGGACTAC 6540 
AGATCACTTT GCCAAATTGT ACAAGACTAT TTACCAGTTC CCAGACTTTC ATATTCGCTA 6600 
CAAGCTGAAA GATTTGGCTG CATATCTTAA TATTCAACAA ATCTTGCTGG TCAAGATGAT 6660 
TCAACTATTT GAAGAACTAG GCTTTGTGAC GATAAAAGAT GGTGTGATGA CACTCAATAA 6720 

AGAGGCGCCA AAGCGGGAGA TAGGAGAAAG TCAAATTTAC CAAAATCTCA AACAAACCGT 6780 

TAAAGACCAA GAAATGATCG CGCTGGGTAC GGTCCAACAA ATTTATGATT TTTTGATGGA 6840 

AAAAGAGTAG AACTTAGGAA AGACTTGGGA AATCAACTCT TTTTTGAAAA CAGACCTTCA 6900 

TTTTGAAAAT CATCAAAAAA ATCGTATAAT GGTAGGAAAA CATTCGGCTG AAAGTATCAG 6960 

AACTTTTAGA ATAAGAGGCT AGAATTGCCC TATAATCAAG AT AAA CT AAG ATTTTCGAGC 7020 

AAAAATGAGT AATATCAGTT TAACAACACT TGGTGGTGTG CGTGAGAATG GAAAAAATAT 7080 

GTACATTGCT GAAATTGGAG AGTCCATTTT TGTTTTGAAT GTAGGGTTAA AATATCCTGA 7X40 

AAATGAACAA TTAGGGGTCG ATCTCGTGAT TCCAAACATG GATTACCTTT TTCAAAATAC 7200 

CGACCGTATT GCTGGGGTTT TCTTGACCCA CGGGCATGCG GATCCCATTG GTCCTCTACC 7260 

CTATCTCTTG GCAGAGGCTA AAGTTCCTGT ATTTGGGTCT GACTTCACCA TTCAGTTGGC 7320 

AAAGCTCTTT GTCAAAGGAA ATGATCCCGT TAAGAAATTT AATGATTTCC ATGTCATTGA 7380 

TGAGAATACG GAGATTGATT TTGGTGGGAC ACTCCTTTCC TTCTTCCCTA CGACTTACTC 7440 

CCTTCCAGAG AGTCTGGGAA TTGTCTTGAA GACATCGGAA GGAAGCATCG TTTATACAGG 7500 

TGACTTCAAA TTTGACCAAA CGGCTAGTGA ATCTTATGCA ACTGATTTTG CTCGTTTGGC 7 560 

AG AG ATT GGT CGTGACGGCG TCCTGGCTCT CCTCAGTGAT TCGGCCAxTG CAGACAGCAA 7620 

TATTCAGGTG GCTAGTCAAA GTGAAGTTAG GGATGAAATT ACCCAAACTA TTCCTGACTG 7680 

GGAAGGTCGT ATCATCGTTG CAGCTCTTTC CAGTAATCTT TCTCGTATTC AGCAGATTTT 7740 

TGACGCTGCG GATAAAACAG GTCGACGTAT CGTCTTCACA GGATTTGATA TTGAAAATAT 7800 k 

CGTCCGCACA GCGATTCGTC TTAAGAAGTT GTCTTTAGCC AACGAAATTC TTTTGATTAA 7860 



+ 
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GCCTAAAGAT 
CCGTCAGCCT 
CAAGGATGGG 
TCCGCCTGTG 
TTTACATGTA 
ACCTAAGTAC 
TGCCATCGCA 
GGCTTACGAG 
TCATGGGAAT 
AGAGGATGGA 
TAGCCCTCCT 
TGAAAGTTCA 
CTCGGCACAT 
CAACCGTCGC 
AG AG AG AAAG 
GTGATGAAAA 
TACCCTGATG 
TTCCACGGGA 
CTTCGAGGAA 
ACCCACTA7S 
CGCTTCTTCC 
CGAGGCTACG 
TTTTCAGGTG 
GCCTACTGGA 
GAAAGTCTGG 
GATTTCTTCT 
GTGACCTATA 
GTTTTTTTAA 
CTTCAGCATA 



ATCTCTCGCT 
ATCAATGGAC 
GACCTAGTCT 
GAAAATATGA 



TC^GGGCACG 
CTCTTCCCTG 



GTTGGGATGT 
AATGGAGACT 
GCCATTGGTG 
ATTTTCATCG 
GTTCACACGC 
GAATTGATTA 
CTCAAAGCTA 
CCAGCCATTT 
TCGAGTTTCG 
TCGAGTATTA 
CCAATCCAGT 
TCT CTGG AAA 
CTAATCTCAT 
GTTTTCACTA 
CTAATATGAC 
GCTGCTTCAA 
CCCTCAGCTT 
GAGGTGTTTT 
CTAAAAAATC 
ACGAAGCCAA 
GCCATAGCGC 
CAACCCTACC 
GGGGGAGTAG 



TTGAAGACCA 
TTCGTAAGAT 
ATATTGCTAC 
TTTATCAGGC 
GAAATGTGCG 
TCCAAGGGCA 
TGCCAGAACG 
TTGTTCCACC 
ATGTTGGAAA 
TGGCTATTAC 
GTGGATTTGT 
ACCAAACGGT 
AGCTTCCTCA 
TACCAGTAGT 
GCTTTTTCTT 
CTCACAACTA 
GGAAGAACCA 
TCATAATAGT 
CGTTGTTATG 
CTACACGGCT 
GAGCAAGCGT 
ACTGGCTCTT 
TCAAAACTTT 
TGGACAGATT 
GGATAAAAAG 
TAATCTCGCA 
TGGAACTCAC 
AATTGATTTC 
AACTAAAATA 



471 
TGAGTTGATT 
GTCGATTGGT 
GGCTCCGTCT 
AGGTGGGGTT 
TGATTTGCAG 
GTATCGTGAG 
CATCTTCATT 
TGGATCGGTT 
TGTTCTTCTT 
AGTCAACCGT 
TTATCTCAAG 
ACAAGAGTAT 
CAATCTGACC 
CATGGAAGCA 
ATACAAAAAT 
TTGGATATCG 
CACTGTGAAG 
TCCCTTAACC 
CCCAATACCA 
CTAGCACAGG 
GAAAAGACCT 
ACGACAAATC 
TCTCCTCAAA 
AGAGACTGGA 
ACCAAACTTT 
GTGAAAAATC 
GAGTGGTACT 
AAATTAGAAG 
AAATATGTTT 



ATTCTTGAGA 
CGCCATCGTT 
ATTGCTAAAG 
CTCAAATTGA 
CTGATGATCA 
TTGCATGCTC 
CCTAAAAAGG 
TCAGCAGGAG 
CGTGACCGTA 
CGTGAGAACA 
AAGAGTCGCC 
CTTCAAGGAG 
AACTACCTCT 
AAATAATCGT 
AGAAGGAGAA 
ACTGGGGCGT 
ATATTCCCGT 
GGACCAATGT 
GCAATGGTTG 
AATTCCCACA 
TT AT CGCTGG 
GTTTTTCTCA 
GTCAAAATCT 
CAACTAGTCC 
GGGCCTGGTG 
TCAAAAAACT 
ACTGGGAAAA 
AGAGACTGAC 
TCACTAGACT 



CAGGTCGTAT 
ATGTAGAAAT 
AAGCCTTTGT 
TTACCCAAAG 
ATCTTTTGCA 
ACGCTAAGGC 
GGACGACCAT 
ATATCTTGAT 
AGGTCTTGTC 
AAATTGTGGC 
ATATTCTCCG 
ATGACTTTGA 
TTGATCAAAC 
TGAAATAAAC 
AATCATGCCA 
GAATGTCCTC 
CTTGTACCTT 
AGAACGCTTG 
GTACACCGAT 
GGTTCTGAAA 
TCTTTCTATC 
TGCAGCTAGT 
GGCAACTCCA 
CTATTCTCTT 
TGGCGAACAG 
AGGTTTTGAT 
ACAATTGGAA 
TTAGTTTGAA 
TTTCAAACGm 



7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8S20 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 
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AAGTACTAGA 


ATAGTAATAA 


AATACTGGAG 


472 
GAAAGAGAGT 


AGGAAATGTA 


CCGTTATCAA 


9660 


ATTGGCATTC 


CCACATTAGA 


ATATGATCAG 


TTTGTCAAAG 


AACATGAATT 


AGCCAATGTA 


9720 


TTACAAACTA 


GTGCTTGGGA 


GGAAGTTAAG 


TCTAATTGGC 


AACATGAGAA 


GTTTGGTGTT 


9760 


TACACGGAAG 


AAAAATTACT 


GGCGACAGCT 


AGTATTTTGA 


TTAGAACTCT 


TCCGCTAGGC 


9840 


TATAAAATGT 


TTTACATCCC 


AAGAGGACCT 


ATATTCGATT 


ATGGGGATAA 


AGAACTCTTG 


9900 


AATTTTGCCA 


TTCAGTCTAT 


TAAGTCCTAT 


GCTCGCAGTA 


AGAGAGCGGT 


TTTTGTGACT 


9960 


TTTGACCCAA 


GTATTTGCCT 


ATCTCAAAGT 


TTAATCAATC 


AGGAAAAGAC 


AGAATTTCCT 


10020 


GAAAATCTGG 


CTATTATTGA 


TAGTTTGCAA 


CAAATGGGAG 


TAAGGTGGTC 


AGGAAAAACG 


10080 


GAGCAAATGG 


G AGACAC CAT 


TCAACCTCGT 


ATTCACCCGA 


AAATATACAA 


CCAAAATTTT 


10140 


RAACAAGATA 


AACTTTCCAA 


GTCAACAAAA 


CAGGCTATTC 


GAACAGCACG 


AAACAAAGGG 


10200 


LI 1 Wnw i tV> 


AATATGGTGG 


ACTGGAACTA 


TTAGATTCAT 


TTTCCCAGTT 


GATGAAAAAA 


10260 




RAAAACACAT 


TCATTTGAGG 


AATGAAGCCT 


ATTATAAAAA 


ATTGTTAGAT 


10320 


AATTTTAAGG 


ACAACCCCTA 




GCCACCTTGG 


ATGTTTCTAA 


ACCTTCGCAA 


10380 




AACACTTAGC 


GAAAAATAGA 


GCCTTGGAAG 


AGACCTTTAC 


TGAGTCGACT 


10440 


tUnAV. 1 1 l_/V/\ 


AARTACAACC 


GCAGAAGAAG 


GAAAAAGAAC 


GTTTGTTAGA 


GGAATTCACC 


10500 




AATATATAGA 


TGTAGGTCAA 


GCGAGAGTTC 


CTTTAGCCGC 


TACTTTGAGT 


10560 


1 IVAflwii i IV* 


GTACTACCTC 


TGTCAATATA 


TATGCTGGTA 


TCGATCATGA 


TTTTAAACGT 


10620 


TACAATCCAC 

A S\V»*V% X W*»*"»V» 


CAATTTTAAC 


TTGGTATGAA 


ACGGCTCCCT 


ATGCCTTTGA 


ACGAGGTATG 


10680 




ATTTAGGTCG 


TGTTGAAAAC 


TCTCTCAATG 


GTCGACTTTA 


TCATTTTAAG 


10740 


CAAAAATTTA 


ATCCAACGAT 


TGAAGAATAC 


TTCGGTGAAT 


TTACAATGCC 


CACTCATCCT 


10800 


CTCTAT CCTC 


TGTTAAGACT 


TGCTCTTGAT 


TTCCGTAAAA 


CATTAACAAA 


AAAACATAGA 


10860 


AAGTAAGTAT 


ATGGCACTAA 


CAACACTCAC 


GAAAGAAGAG 


TTTCAGACTT 


ATTCTGATCA 


10920 


GGTTTCTT CX 


CGTTCCTTTA 


TGCAATCTCT 


CCAGATGGGC 


GATTTGCTAG 


AAAAAAGAGG 


10980 


GGCTCGAATT 


CTTTATCTTG 


CTTTGAAACA 


AuAnuununn 


ATTrAAflTTC 
nil w/v/vvj liv* 




11040 


TTATAGCCTG 


CCCATGCTGG 


GTGGTCTGCA 


TATGGAACTC 


AATTCGGGGC 


CGATTTATAC 


11100 


CCAACAAGAT 


GCTCTTCCAG 


TTTTTTATGC 


AGAGTTAAAA 


GAATATGCCA 


AGCAAAATGC 


11IS0 


TCTATTACAG 


TTGCTTGTAA 


AACCCTATGA 


AACTTATCAA 


ACTTTTGATA 


GCCAAGGTAA 


11220 


TCCAATAGAT 


GCTGAGAAAA 


AAAGTATTAT 


TCAAGATTTG 


ACTGATTTAG 


GTTATCAATT 


11280 


TGATGGCTTA 


ACAACAGCTT 


ACCCAGCTGG 


AGAACCACAT 


TGGTTATACT 


ATAAAGATTT 


11340 


AACTGAATTA 


ACTGAAAAGA 


GTTTGCTTAA 


AAGTTTTAGC 


AAAAAGGGTA 


AACCCTTGGT 


11400 
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CCATTCGGTT 


(■>kkk> ft /■» »|*/T» ft 

UAAAAAOl 1A 


>\/\/\V»-lj i UnrtU 


AALTATCGAT 


1 MCA 

1 14 OU 


TTTTAAGAAT 


»» #»» » ft & & a. ft ^ 
ATAAuAAAAC 


kit /"»/T»rT^™ * 

AAACt 1 L i\»A 


AtGTAGAOAA 




AAACTTTAOA 




ft « ft mmftm/ ■ ft 

ATATTATGAv* 


CATTTTTATG 


ATACTTTTGG 


AGAACAAGCO 


bnu 1 1 1 V— ./* 


m ft^?/*ft >/*r<w 
TAGCAAGV.il 


i icon 


AAATTTTTCG 


^ ft ft. T* m n*^* ft 

GACTATATGA 


GCAAATTGCA 


ft /■"^•V" ft ft ft ft 

AGGTGAACAA 


AbTAAAk 1 /\« 


ft ft ft ft ft ft ^HOT* 

AAGAAAAv.il 


llo4U 


GGACAAGTTG 


CGACTTGATT 


TGAGTAAAAA 


TCCTCATTCT 


IjAuAAnAnnL 


ft ft ft ft ITKf* ft ft i —11 

AAAATCAACT 


11/UQ 


GAGAGAATAT 


TCTAGTCAAT 


TTGAAACGTT 


TGAAGTTCGA 


ft ft ft /"»^ ft ^ JL ft ^ 

AAAGCAGAAG 


**+*^ **%***> ft & ^^■MVt 

CGCGAGACTT 


1 1760 


GATTGAAAAA 


TATGGAGAAG 


AAGATATTGT 


TTTAGCTGGG 


AGTTTATTTG 


TTTATATGCC 


11820 


TCAGGAAACG 


ACTTATCTCT 


TTAGTGGTTC 


CTACACTGAG 


TTTAATAAGT 


TCTATGCCCC 


11880 


TGCACTGCTT 


CAAAAATATG 


TTATGTTGGA 


AAGCATAAAA 


CGTGGAATAC 


CTAAATACAA 


11940 


CTTCCTAGGC 


ATT CAAGGG A 


TTTTTGATGG 


AAGTGATGGT 


GTTTTGCGTT 


TTAAACAGAA 


12000 


TTTTAATGGC 


TATATTCTAC 


GCAAAGCAGG 


TACTTTCCCT 


TACCATCCAT 


CGCC TTT AAA 


12060 


ATACAAACCT 


ATCCAGTTAC 


TCAAAAAAAT 


AGTAGGACGT 


TAAGATGAAA 


AAGTCACTAT 


12120 


TTAGATTTCT 


TTTAGCTTCT 


TTTAGTAAAA 


TAATTCTTAT 


TTGCTAGAAA 


GCTGGAGAGA 


12180 


CATGCGCTGG 


CTTTTTCGTT 


TGATAGGGGC 


TTTCTTTTCT 


TTTGTCTGGC 


GTTTGTTTTG 


12240 


GCCTCTGGTT 


TGGATACTTG 


TGCTCTTATG 


TGTGCTTGCT 


TTCGGACTTC 


TCTGGTATCT 


12300 


GAACGCAGAT 


TTTCAAGGAG 


CGCTAAAGCA 


AGCAGAACGG 


TCAGTAAAAA 


TTGGTCAACA 


12360 


AAGTATTGAC 


CAATGGGAGA 


AAACAGGGCA 


ACTGCCTAAG 


TTAAGCCAGA 


CAGATAGTCA 


12420 


CCAGCATTCT 


GAAGGAAGGT 


GGGCACAGGC 


CTCTGCTCCT 


ATTTACCTGG 


ATCCGCAGAT 


12480 


GGATTCACCC 


TTTCAAGAGG 


CTTATTTAGA 


AGCAATCCAG 


AACTGGAATC 


AAACTGGTGC 


12540 


TTTTAACTTT 


GAACTCGTGA 


CTGAGTCTAG 


#m ft « *-, *m ******** *^ f. gm 

TAAGGCGGAT 


ATTACGGCTA 


***>****** fk **** ft f|M lk ft 

CGGAGATGAA 


12600 


****** « *** ******* %. *—>**->***• 

CGACGGAGGC 


ACTCCTGTGG 


*** fk. ******* % ^** *K f*****^ 

CAGGACAGGC 


GGAAAGTCAA 


ACTAATCTCT 


TAACAGGGCA 


i i c £ n 


ATTCTTCTCC 


GTAACCCTGC 


GGTTGAATCA 


TTATTATTTG 


TCCAATCCAT 


ACTATCCCTA 


12720 


CTCCTATGAA 


CGCCTTGTCC 


ATACGGCAGA 


ACATGAGTTA 


GGTCATGCGA 


TTGGCTTGGA 


12780 


CCATACAGAT 


GAGAAGTCTG 


TCATGCAACC 


AGCAGGTTCC 


TTTTATGGTA 


TCCAGGAAGA 


12840 


GGATGTTGCA 


AACCTCCGAA 


AAATATATGA 


GACTAGTGAG 


TACGGTACTA 


TCTTTCCCTA 


12900 


CTTTTTTTGC 


TATAATGGAA 


CTATGAACAA 


CTTGATTAAA 


TCAAAACTAG 


ACCTCTTCCC 


12960 


GACCAGCCCT 


GCTTGCTACA 


TTCATAAGGA 


TAAAAATGGC 


ACCATTATCT 


ATGTAGGAAA 


13020 


GGCTAAAAAT 


CTGCGTAATC 


GAGTACGGTC 


CTATTTTCGT 


GGAAGTCATG 


ATACCAAGAC 


13080 


AGAGGCTCTG 


GTGTCTGAAA 


TTGTGGATTT 


TGAATTTATT 


CTTACGGAGT 


CTAATATTGA 


13140 



PCT/US97/19588 

WO 98/18931 



474 



GGCACTTCTC CTAGAAATCA ACCTGATCAA GGAAAACAAG CCCAACTACA 
CAAGGATGAC AACTCCTATC CTTTCATCAA AATCACCAAT GAG CGCTATC 
TATCACTCGT CAGGTCAAAA AGGACGGAGC TCTTTATTTT GGACCCTATC 
GGCAGCCAAT GAAATCAAGC GCTTGCTGGA TCGGATATTC CCTTTTCCTA 
CCCCCCCTCT AAGGTCTGTT TTTATTACCA TATCGCCCAG TGTATGGCCC 
TAAGAAGGAT GAGGCTTATT TCAAGTCTAT GGCCCAGGAG GTCTCTGATT 
TCAGGATG AC AAAATCATCG ATGATCTCAA CAGTAAAATG GCAGTAGCAG 
GGAGTTTGAA CGTGCGGCGG AATACCGTGA CCTGATTCAG GCTATTGGAA 
CAAGCAACGG GTCATGGCGA AAGATTTGCA AAATCGCGAT CTCTTTCGCT 
TAAGGGCTGG ATGTGTGTGC AGGTTTTCTT TGTCCGTCAG G C AAGCTCAT 
GTCAATCTCT TCCCCTACTT CAATGATCCA GATGAGGATT TTTTGACCTA 
TTCTATCAAG AAAAATCTCA TCTAGTTCCC AATGAGGTAC TGATTCCGCA 
AAGAAGCTCT CAAGGCTTTG CTGGATTCCA AGATTCTTAA GCCTCAACGT 
AACAACTGGT CAATCTAGCC ATAAAAAATG CTCCTGTTAG TCTAGAGCAG 
TGCTAGAAAA ATCTCTCGAA AAGACTCAAC GAGCTATTGA AAATCTAGGC 
AAATCCCGAC CCCAGTACCT ATCCAGTCCT TCGATAACTC TAATATCATG 
CTGTTTCGGC TATGGTGCTC TTTGTCAACG GTAAACCGAG TAAGAAGGAT 
ACAAGATAAA AACGGTTGTT GGACCAGACG ACTATCCCAG CATGAGAGAC 
GACGCTATGG TCGAGTACAG CGTGAGGCTT TGACTCCTCC AGATTTGATT 
GGGGCCAAGG TCAAGTCAAT ATCGCTAAGC AGGTTATCCA AGAGCAACTG 
TTCCAATTGC TGGGCTGCAA AAGAATGATA ACCACCAAAC CCATCAATTG 
ATCCGCTTGA GGTGGTGGAT TTGTCTCGCA ATTCTCAGGA ATTTTTCCTC 
TCCAAGATGA GGTGCACCGC TTTCCTATCA CTTTCCACCG CCAACTCCGC 
CTTTCTCATC TCAATTGCAT GGGATTGACG GTCTGGGACC TAAACCCAAG 
TGAAGCATTT CAACTCTTTG ACCAAAATCA AGGAAGCCAG TGTGGATGAG 
TTGCGGTACC TAGAGTCCTT GCAGAGGCTG TGCAAAGAAA GTTGAACCCG 
CCTTGCCTCA AGTAGCAGAA GAAAGAGTAG ATTACCAAAC GGAAGGAAAC 
CATAAAATCG CAATTTTATC ACATGTTCAT GGCAATGCGA CGGCGCTAGA 
GCAGATGCTA AAAATCAAGG GGCCAGTGAA TATTGGCTTC TGGGACATAT 
GGTCCAGGCG CAAATCACTT AGTCGCCCTG CTAAAGGACC TTCCTATCAC 



AT ATC ATGCT 
CACGCTTGAT 
CCGATGTGGG 
AGTGTACCAA 
ACACCATCTG 
TTCTGAAAGG 
CACAAAGTAT 
CCCTTCGAAC 
ACTATGTGGA 
CGAGCGCGAT 
TGTAGGACAA 
GATATTGACG 
GGACAGAAAA 
AAGTTCAATC 
C C TTTGCTCC 
GGAACTAGCC 
TACCGTAAGT 
GTCATTCGCA 
GTGATTGATG 
GGCTTGCATA 
CTCTTTGGAG 
CTCCAACGCA 
TCCAAAAATT 
CAGAATCTTA 
ATTGTCGAAG 
CAGGGAGAAG 
CACAATGAAC 
AGCAGTGATT 
TTTTCTTCCT 
AGCAAGTGTT 



13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 
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CGAGCCAATT GGCATGATCG TGTCCTTGAG GCTTTAGATG GGCAATATGG CTTAGAAGAC 15000 
CCACAGGAAG TTCAGCTCTT G CGTATG ACA CACTATTTGA TGGAGCGAAT GGATCCTCCA 15060 
ACGATTGTCT GGCTACGAAG CTTCCCTTTG CTGGAAAAGA AAGAAATTGA CCCATTGCGC 15120 
TTTTCTATCT CTCATAATTT ACCTGACAAA AACTATGGTG CTCACTTGCT ACTTGAGAAT 1S180 
GATACAGAGA AATTTGACCA ACTGCTAGAT GCGGAAACGG ACGTGGCAGT TTATGGTCAT 15240 
CTTCACAAGC AGTTGCTTCC TTATGGAAGT CAAGGGCAAC AAATCATCAA TCCAGGCTCG 15300 
ATTGGCATGC CCTATTTTAA TTGGGAGGCC TTAAAAAATC ACCGTTCCCA GTATGCCGTG 15360 
ATAGAAGTTG AAGATGGGGA ATTACTCAAT ATCCAATTTC CTAAAGTTGC TTATGATTAC 15420 
GAAGCTGACT TAGAATTGCC CAAGTCCAAG GGGCTTCCCT TTATCGAAAT GTATGAAGAA 15480 
CTGCGTCGTG ACGATAACTA TCACGGGCAC AATCTGGAAT TATTAGCCAG CTTAATAGAA 15540 
AAGCATGGGT ATGTAGAGCA TGTGAAGAAT TTTTTTGATT TTTTGTAAGA GTTTCCTAAA 15600 

ATAGCCAATG CAAACTAAAA AAGCGATTTC CTGCTCCAAT CGCTTTTACT ATATCTTATA 15660 

CTCAATGAAA ATCAAAGAGC AAACTAGCAA CCTACCCGTA GCTTCCTCAA AGCACAGCTT 15720 

TCACGTTGCA GATAAAGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTGT TATTGTAACT 15780 

GAGATTGATC TGGCAGCTAA GAACCACCTA GATAGGTATT CCTGAGTTTT TCAAGGGTTC 1584 0 

CGTCTTCATA GAGTTCTTTG AGCGCTTTAT CAAATTGCTC TTTAAACTCT TTTTGCTCGC 15900 

TTGAGAAAAT CATATAATTG CTGGGGCTAT CTGCAGAAGG TAAATCAACG ACTCAGAGCT 15960 

CTAAACCACG GTCCTTGATA AT C TTTTCAA CGGATACCTT CTCAAAAACT ACCAAATCAA 16020 

ACTCTCCGTT AGCAAGGTCT AGGATTCCTT TACCAATATC CTCACCAGAA AAATTAATTG 16080 

TAGCGGGATT ATCAGTGTGT TTCTGATTCC AGTTATTCAT GAATTGAGCC TTAGAAGTTC 16140 

CGGTATCCTC TTGTGTTCTT TTACCAGCGA TCTGGTCAAG AGAACTCAAA GGATTTTTCT 16200 

TCTTGCTCAC AAGGACGAGG GGATTGTTGG AAATTGGAAG CGAGTAAAGG TATTTTTCAG 16260 

CACGCTCTTT TGTGTAACTC AAGTTATTGG CCGCAGCCTG ATAGTGACCA CAATCAAGTC 16320 

CTCGCAAGAT CCTCTCCCAG GCGCTTCTTT GGAATTGAAT CTCGTAGTCG CTGAGTTTTT 16380 

CATCTACTGC CTTTAAAACT TCGATATCAA AGC CTGTCAG ATTGCCCTTG TCTTCGTAGT 16440 

CAAATGGTGG CACGTCGCCA GCTGTAGCAA GCACGATTGT CTTTTGAGCG CTAGTCTCTT 16500 

TGGGTGTAGC TTGATTCTCA CAGCCAACCA AAAATGGTAG GATAGCTACT AATAGGCTAA 16560 

ATTTTTTCAT ACTGTCTCCA TTCAAATGTA AAG 16593 
(2) INFORMATION FOR SEQ ID NO: 53: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3510 base pairs 
IB) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CGGATATCCT 


TATATCCTTG 


TTCCTGGAAC 


CATTGTGGGA 


ATTGCTCAAC 


ACTTTTTTCA 


60 


CCTTGAATTC 


CTGGTGCAAT 


GACAGTAAGA 


ATTTCGAAAT 


CACGATCTGG 


TTTCGCCGCT 


120 


AGTTCCATCA 


ACTCTGGCAT 


ACTTTTCTTG 


CATGGACCAC 


ACCATGAAGC 


CCAAAACTTC 


180 


AACTAAACCT 


TTTTACCCTT 


AAAATCAGAT 


AACTTAACTT 


CTTTGCCATC 


CATGGATTGC 


240 


AATGTGAACT 


CTGGAGCATC 


TTTTCCAACA 


GCAATTTGTT 


GTACAGTCGT 


TTGTTGTTTT 


300 


GGCTGTTGTG 


CTGCTTGAGT 


CTTTTTAGTT 


TCTTCCTCAC 


CACAGCCCAT 


CAATACAACT 


360 


AATGACAAGA 


GACTTAAGCC 


AGCAAACATT 


ACTTTTTTCA 


TTTGTCCTCC 


TTTATTCAAA 


420 


AATTCCAGCT 


AGAACATTTA 


CTTCTCCTAA 


TAGTAACAAA 


ATTCCCATTA 


AAACAATGAG 


460 


GAAACCACCA 


ATTTTCTTTA 


CTAGCATCAT 


ATGACGCTTC 


ATTTTACTAA 


AATATGGCAT 


540 


GACTAGACCT 


GAAGCTAGTG 


CCAATACCAA 


GAAAGGAAGG 


GCCATGCCaG 


AGTGTAAATG 


600 


AGACTATAAA 


TCGCTCCTTC 


CCAAGCGCCA 


TTGCCTCCAG 


AAGCCGCAAG 


TGCTAAAACA 


660 


GAACTTAAAA 


CTGGACCAAT 


ACAAGGTGTC 


CAACCAAAGC 


TAAAGGTAAT 


ACCAAGTAAA 


720 


AAAGCTG AC C 


AATAACGATT 


AGAATCTGAT 


TTTTTAAAGG 


TAAAACTTTT 


TTGAACTTCT 


7B0 


AATTTCTTCA 


AATGAAAAAT 


TTCCATCTGG 


TGAAGACCCA 


AAATGATAAT 


AATAGCTCCC 


840 


ATGCCATATC . GAAACCAATT 


TGCATAGAGA 


ATATGACCAA 


AGTAACCAGC 


ACCAAAGCCT 


900 


AGAATAAAGA 


AAATCAGAGA 


GATACCACCG 


ATAAAGCAAA 


CTGTTCGAAT 


CAAGCCTGAC 


960 


CAGAGAACCT 


TTCTCCCAAA 


CAAAGAAAAG 


CTTTTTGCAC 


TTTCTTGATC 


ATCCAATAAA 


1020 


ATCCCAGCAT 


ACACTGGCAG 


AAGAGGAAAA 


ATACAAGGAG 


AAAAAAAGGA 


TAAAACACCT 


1080 


GCTACAAAAA 


CAGAGATTAA 


AAATACTATC 


GTTTCCAATA 


AAGAACCAAC 


TTTCTTAATA 


1140 


ATTCTAATCC 


TATTTTACTA 


TATTCAATTT 


TATTTGTAAG 


CTTTCTGCTA 


CCCAAAATCG 


1200 


TATCGGGCAC 


TATTGGACCA 


A'fCTTTTCTT 


TTGCTAGTCA 


ACGCGGATCT 


TATCCCCCAA 


1260 


AATAGCCAAA 


AAGCAACGAC 


AAGGATTACT 


CATCGCTGCT 


TTTCTGAACG 


AAAATCTCTT 


1320 


TTACGTCTGA 


CATTTCATAA 


ATCATGTTTT 


ACTTGAGTTT 


GTCAAGGATT 


GCTTTAAGCT 


1380 


CCTCTACTAG 


TTTAGTTTCT 


GTCTCTGCTG 


AGCCATTTTC 


TTCTTTCACG 


AAATCAAGCG 


1440 


TTTCTTGGAG 


AAGGTTTTGG 


GCTTTGGCAA 


GGACTTTTTT 


ATCCGCTTTT 


TCTCCATCTA 


1500 
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rrT(7TrCTAC 


AACCTTGATC 


AATTCCGTGC 


TTAATTGCTG 


GATT-. rGAC 


TCTTTCTTAC 


1560 


/••r^V"* & ft 2k 


rr AGAAGGCA 


ATCACGCCTA 


GGAGGGCAAG 


TAGACTGACC 


ACAATCACTC 


1620 




TGAGTTTGTT 


TCAGTCATCT 


TATCTGAATC 


CTT ACT ATCT 


TCCCTTCCTT 


1660 


Vil 1 t IVilnlv. 


CTTCTTGTCC 


TGTGCAGGCT 


TGCTGTCGCT 


AGCATTTGCT 


TTCACATCTT 


1740 


TuAUAunu 1 V- 


CAAGGCAGCC 


CAGCCTTCAC 


AGACTCTACT 


GCAGTATGCA 


GACCTTACTC 


1800 


*rr*¥*r k k r~nr* ik 


CTATCTTCCG 


GAGCTTTTTG 


AGCATCTAGG 


AGG AC AGCCT 


TGGTTGCATC 


1860 


GAl 1 J ivuuft 


TrAGATACTG 


TTGCCAAAGC 


TTTCAAGCGT 


TGGTCTAACT 


CTTGACTCAA 


1920 


GGCACGAAuT 




CAACTTGCTC 


TTGAGCTTGT 


GTGCTCGTTG 


AGCTAGCCGA 


1980 


AGCGCTTt»L. 1 




GATCTTGAGT 


CGGAGCTGAG 


CTTGGAGCTG 


GGACAGGGCT 


2040 


TGCAGblluA 




TTATGGTATA 


TTGAAACTAG 


AATAGTACAT 


ATGGACTTCT 


2100 


& ft ft ft ^» ft if Ml 

AAAACATTGT 




TTTTACTGTC 


CTGATCGATT 


TGTCCTATTC 


TTATTTCATT 


2160 


TTACTATAAT 




CTGGTTAATG 


TTGGTAAGAG 


AAACTTCTGA 


AACCAAGCTT 


2220 


CAAAAAAGTL. 


firrrcTCATC 


GTCTCTTCGT 


AAGTCATTGG 


AGCGATTAAT 


TCACCATTTG 


2280 


T TAGACL I GC 


& AfTAAACAA 


ATCCTCTGAT 


ATCTTCTTCC 


AG AT ACTTTG 


CCTCTTATTA 


2340 


ACTGACCTTT 




CCATATTCTC 


GATAAAAATA 


AGTATCGAAT 


CCTGTTTCCT 


2400 


^4 k m mmvi unto ^ 

CAATCTAAAC 


Auw 1 w*» 1 nuu 


TCCTTTAAAC 


TATTAAAATT 


CTTAAGAAAT 


AAGGCTACTT 


2460 


T I TL i GWjI v- 




TAGGTGTGGT 


TCTTTTTTTC 


GAGTGTAGCC 


CATAGCTTTG 


2520 


Autumn 1 W-t 1 




TCGATGACAG 


CCAAAltTCAG 


AAGCTATTTC 


AGTCAAATAA 


2580 




TfTTCAGTAAG 


ATAGTTTTTA 


AGTCTATCTC 


TATCAACTTT 


TCTTGGTTTT 


2640 


ul i *— 1 l i i n 


CTTCGTGGTT 


TAGCTCTCCT 


GTTTTCTCTT 


TTAGCTTTAA 


CCAGCCATAA 


2700 


A IuVi inl 1 t\\m 


GTGAGATTTG 


GAAAACGTGT 


GATGCTTCTC 


TTATACTACC 


TATTCCCTCA 


2760 


CAATAAGAGA 


GAACTTTTTT 


APflAAAATCT 


ATTGAATATG 


CCATAAGAAG 


ATT AT ACC AC 


2820 


ATTGTGTACT 


ATTTTTGGTT 


CATTTCACTA 


TAACACAAAA 


TAGATTATTA 


TTACATAACA 


2880 


AAAAAGAGGT 


CTAAACCTCT 


TAACTCAATT 


ACTCCGCCAG 


TAGGACTCGA 


ACCTACGACA 


2940 


TCATGATTAA 


CAGTCATGCG 


CT ACT AC C AA 


CTGAGCTATG 


GCGGATTAAA 


GCTAAGCGAC 


3000 


TTCCCTATCT 


CACAGGGGGC 


AACCCCCAAC 


TACTTCCGGC 


CTTCTAGGGC 


TTAACTTCTC 


3060 


TGTTCGGCAT 


GGGTACAGGT 


GTATCTCCTA 


GGCT ATCGTC 


ACTTAACTCT 


GAGTAATACC 


3120 


TACTCAAAAT 


TGAATATCTA 


TTCAATTTAA 


GAAAACCGTT 


CGCTTTCATA 


, TTCTCAGTTA 


3180 


CTTTGGATAA 


GTCCTCGAGC 


TATTAGTATT 


ACTCCGCTAC 


ATGTGTCGCC 


! ACACTTCCAC 


3240 



nucfwin- >uun 0010001 ar> i 
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TTCTAACCTA TCTACCTCAT CATCTCTCAC GGCTCTTACT GATATATAAT CATGGGAAAT 



3300 



CTCATCTTGA CCTGGkTtCA CACTTAGATG CTTTCAGCGT TTATCCCTTC CCTACATAGC 



3360 



TACCCAGCCA TGCCTTTGGC AAGACAACTG GTACACCAGC GGTAAGTCCA CTCTCCTCCT 



3420 



CTCGTACTAG GAGCAGATCC TCTCAAATTT CCTACGCCCG CGACGGATAG GGACCGAACT 



3480 



GTCTCACGAC GTTCTGAACC CAGCTCGCGT 



3510 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20986 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CGGAGAAAAA CATGGCTAAG TCAAACTTTG AAAAACTAGA ATCACTTGTT GGCTGGGTTC 60 

GTGATAAGAA AATCACAGCC TACCGTATCT CTAAAGAAAC CAATCCCCCT GAAATGTCTA 120 

TCATTGCTCT GGCGCAGGGT CGTGCAAAAG TAAAAAATAT TTCATTTGAA ACACCCCTAG 180 

CCCTAATTGA TTTCTATGAA AAAAATTATG AAAAATTTGA AGATTAATCT TTGGATAACG 24 0 

GCGGATTCTT GACCTTCAAG TAGTAGAGAT ACAGAATCTG CCTTTTCATT TTGAGGACAG 300 

CAAAAAGACT GCACGGTTCA TGCACCCTTT TCTTTTTATT TGACATAGCG TTGAAGGAAC 360 

TCTTTTGTTC GGTCTTCTTT AGGATTGCTG AAGAGGTCTT CTGGTTTACC TTCTTCAGCG 420 

ATCACGCCCT TATCCATAAA GATAACACGG TGAGAGACAT CACGGGCAAA TTCCATTTCA 480 

TGGGTTACGA CAATCATGGT CAAGCCTTCC TGAGCCAGGT CCTGCATGAT TTTCAGCACT 540 

TCTCCAACCA TTTCTCGATC GAGACCTGAT CTTGGTTCAT CAAAGAGAAT AGCGTCCGGA 600 

TTCATGGAGA GGGCACGAGC GATGGCCACA CCTTGTTTTT GACCACCTGA GACTTGTTTT 660 

GGTTTCGCTT GCCAGTAGCG TTCTCCCATG CCGACCTTTT CCACGTTTTC TTTGGCAATC 720 

TTTTCAGCTT CTGTGCGTTC GCGTTTTAGG AC AGTTGTCT GAGCGACGAT TGTGTTTTCA 780 

ACAACATTGA GATTTTCAAA GAGCTTAAAG GATTGGAAAA CCATCCCCAA CTTTTCACGG 840 

TATTGCGTGA GCTCATAGCC TTTTTCGAGG ACGTTTTGTC CATGATAAAG GATTTGTCCA 900 

TCAGTTGGTG TTTCAAGTAG GTTAATGGAG CCTAGGAAGG TCGATTTTCC GCTTCCAGAG 960 

CTTCCGATGA TACAGATGAC CTCTCCCTTG TGGACAGTGA GTGAAATGTC TTTTAGCACT 1020 

TCGTTTTGTC CATAGGATTT TTTGACGTGT TTAATTTCAA GGATTGCTTG TGTCATTATT 1080 

TCAAATCCTC CGTTTGCATT TGGTTAGCAC CTCTAGTCTA GGTATCCATG TCCATTCTGC 114 0 
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GCrCCATAAA 


GCGTAGGATA 


t-Vj lull /\WwW 


TGAAGGTGAG 


GACAAAGTAA 


ATCACGGCGA 


1200 


TGATTGTAAA 


TGTCTGGAAG 


»f* •.♦ff^* ft^ Art/*! 

T ATTt* A 1 AuO 


1 1 * V«l lull WW 


CACCCTATTT 


CCTCAGAAAT 


1260 


AAACTTCCAC 


AACAGAGATA 


ACGTTv. AA 1 A 


fAfiATftTATC 
lAwn 1 v# Ifti*- 


TTTGATATTG 


ATGACAAATT 


1320 


CATTACCAGT 


TGCAGCTAGG 


A L\* 11 ACWjA 


w * /%ww a w^ww 


TAGGACAATC 


TTACGCATGG 


1380 


TCTGGTTATG 


GGTCATACCA 


AGAGCAGTCG 


f&fifTTfAAA 
W AwW i 1 W/WV* 


TTGTCCCTTG 


TCAACTGCTA 


1440 


GGATACCACC 


ACCGACGATT 


TCAGTC ATV*T 


AwwWMWwww * 


ATTGATTGAA 


ACGATGAAGA 


1SO0 


T AGCAGC C AG 


TGTACGGTCA 


AGGTTGATCC 


LvAAAuC I * w 


GGCAGTTCCA 

WWWrVW * A WW#» 


TAGTAGATAA 


1S60 


CCATCGATTG 


AACAATCATT 


GGCGTACCAC 


(tuAnnn 1 i 


AATCTACACA 


TTGAGAACCC 


1620 


AGCCGACTAG 


TTTTTGTAGC 


CCGTAAATGA 


CTTTGTTTTC 


nviAvin V* nwUn 


GCAGTACGGA 


1680 


AGACACCAAT 


GGCAAGTCCA 


ATAATGAGAC 


CTATuATWil 




CAGATTAAAA 


1740 


GAGTGATACC 


AGCACCACGC 


AAGAGTTGTT 


CCCAGTTn w 


Aw A/%MwAw\ 1 i 


TTAGCAACTT 


1800 


GGCTAAACAA 


ACTACTCCTA 


GTCTCTTCAG 


TTvTTGTAbL 


1 L vwUV> /\WW 1 


TCTTCCTTGA 


1860 


TCATACGATC 


CATCAAGGCA 


ACTTGGTCAT 


CTTTTGAAAT 


GGTTTCAATG 


CTCCCATTGA 


1920 


TTTGGCTAAT 


ACGATTGTCA 


TTTTTACGAA 


GCCCGATAww 


wA 1 Aww 1 w i A 


A ^ A i V> ± 1 W W 


1980 


CACTTTTGAA 


ACCACCTTCT 


ACTTGAATCA 


TCTTGAAt. 1 I 


nunu 1 1 Www** 


GCTTCACCAG 


2040 


TCAGTGCTTC 


TGGACGTTCA 


GAAACATAAG 


CATCAATwAw 


Aw^~AwWW 4 w#* 


AGAGCTTGTC 


2100 


GCATTTGAGC 


GAAGTCTCCC 


ATGCC1 GTTT 


CTTTTTTAGC 


/\V_V_ 1 www/* A I 


TGTGCAATCA 


2160 


AGTTATAAAG 


GTAGACCCCT 


TCTTGAGAAG 


TV*Ai 4 • 1 1 ww 


ACrr.TTAAAG 


TCATCCAAAG 


2220 


ATTTAGCACT 


TGCGTAGGCA 


GAATCTTTTT 


m/~> ir« ft/~f* & A 
TuAL AA ww /v* 


AACTGCTTCG 
#v*W » ww * * ww 


CTAGTATAGT 


2280 


AACTGCTCGA 


AAAGGCAATT 


TCTTGTTTGC 


Vj I i V» . A*w/*w I 


TCGACTCATA 


CCTGCCATAA 


2340 


TCATGTC AAT 


CTTACCAGAA 




GCACTAGACC 


TTCCCACTTG 


GTTTTAACAA 


2400 


CCAAAGGTTC 


TTTACCTAAG 


TCCTTAGCGA 


TTTTCTTGGC 


GATTTGAACA 


TCCTATLLSi 


2460 


TGGCATACTG 


ATTGGTCCCA 


TCGATTTTGA 


CAGCTCCGTT 


GCTATCATCA 


TCCTGGGTCC 


2S20 


ACTTAAAGGC 


AGCATATGCT 


GCTTCCATAC 


CGATGCGTAA 


ATATTCATCG 


GCTTGAGCAA 


2580 


CATTGACAAG TCCTAGCATC 


AGCAAGAGAC 


TTGTGAAAAT 


AGATAACTAy 


ATGTGGCTCA 


2640 


TGATTTCTCC 


TATTCTGATC 


TATTAAAAAA 


TAACTGTCTC 


CTATTTTATC 


GAAAAATGCG 


2700 


TAATTTTTCA 


ACATAAGTAA 


GTCTTTACTT 


ACGAAAAAAT 


GCTATAATGA 


TAAGAAACAT 


2760 


AAAAAGGGGG 


CTTAGTTCAT 


GAAAAAAACT 


TTTTTCTTAC 


TGGTGTTAGG 


CTTGTTTTGC 


2820 


CTTCTTCCAC 


TCTCTGTTTT 


TGCCATTGAT 


TTCAAGATAA 


ACTCTT AT C A 


AGGGGATTTG 


2880 
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TATATTCATG 


CAGACAATAC 


CGCAGACTTT 


AGACAGAAGA 


TAGTTTACCA 


GTTTGAGGAG 


2940 


GACTTTAAGG 


CCCAAATCCT 


GGGACTTGGA 


CGTCCTGGTA 


AGATGCCTAG 


CGGGTTTGAC 


3000 


ATTGACCCTC 


ATCCAAAGAT 


TCAGGCCGCG 


AAAAACGGTG 


CAGAACTAGC 


AGATGTGACT 


3060 


AGCGAAGTAA 


CAGAAGAAGC 


GGATGGTTAT 


ACTGTGAGAG 


TCTATAATCC 


AGGTCAGGAG 


3120 


GGCGACATAG 


TTGAAGTTGA 


CCTCGTCTGG 


AACTTAAAAA 


ATTTACTTTT 


CCTTTATGAT 


3180 


GATATCGCTG 


AATTAAATTG 


GCAACCTCTG 


ACAGATAGTT 


CACAGTCTAT 


TGAAAAGTTT 


3240 


GAATTTCATG 


TAAGGGGAGA 


CAAGGGGGCT 


GAAAAACTCT 


TTTTCCATAC 


ACGGAAACTT 


3300 


TTTAGAGAGG 


GAACGATTGA 


AAAGAGTAAC 


CTTGATTATA 


CTATCCGTTT 


AG AC AATCTT 


3360 


CCGGCTAAGC 


GTGGAGTTGA 


GTTGCATGCC 


TATTGGCCTC 


GGACCGATTT 


TGCTAGCGCT 


3420 


AGGGATCAGG 


CATTGAAAGG 


GAATCCTTTA 


GAAGAGTTTA 


ATAAGATAGA 


AGACTCGATT 


3480 


CTTAGAGAAA 


AAGATCAGAG 


TAAACAACTC 


GTTACTTGGG 


TCCTCCCTTC 


CATCCTTTCC 


3540 


ATCTCCTTGT 


TATTGACTGT 


CTGCTTCTAT 


TTTATTTATA 


GAAGAAAGAC 


CACTCCTTCA 


3600 


CTCAAATATG 


CCAAAAATCA 


TCGTCTCTAT 


GAACCACCAA 


TGGAATTAGA 


GCCTATGGTT 


3660 


TTATCAGAAG 


CAGTCTACTC 


GACCTCCTTG 


GAGGAAGTCA 


CTCCCTTCGT 


CAAGGGACCT 


3720 


GGAAAATTCA 


CCTTTGATCA 


ACTTATTCAA 


GCTACCTTCC 


TAGATCTGAT 


AGACCGTGGG 


3780 


AATGTCTCTA 


TCATTTCAGA 


AGGAGATGCA 


GTTGGTTTGA 


GGCTAGTAAA 


AGAAGATGGT 


3840 


TTCTCAACCT 


TTCACAAAGA 


CTGCCTAAAT 


CTAGCTTTTT 


CAGGTAAAAA 


AGAAGAAACT 


3900 


CTTTCCAATT 


TGTTTCCGGA 


TTACAAGGTA 


TCTGATACTC 


TTTATCCTAG 


AGCCAAAGTT 


3960 


TCTGATGAAA 


AACGG ATT C A 


AGCAAGAGCG 


CTTCAACTCA 


AATCTTCTTT 


TGAACAGGTA 


4020 


TTGAACCAGA 


TGCAAGAAGG 


AGTGAGAAAA 


CGAGTTTCCT 


TCTGGGGGCT 


CCCAGATTAT 


4080 


TATCGTCCTT 


TAACTGGTGG 


GGAAAAGGCC 


TTCCAAGTGG 


GTATGGGTCC 


CTTGACTATC 


4140 


CTGCCCCTAT 


TTATCGGATT 


TGGTTTGTTC 


TTGTACAGTT 


TAGACCTTCA 


TGGCT ATCTT 


4200 


TACCTCCCTT 


TGCCAATACT 


TGGTTTTCTA 


GGGTTAGTTT 


TGTCTGTTTT 


CTATTATTGG 


4260 


AAGCTTCGAC 


TAGATAATCG 


TGATGGTGTT 


CTAAATGAAG 


CGGCAGCTGA 


GGTCTACTAT 


4320 


CTCTCGACCA 


GTTTTGAAAA 


TATCTTGCCT 


GACATTCCAC 


GATTGGATCA 


GGCTCAACTG 


4380 


GAAAGTATTG 


TGGTCTGGAA 


TCGCCTCTTG 


CTCTATGCGA 


CCTTATTTCG 


CTATGCGGAC 


4440 


AAGCTTACTC 


ATTTGATGAA 


GGTTCATCAG 


ATTCAAGTGG 


AAAATCCAGA 


TATCAATCTC 


4500 


TATGTAGCTT 


ATGCCTGGCA 


CAGTACGTTT 


TATCATTCAA 


CAGCACAAAT 


GAGCCATTAT 


4560 


GCTAGTGTCC 


CAAATACAGC 


AAGCACCTAC 


TCTGTATCTT 


CTGGAAGTGG 


AAGTTCTGGT 


4620 


GGTGGCTTCT 


CTGGAGCCGG 


AGGTGGCGGC 


ACTATCGGTG 


CCTTTTAAAG 


AGAGCTACCA 


4680 
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TAGACTGAAA 


h. jvrrr atcata 


TAATGGAAGA 


TAGAAAAAAG 


ACAAACTATA 


AGAAAAGTCA 


4740 


ATAU I l l l AT 


V> 1 tvv\\> in* 1 


TCTTATTTCA 


ATTTGATGAT 


TTGGCCATGA 


TTTTAGAGCA 


4800 


CGGCAAAAAG 


CCLTluAAAA 


AfiTCCATTTT 


TTCAAAGGTA 


ATCCTGTGTT 


AATTTCAGAA 


4860 


ATTACATCAC 


TTTTTGTTa* I* 




GCTCTTTTTT 


AGGATATAAA 


ACACGGTTCG 


4920 


GATAACTTTT 


I i i GCAAGwT 




TACATTGTAA 


TGTTTTCCTT 


ATTCTAACTT 


4980 


AGTCTTAAGA 


TAGGCCTTAG 




AAAGCGAGGG 


CATCCTTTGG 


CAGCTTGTAT 


5040 


GAGTGCCCAC 


CGCAGATGAG 




TTTGACCATT 


CTTCCAGCTA 


AATCAATCTC 


5100 


ACCTGACTGA 


TAAATAGAAG 




AGCGAAAGCT 


TGTAATTGAG 


CAGGATTATC 


5160 


AAAGGCATGA 


ATATTTCGAA 




* a.Tf2 ArCGCC 


CTAAACGATC 


CCCAATCCCA 


5220 


GTAACCCTCG 


TGATGACCGA 


GTTGAACTCA 


UU^AiVUnW 1 


CATTGATACA 


TGTTTCCCCC 


5280 


TTGTCAATGA 


GCCTCTTGTA 


ATGCTTuA lu 


ATTTTCAATT 


CACGAGCAGG 


AGATGTTGTT 


5340 


CCGATAGAAC 


CAGGTGCGAC 


TGAGAGGATA 


TV*fTf* a &TTT 


TACAAGCGGT 


CAATCGCTTA 


5400 


ATTTCTATCA 


GCTTATCAAA 


TCCTGtCTCA 


A 1 1 1 1 I v. i 


GAGGATTAGG 


GTAGCGTGTC 


5460 


AAGAGTTGGT 


AGGTATATTC 


I 1 VjV_ 111 


CC AACCATTT 


TATCCAACTC 


AGGAAAGATG 


5520 


ATATCAAGAC 


AACGAGTGTA 


1 l\j 1 AL 1 I IV. 


TAATCACACT 


GTTTTTCTTG 


AGACCATGAA 


5580 


TATCTCTAGC 


CAGTATTTTT 


AGGTCTACTT 




GTGTTGAAAT 


TCTTCACGAT 


5640 


TGGGGTCAGA 


AAGAAGTTTA 




CATCAGCGTC 


TTTCTTATCC 


GTTTTACTCT 


5700 


TGCGAAGTCA 


TAATGATTTG 




TCATCAGCAA 


AGCATTGTAG 


GTGTAAACTT 


5760 


TATATCCTTG 


TTCATGCAGG 




CATTAAAGGC 


ATAATGTCCA 


GTATCTTCAA 


5820 


CAGCGATGAG 


ACAUTL l Iw»jr 


TTCATCTGTC 


GAATAGACAG 


ATCTAAGAGT 


T C AAAACC AG 


5880 


CTTTATTATT 


TGAAAAAU1 


i wo J i i 


GAACAGTTTT 


TCCTGCAACA 


TTCAACCCTG 


5940 


TAACATCGTG 


TTTATTTTTA 


GCGATATCAA 


TGCCTACATA 


AAuUA J^ajOA 




6000 


ATATAGTATT 


TCAAGTCTAC 


TTGGTTATCC 


ACGAATTTTT 


TCCCTTGTTA 


CCTTAGACGA 


6060 


GATCAAACGT 


CTATGCGTTA 


TCAAACTCAT 


TACCAATTGA 


AACAAAAGCT 


CTGGTTAGAC 


6120 


CCTTTCGGAA 


ATCCTCAAGC 


GATTGGAGGA 


AATGAACTAA 


TCCATACTGG 


CTTATTCCAA 


6180 


GTATACCACT 


TGCCCTTTGG 


CAGTAGCTAA 


CTGCGCTAAA 


TATAATATAG 


GGAGTAATCT 


6240 


ATGT ATCTT A 


TTGAAATTTT 


AAAATCTATC 


TTCTTCGGAA 


TTGTTGAAGG 


AATTACGGAA 


6300 


T GCTTGCCG A 


TTTCCAGTAC 


AGCTCACTTG 


ATTTTAGCAG 


AGGAATTCAT 


CCAAT ACCAA 


6360 


AATCAAAATG 


AAGCCTTTAT 


GTCCATGTTT 


AATGTCCTGA 


TTCAGCTTGG 


TGCTATTTTA 


6420 
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GCACTTATGG 


TGATTTATTT 


TAACAAGCTC 


AATCCTTTTA 


AACCGACCAA 


GGACAAACAG 


6480 


GAACTTCGTA 


AGACTTGGAG 


ACTATGCTTG 


AAGGTCTTGA 


TTGCTACTTT 


ACCTTTACTT 


6540 


GCTGTCTTTA 


AATTTGATGA 


TTGGTTTGAT 


ACCCACTTCC 


ATAACATGGT 


TTCACTTGCT 


6600 


CTCATGTTGA 


TTATCTACGG 


GGTTGCCTTC 


ATCTATTTGG 


AAAAGCGCAA 


TAAAGCGCGT 


6660 


GCTATCCAGC 


CAAGTCTAAC 


AGAGTTGGAC 


AAGCTTCCTT 


ATACGACCGC 


TTTCTATATC 


6720 


GGACTCTTCC 


AAGTTCTTGC 


TCTTTTACCA 


GGGACTACCC 


GTTCAGGTGC 


AACGATTGTC 


6780 


GGTGGTTTGT 


TAAATGGAAC 


CAGTCGTTCA 


CTTCTCACAG 


AATTTACCTT 


CTATCTTGGG 


6840 


ATTCCTGTTA 


TGTTTGGAGC 


TAGTGCCTTA 


AAGATTTTCA 


AATTTGTGAA 


AGCCGGAGAA 


6900 


CTCTTGAGCT 


TTGGGCAATT 


ctttttgctc 


TTGGTCGCGA 


TGGGAGTAGC 


TTTTGCGGTC 


6960 


AGCATGGTGG 


CTATTCGCTT 


CTTGACCAGC 


TATGTGAAAA 


AACACGACTT 


CACCCTTTTT 


7020 


GGTAAATACC 


GTATCGTGCT 


TGGTAGTGTT 


TTGCTACTTT 


ACACTTTTGT 


CCGTTTATTT 


7080 


GTATAAGAAA 


AACCTTGAAG 


CGGCAACTCT 


TCAAGCTTTT 


ATACTCTTCG 


AAAATCTCTT 


7140 


CAAACCGCGT 


CAGCTTTATC 


TGCAACCTCA 


AAACAGTGTT 


TTGAGCAGCn 


CTGCGGCTAG 


7200 


CCTCCTACTT 


TGCTCTTTGA 


TTTTCATTGA 


GCTTTAAAAT 


CCAGTCATGG 


TAATCCCCAA 


7260 


TACGCGCACA 


CCTCTTTCTT 


TCTTGCTTAA 


TT CTTC AT AG 


AGTTGCAGGG 


CTATTTGGCT 


7320 


TATCTGACTA 


GCATCTTGTG 


TTTTTTGAGC 


AAGACTTTTT 


CCTTTGCTAA 


CACTTGAAAA 


7380 


GTCCTCCTAG 


CGGATTTTCA 


AAATGACAAT 


TTTTCCAGCT 


TTTTCTTGTT 


GATGTAGATT 


7440 


GAGAGCGACT 


TTTTCTGATA 


GAAGACTCAC 


CTCTTTTTTG 


ATATCTTCCT 


CAGCAAGGAG 


7S00 


AATCTTCCCG 


TAGGTTTTCT 


CCTTGCCCAT 


TGATTTACGG 


ATGCCATTGG 


ATTTGACTGG 


7560 


AGAGTTGTGA 


ATGCCACGAG 


CCTTTCGATA 


CAGATCATAG 


CCTAGTCTAC 


CAAAACGGTC 


7620 


TATTAGGGTT 


ACCTCAGGAA 


CTTCAAGTAA 


ATCAGCACCA 


CTAAAAACCC 


CCATTTGATG 


7680 


AAGACGTTCT 


ACTCTCTTTT 


TTCCTACTCC 


ATGAAATTTG 


GAAATATCCA 


TTTGTTTGAG 


7740 


AAAATCCTCA 


CCCTGTTCAG 


GTAGAATCAC 


TGTCAAACCA 


TGTGGTTTTT 


GAT AATC ACT 


7800 


CGCCATTTTA 


GCTAAGAATT 


TCTTCTAACA 


AACGCCTCCG 


GAAGCACTTA 


CATGGACTTC 


7860 


TTGCCAGATA 


TCTTTTTGAA 


TGAGGCGAGC 


AATTTTGACC 


GCTCACTTGA 


TACCGAGTTT 


7920 


ATTTTCTGTC 


ACATCCAAAT 


AGGCTTCGTC 


AATGCTCATG 


GGTTCAATCA 


AATCTGTATA 


7980 


GCGCTTAAAA 


ATAGCTCGAA 


TCTGGAGTCC 


CACAGACTTG 


TATTTCTCAT 


AATTCCCTGA 


8040 


GATAAAGACA 


CCCTGGGGAC 


AACGTTCATA 


AGCTTCCTTG 


GAACTCATGG 


CAGAATGGAC 


8100 


ACCAAAAGCT 


CTTGCCTCAT 


AACTACAGGT 


AGAAACGACT 


CCCCCTCCAC 


CTGTTTGCCG 


8160 


AGGGTCGCTT 


CCAATAATGA 


CAGCTTTTCC 


TCTGAGTTTA 


GG ATT AT C CC 


TGATTTCCAC 


8220 
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TGCACCAAAA 
AGGAAAAATC 
CAGCTCTATm 
CCTTTTCGAT 
AAAAAAGAAA 
ACTGTTTTAG 
T ATGGT AT AC 
ATATGGTTGT 
TCAAAGGCGT 
CACCTTATGA 
AGAAAATTGT 
GTCCAACATC 
TCGGTATCCA 
TGGCTGAAAC 
TCACTAAATA 
GTCGCGCTCG 
TCATCGGTCT 
TAAATGACTG 
TAAACCTTCA 
ATGTTCGCAA 
TGGCTGTCTG 
TGGACATCTT 
AATTCGTTGA 
ATGACCAATT 
* ACGACGGTCG t 
TCGGTAACTC 
TCCGTCGCTA 
CAACAATGGC 
TTGATCCAGA 



AAGGCATCCA 
AACATGCCTA 
TGCAACCTCA 
TTCCATTGAG 
AAAGGACTTT 
AAAAGAAAGT 
TTGTCTTATG 
TAAGACAGTT 
AGATTGGAAA 
TGGAGACGAA 
AGAAGAAACT 
TATCGCTGAT 
AAACGATGAA 
TACTTTGAAA 
TGTAACAACA 
TCACGCACAC 
TTACGCACGT 
GAATGCAATC 
ATACCAAGCA 
ACCAGCGATC 
CCGTGTGATT 
TGCAGAACGT 
TGATTTCCTT 
GTACTCAGGT 
TCACCGTGTT 
ACCAGAACCA 
CTGTATGCAC 
TAAAGACGGA 
AAATGAAGAA 



TGTCAATATG 
GCACCTTTTT 
AAACAGTGTT 
TCTTACTGCT 
ATTTTTTCAA 
GTAAAAATAG 
AATGTAACAG 
GTTGAAGCAC 
GAAAAAGCAA 
AGCTTCCTTG 
AAAGCACACT 
ATCCCTCCTG 
CTCTTCAAAT 
GAAAATGGAT 
CTTAACGACG 
ACTGTAACTG 
CTTGCTCTTT 
AAAGAAATCG 
TTGCAACAAG 
AACGTGAAAG 
AACCGTGCTG 
GACCTTCCTC 
ATGAAACTTC 
GACCCAACCT 
ACTAAGATGG 
AACTTGACAG 
ATGAGCCACA 
TATGGTGAAA 
CAACGCCACA 



463 
GATGATTTTT 
ATACTCTTCG 
TTGAGCAATC 
TATTyTCTTT 
AAATATAATA 
GGAATTTTCA 
ATGACTGTTA 
AAGATATTTT 
GTGTATCACG 
CAGGACCAAC 
ACGAAGAAAC 
GATTTATCGA 
TGAACTTCAT 
ACGAACCAGA 
GTATTTTCCG 
GTCTTCCAGA 
ACGCTGCAGA 
ATCAAGAAAC 
TTCTTCGCCT 
AAGCAATCCA 
CTACATCTCT 
CTGCTACATT 
GTACAGTTAA 
TTATCACAAC 
ACTACCGTTT 
TTCTTTGGAC 
AACACTCTTC 
TGAGCTCTAT 
AC ATCC ACT A 



CTTGACAAAT 
AAAATCTCTT 
TGCGGCTAGC 
TATTATACCC 
CAGTTTCAAA 
CTTGTTGAAA 
CTACAAAAAA 
TGACAAAGCT 
ATTTGTACAA 
AGAGCGTTCA 
TCGTTTCCCA 
CAAAGAAAAT 
GCCAAAAGGT 
CCCAGCTCTT 
TGCCTACACT 
TGCATACTCA 
CTACTTCATG 
AATCCGTCTT 
GGGTGACCTT 
ATGGGTTAAC 
AGGTCGTGTA 
TACTCAATCA 
ATTTGCTCCT 
TTCTATGGCT 
CTTGAACACT 
TGACAAATTG 
TATCCAATAC 
CTCATGCTGT 
CTTCGCTGCT 



CATTTAACAA 
CAAACCACGT 
TTCCTACTTT 
TTTTTTCTGA 
TAAAATATAG 
TCGGTTACTA 
GAGGACATTA 
TGGGAAGGCT 
GCTAACTACA 
CTTCACATCA 
ATGGACACTC 
GAAGTTATCT 
GGTATCCGTA 
CACGAAATCT 
TCAAATATTC 
CGCGGACGTA 
CAAGAAAAAC 
CGTGAACAAG 
TACGGGGTTC 
ATTGCTTTCA 
CCAATCGTAT 
GAAATCCAAG 
ACAAAAGCTT 
GGTATGGGTA 
CTTGACAACA 
CCATACAACT 
GAAGGTCTAA 
CTCTCTCCAC 
CGTGTAAACG 



8280 

8340 

8400 

8460 

8520 

8S80 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 
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TTCTTAAAGC 


CCTTCTTACT 


GGTTTCAATG 


GTCGTTACGA 


CCATGTTCAC 


AAACACTACA 


10020 


AAGTATTTGA 


TATCCAACCA 


ATCCCTGACG 


AACTTCTTGA 


ATTTCAATCA 


GTTAAAGCGA 


10080 


ACTTTGAAAA 


ATCTCTTGAC 


TGGTTGACTG 


ACACTTACGT 


AGATGCCTTG 


AACATCATCC 


10140 


ACTACATGAC 


TGATAGCTAC 


AACTACGAAG 


CTGTTCAAAT 


GCCCTTCTTG 


C CAACTAAAC 


10200 


AACGTGCCAA 


CATCGGATTC 


GGTATCTGTG 


GATTTGCTAA 


CACTCTTGAT 


ACATTGTCAG 


10260 


CTATCAAATA 


CGCTACAGTT 


AAACCAATCC 


GTGACGAAGA 


TGGCTACATC 


TACGATTACC 


10320 


AAACAATCGG 


TGACTACCCA 


CGCTGGGGTG 


AAGATGACCC 


ACGTTCAAAC 


G AATTGGC AG 


10380 


AATGGTTGAT 


CGAAGCTTAC 


ACAACTCGTC 


TACCTAGCCA 


CAAACTATAC 


AAAGACGCAG 


10440 


AAGCTACAGT 


ATCACTTTTG 


ACAATCACAT 


CTAACGTTGC 


TTACTCTAAA 


CAAACTGGTA 


10S00 


ACTCACCAGT 


TCACAAAGGT 


GTATACCTCA 


ACGAAGATGG 


TTCTGTGAAC 


TTGTCTAAAC 


10560 


TTGAATTCTT 


CTCACCAGGT 


GCTAACCCAT 


CTAACAAAGC 


TAAAGGTGCT 


TCGTTCCAAA 


10620 


ACTTGAACTC 


ACTTTCTAGC 


CTTGACTTTA 


GTTATGCAGC 


TGACGGTATC 


TCATTGACTA 


10680 


CACAAGTATC 


ACCTCGCGCT 


CTTGGTAAGA 


CTCGTCATGA 


ACAAGTTGAT 


AACTTGGTAA 


10740 


CAATTCTTGA 


TGGTT AC TTC 


GAAAACGGTG 


CACAACACCT 


TAACTTGAAC 


GTTATGGACT 


10800 


TGAACGATGT 


TTACGAAAAA 


ATCATCTCAG 


GCCAAGACGT 


TATCGTACCT 


ATCTCTGGAT 


10860 


ACTCTCTAAA 


CACTAAATAC 


CTCACTCCAG 


AACAAAAAAC 


TGAATTGACA 


CAACCTGTCT 


10920 


TCCACGAAGT 


TCTTTCAATG 


GATGACGCCT 


TGGATGCATT 


GAGCTAATCA 


AGTTCTTGAA 


10980 


TAATAAAAAG 


GAACCCTCGG 


TCAAACGACT 


G AGGGTTTTC 


TGCTTGGGAT 


AGTATGACCA 


11040 


ATTCCTTCGG 


CCCAATATGC 


AATGTTTTTG 


GGCTCTTTGT 


CAACTCTAGT 


GGGTTGAAAA 


11100 


AAAGCTAAGC 


TTGAGAAAGG 


ACAAATTTCG 


TCCTTTCTTT 


TTTCATGTTC 


AGGGCGATAA 


11160 


AAATCCGTTT 


TTTGAAGTTT 


TCAAAGTTCC 


GAAAACCAAA 


GGCATTGCCC 


TTGATGTCTT 


11220 


TGATGAGTTT 


GTTAGTCGCC 


TCAAGTTTAG 


CGTTAGAATA 


ACGCAATTCA 


ATGGCGTTAC 


11280 


TCATCTAGTT 


TTTATAGCAA 


ATAAATGTCC 


TCAAAGTGGT 


TTTAAACGTG 


CGGTTGAGAT 


11340 


GAGGTAACGT 


GTCTTGAATT 


AAGCCCCAAA 


ACTGCTCAGT 


ATTCTTCTCT 


TGTAGATGAA 


11400 


ATAGGACTAG 


TTGATACAGC 


TCATAGTAAT 


CTTTAAGTTC 


AGGTACTAGA 


GTAAAGATTT 


11460 


TCTTCAGACA 


CTCCCTAGGA 


GTTAAGGTCT 


CTCTGAAAGT 


TCTAGCATAG 


AAACGCTTAA 


11520 


GAGAGAGTTT 


CCCACTATCT 


TTTAGGATAA 


ATTTCCAGTA 


ATATTTAAGA 


CCTCTGTATT 


11580 


CCAGAGATTT 


ATCATCAAAT 


TGCTTCATGA 


TGTTGATTCT 


AGTCTGATTA 


AGAGCCCTGC 


11640 


TCATCTGTTG 


GACAATGTGG 


AAACGATCCA 


GAACAATTTT 


AGCATTGGGA 


AATAATTTCT 


11700 


TAATGAGAGG 


GATATAACTT 


CCAGACATAT 


CAACAGTCAC 


GACTTTAACT 


TTTTTTCTAG 


11760 
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CTTCTTTCGA GTACTTGAAG AAATGATTTC GCATGCTTGT TTGACCTCTG TTATCAAGAA 11820 

TGCTCATGAT TTTCTTAGTG TTGAAATCCT GAGCAATGAA AGCCAATTTC CCCTTCTGGT 11880 

AGGAGAATTC ATCCCAGGAC AGGATTTCAG GCAAAGTGGT CTAATCCTCT TGGAAATCAA 11940 

ATTGCTTGAG CTTACGATAC ACGGTAGAGG TAGAGGTAGA GCTAGAGATG CCTAATTTAG 12000 

AAGCGATATG TGTAAGAGCC TCTCTGTTGA GTAGGAGTTG GGCAATTTTC TGTCTCACCA 12060 

TTTCCCAGAT TTGGCAATTT TTCTCAACCA GAGTTGTTTC AGCTACACTG ACTTTCCGAC 12120 

AGGACTTGCA TTGAAATCGT CTCTTTTTCA AATGAATGAG GCTAGGGAAA CCACCAATCT 12180 

CGATAAAAGG GATTTTAGAA GGCTTTTGGA AGTCCTATTT GATTTGTTTT CCTTTACAGT 12240 

GTTTACATTT AGGTGGGTGA TAATCAAGTG TAGCGAAGAC TTCGATATGC GTATCGTGCT 12300 

GAATGGCTTT ATTTAAGGTG ATGTTTTTGT CTTTTATTCC GATGAGTAAT GTGGTATGAT 12360 

TGATCTCTTC CATAAGATAC TTTCTAATGA GTTGTTTAGG CGCTTTTCAT TATAAGTCTT 12420 

ATGGGACTTT TTTGATACTC AAAAAGCCCT ATAATCTCCA CACTCGGATT TACCCACTAC 12480 

AGAAATTATA GAGCCACAAA AAACACTTTT GTTCACTAGC AGAAACTAGA GAGCAGAAGT 12540 

GTTTTTCTCT TCAGATTTAC CCAAAACTGG GAAATATGGG GATAAGAATA CACATGGCTT 12 600 

AGGAAGCCCC TTTTTCT GTG TAGACAGTAC GATGAACTTA TAACAAATAG TGAGCCTTTT 12 660 

TAGCAATCAT TGCGACCCGT TTCTCAAAAG CCTCTTTTCG GATATCTACA ATTCTCTCAT 12720 

ACATGAGACC CTGTTGGCTA ACATGCAAAT CTAACCCAAT CGTCAAAAAG TGAT G TTTCC 12780 

CTTTGGGATA CTGCTTTTTA ACGTAAGGCA GGTATTCTTT CCTTGTAATA ATAATCAATG 12 840 

GCTCTCTCAA ATGCTCCTCT GAAGGAGGAG GACTAATTAG AATATTGTAT CCTGTAACAG 12900 

AGGCAACTTT GTCAGTAAAA TTCCGTAAAA TAATGGACTT TATTAACTTT ACATCTGCTT 12960 

GATTATTTAA AATGATAAAA ATCGGGATAC CAGGTAGTGA GGAAAAGATG GTTTCTGTCA 13020 

AGTAGAGTGA GAAAACGTAC AGCCGATGCT CGTCCATAAC TCCTTCAATC TTCTGCTCAG 13080 

TCATCCACTC TTGAACAATT GCTTTCGAAA TATGATACAG TGGCTTGTCG CTTTCAATCC 13140 

CATAATGTTC GTAATAATTA TAATAGGGAA CTAGATTTTG TAAACCAAAC AAAAACGTTC 13 200 

TTGTTAACAA AGTCAGTCCT GTTAAAAAAG AAAGAGAATT CGAAATGTCA TTTCCTAAGA 13260 

TATTCTTGAA CTTGGATAGT AGATGCTTTC CTCTTGTATG CTGAAGAATC ACTTGAATAC 13320 

TATGAGTCTT TTTTTCTTGA TTCCATTTGT CCTTGGAAAA CGAAGAATTA GCACAACAAT 13 380 

AAACCAAAAA GATATAATCC A G TTCTTCCT GAGTAAAAGT CATGTTGGCA TGTGGCTCTA 13440 

AGTAAGTTTG GCAATGTTCC AT C AAAATCG GATACATAAA GAGGTTTTTT AATTTTTCAA 13500 
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ACTCTTTGGA CTCACGGAAC TCAAGTGGAA ATTCCCGACC TTTCCAAGTG AGTGCCACTA 13560 

GTATGCTAAA ATGAACATAC TCGTCAGGTG TGATTTCTAA CACTTCATGA CTGAGTTGAG 13620 

AATTAGACTG CACAATCATA TGTGTGACCC AATCCATACT TCCATCATTC AAATCATAAA 13680 

TCTCAATACC AAAATGAAAC TGGAGGAGTG CAATTAAAAA ACGAATGCGA TATTCAGGAC 13740 

CAACTACTTG ATTTTTCACA AGGTCCAAAC CTACTGAACG TKCTAACAAG CCACACTTTT 13800 

GTCCTACGCG GTACCCTGTT GCGATGGAAA TATACTCTTT TTGTGTAAAT TCGTTAAAGC 13860 

TTTGATTACC TTGTAGTAGA AAGAAGCGGA GTATTTTTAA AATAGTTGAT TCGTTATAAA 13920 

GCTGATGGAA GTAATAATTC GTTTGATGAG AATGGTGTTC CATTAATTGA ACTTGTTGCG 13980 

TATCTAAATT AAATGTCAAC TCTTCCTCGA ATGTTTCTTG TAATTCCTGC AAAATGCTTA 14040 

GGAGACTTTT AGATTGTAAT CAAGTTAAAG TAGACAGTTC ATCTAGTTCA ATAGACCGAA 14100 

TATCCAATAA TATATTTAAA ATGCTAATTT TATCTGTAAT TCTTTTTTCA ATGTATTTGT 14160 

TTAGCATAGT TACCGAATCT TAGTTGCATA TAGATAATTT TAATTATTAT AATACAAAAG 14220 

AAACTAATTG TCTTGTCAAA AACGTTGTGG AATTTCCGAC TTTATTGATA AAACAGCATG 14280 

TAATAAAAGG CATTTTAAAG ATAGTAATGA GTATTGGTGG AGTTTTATGG CTTATTTTTT 14 340 

TTATTAGAAA ATATTTTTTT ATCAAATATT GTCGTTCTAT AAAAAAATAT GTGATAAAAA 144 00 

TATCTATTGT CATCGAAGTT GTTTTAATTT ATACTAGGAT AGTTAATAGT AA TACT AT AC 14460 

TATACTATAT TGTATACAAG TGTGTCATTG CCAGGTTGAG AAGATAGCTA TAACGCACTT 14520 

TTATACGCTT TTGCTACGTT TGTTAGTGAA CGGATTAACT CAGTGAGATA AATTTTATCA 14 580 

CAACATAACT AATCCGTTTC TTCGTGTATA CAGATTCAAA GTACCTATGA ATCATAGAAG 14 640 

GATTAACTTG TTCTATGAAT AATGCTTAAC AGGGAGACAC ACATGAAAAA ACTAACAAAG 14700 

A7ATTTCAGA AGGCACTTGC AGGACTGTGC TGTATATCTC AGTTGACAGC TTTTTCTTCC 14760 

ATAGTTGCTT TACCAGAAAC GCCTGAAACC AGTCC AG CG A TAGGAAAAGT AGTGATTAAC 14820 

GAGACAGGCG AAGGAGGAGC GCTTCTAGGA GATGCCGTCT TTGAGTTGAA AAACAATACG 14 880 

GATGGCACAA CTGTTTCGCA AAGGACAGAG GCGCAAACAG GAGAAGCGAT ATTTTCAAAC 14940 

ATAAAACCTG GGACATACAC CTTGACACAA GCCCAACCTC CAGTTGGTTA TAAACCCTCT 15000 

ACTAAACAAT GGACTC7TGA AGTTGAGAAG AATGGTCGGA CGACTGTCCA ACGTGAACAC 15060 

GTAGAAAATC GAGAAGAGGC TCTATCTGAC CAGTATCCAC AAACAGGGAC TTATCCAGAT 15120 

GTTCAAACAC CTTATCAGAT TATTAACGTA GATGGTTCGG AAAAAAACGG ACAGCACAAG 15180 

GCGTTGAATC CGAATCCATA TGAACGTGTG ATTCC AG AAG GTACACTTTC AAAGAGAATT 15240 

TATCAAGTGA ATAATTTGGA TGATAACCAA TATGGAATCG AATTGACGGT TAGTGGGAAA 15300 
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ACACTGTATC 
TCAAATAGTA 
CCGACACGTT 
ACTTATGCTT 
AAAAACGGAA 
ACCAATACCA 
TTAAAAAATA 
TTCCCTGCCA 
GCGAGACAAA 
TATCCGATTA 
TTTTTTAGTA 
ACTAGTGGAG 
AACACAGTTT 
GAAATGAAGG 
CTTAATTGCA 
AATCATGGTG 
GTCTTTACGG 
AGTTTTATCC 
AAAATATTGG 
GAGAATGGTA 
GCAAGATTTG 
GGACAAGCTG 
TATGATACGA 
GTTACGTTGA 
ACCAATGGTC 
CCGATTCCTA 
AAACTTGGTG 
GCGCTCTTTA 
CAAAATGGCA 



AACAAAAAGA 
TGAGTAACAT 
CTCTTATTGA 
CCACTATCTT 
AGCGATTGAA 
AAGATTATAG 
AGGTACCTAC 
CTTTTACTCA 
ATAGTCAAAA 
ATTTTAATCA 
AATCTCCTAA 
AACATACAAT 
ATGAAAAAGG 
CGGCTCGTTA 
CAGAGAGTAT 
ACCCTACAAG 
TAGGTATTGG 
AAAGTATTTC 
AACAGTTGAA 
CGATTACAGA 
ATCCAGCAGA 
TAGGTGGTCC 
CTGAGAAAAG 
CCTACAATGT 
GAACAACCTT 
AGATTCGTGA 
ACATTGAGTT 
GTCTTCAAAA 
CTTATCAAAA 
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TAAGTCTGTG CCCCTGGATG TCGTTATCTT GCTCGATAAC 15360 

TCGAAACAAG AATGCTCGAC CTGCGGAAAG AGCTGCTCAG 15420 

TAAAATTACA TCTGATTCAC AAAATAGGCT AGCGCTTGTG 15480 

TGATGGGACC GAGTTTACAG TAGAAAAAGG GGTAGCAGAT 15540 

TGATTCTCTT TTTTGGAATT ATGATCAGAC GACTTTTACA 15600 

TTATTTAAAG CTGACTAATG ATAAGAATGA CATTCTAGAA 15660 

CGAGGCAGAA GACCATGATG CAAATAGATT CATGTACCAA 15720 

GAAAGCTTTG ATGAAGGCAG ATGAGATTTT GACACAACAA 15780 

AGTCATTTTC CATATTACGG ATGGTGTCCC AACTATGTCG 15840 

TGCTACGTTT GCTCCATCAT ATCAAAATCA ACTAAATGCA 15900 

TAAACATGGA AT ACT ATT AA GTGATTTTAT TACGCAAGCA 15960 

TGTACGCGGA GATCGGCAAA GTTACCAGAT GTTTACAGAT 16020 

TGCTCCTGCA GCTTTCCCAG TTAAACCTGA AAAATATTCT 16080 

TGCAGTTATA GGCGATCCAA TTAATGCTGG AT AT ATTTGG 16140 

TCTGGCTT AT CCGTTTAATT CTAATACTGC TAAAATTACC 16200 

ATGGTACTAT AACGGGAATA TTGCTCCTGA TGGGTATGAT 16260 

TATTAACGGA GATCCTGGTA CGGATCAAGC AACGGCTACT 16320 

TAGTAAACCT GAAAACTATA CCAATGTTAC TGACACGACA 16380 

TCGTTATTTC CACACCATCG TAACTGAAAA GAAATCAATT 16440 

TCCGATGGGT GAGTTAATTG ATTTGCAATT GGGCACAGAT 16500 

TTACACTTTA ACTGCAAACG ATGGTAGTCG CTTGGAGAAT 16560 

ACAAAATGAT GCTGGTTTGT TAAAAAATGC AAAAGTGCTC 16620 

GATTCGTGTA ACAGGTCTGT ACCTTGCAAC GGATGAAAAA 16680 

TCGTTTGAAT GATGAGTTTG TAAGCAATAA ATTTTATGAT 16740 

ACATCCTAAG GAAGTAGAAC AGAACACAGT CCGCGACTTC 16800 

TGTGCGGAAG TATCCAGAAA TCACAATTTC AAAAGAGAAA 16860 

TATTAAGGTC AATAAAAATC ATAAAAAACC ACTGAGAGGT 16920 

ACAACATCCG GATTATCCAG ATATTTATGG AGCTATTGAT 16980 

TGTGACAACA GGTGAAGATG GTAAGTTGAC CTTTAAAAAT 17040 
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CTCTCAGATC CGAAATATCG ATTATTTGAA AATTCTCAAC CAGCTGCTTA TAAACCCGTT 
CAAAATAAGC CTATCGTTGC CTTCCAAATA GTAAATGGAG AAGTCACAGA TCTGACTTCA 
ATCGTTCCAC AACATATACC AGCGGGTTAC GAGTTTACGA ATGATAAGCA CTATATTACC 
AATGAACCTA TTCCTCCAAA GAGAGAATAT CCTCGAACTC CTGGTATCGG AATGTTGCCA 
TTCTATCTGA TAGGTTGCAT CATGATGGGA GGAGTTCTAT TATACACACG GAAACATCCG 
TAAAGTGTAG AAATGATAAT ATCTATGTTC TGAACGATAC TTTTAAGAAG TAGCACTCAA 
GAAGAGATTT AAGTTTACTT CGTGAAACCT GTTTTATTCG TAAGTAAACT ATCATTGAAA 
GGGGAGATCT TTTCGAAAAC TTGCACAGAA AAAGCATTAT TATTCTCATG TGTAATTCAT 
TACATTGCTC ACAGTTCATT TTAAGAGATA TGAATAAGGA GAAATCATGA AATCAATCAA 
CAAATTTTTA ACAATGCTTG CTGCCTTATT ACTGACAGCG AGTAGCCTGT TTTCAGCTGC 
AACAGTTTTT GCGGCTGGGA CGACAACAAC ATCTGTTACC GTTCATAAAC TATTGGCAAC 
AGATGGGGAT ATGGATAAAA TTGCAAATGA CTTAGAAACA GGTAACTATC CTGGTAATAA 
AGTGGGTGTT CTACCTGCAA ATGCAAAAGA AATTGCCCGT GTTATGTTCG TTTGGACAAA 
TACTAATAAT GAAATTATTG ATGAAAATGG CCAAACTCTA GGAGTGAATA TTGATCCACA 
AACATTTAAA CTCTCAGGGG CAATGCCGGC AACTCCAATG AAAAAATTAA CAGAAGCTCA 
AGGAGCTAAA TTTAACACGG C AAATTT AC C AGCTGCTAAG TATAAAATTT ATCAAATTCA 
CAGTTTATCA ACTTATGTCG GTGAAGATGG AGCAACCTTA ACAGGTTCTA AAGCAGTTCC 
AATTGAAATT GAATTACCAT TGAACGATCT TGTGCATCCG CATGTCTATC CAAAAAATAC 
AGAAGCAAAG CCAAAAATTG ATAAACATTT CAAAGGTAAA GCAAATCCAG ATACACCACC 
TGTAGATAAA CATACACCTG TGAACCACCA AGTTGGAGAT GTTGTAGACT ACGAAATTGT 
TACAAAAATT CCAGCACTTG CTAATTATGC AACAGCAAAC TGCAGCGATA CAATGACTGA 
AGGTTTGGCA TTCAACAAAG GTACAGTGAA ACTAACTGTT GATGATCTTG CACTTGAAGC 
AGGTGATTAT GCTCTAACAG AAGTAGCAAC TCGTTTTGAT TTGAAATTAA CAGATGCTGG 
TTTAGCTAAA GTGAATGACC AAAACGCTCA AAAAACTGTG AAAATCACTT ATTCGGCAAC 
ATTGAATGAC AAAGCAATTG TAGAAGTACC AGAATCTAAT GATGTAACAT TTAACTATGG 
TAATAATCCA GATCACCGGA ATACTCCAAA GCCGAATAAG CCAAATGAAA ACGGCGATTT 
GACATTGACC AAGACATGGG TTGATGCTAC AGGTGCACCA ATTCCGGCTG GAGCTGAAGC 
AACGTTCGAT TTGCTTAATG CTCAGACTGC TAAAGTTGTA CAAACTGTAA CTTTGACAAC 
AGACAAAAAT ACAGTTACTG TTAACGGATT GGATAAAAAT ACAGAATATA AATTCGTTGA 
ACGTAGTATA AAAGGGTATT CAGCAGATTA TCAAGAAATC ACTACAGCTG GAG AAATTGC 



17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18C0O 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

18600 
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TGTCAACAAC 
TACATATCGT 
ATTTGTAATT 
GAGTCAAGAA 
TGCTTATAAC 
ACCTCAAGCT 
AGATAAGGAC 
TACAGGCCTT 
ATTACTAACT 
AGGCATTGAG 
AATCACTATC 
GATTATGGGT 
TTAAGTAAGA 
GTATCTTCTT 
CCCAAGAAGA 
TGCCATCTCG 
ATGATCGGGT 
TCAAAAACAC 
CAAATGGTCT 
AATTTCTTTT 
CAGATACAAT 
AGGGTGTCGG 
TGATTGGAGA 
AAAATGGAGA 
AGCCACTGGC 
AGCTGGTGAC 
AGGTGGATGG 
AAAGCGGACA 
AAGATGGTCG 



TGGAAAGACG 
AAAAAGTTTG 
GCAAATGCTG 
GAGAACCACT 
GCTCTTACTG 
GCTTATAATG 
AATGAAAATG 
CTTGCAGGTA 
ACCCCTCAGA 
TATACTGCTG 
CCACAAACCG 
ATTGCAGTGT 
GAGAAAGGAG 
TCTTATGGCT 
T C ACACG TTG 
TGATGGTCAT 
GCAAATTGTA 
TTCGTTTGAG 
TTACTATGTT 
TGAAATCACA 
GACAACAAAG 
CTTTAAATTG 
ATACCCTTAC 
GATTTTTGTG 
AGGCTATGCT 
GATTACGGTT 
TCGGACCAAT 
CTATACTCCT 
TTTCCGAGTG 



AAAATCCAAA 
TCAAAGTTAA 
ATAATGCTGG 
TGGTTGTTAC 
CACAACAACA 
CTGCTGTGAT 
TTGTGAAATT 
CATATTACTT 
AATTTGAAGT 
GTTCAGGTAA 
GTGGTATTGG 
ACGCATATGT 
CCATTGATGA 
CTGTGTTTTT 
GTCTTGCAAT 
CGGTTGCAAG 
ACAGACTTGC 
ATGACCTTCC 
CCCTCTATTA 
GATCAAACGG 
GTGAAGCTGA 
GTATCAGTAG 
AGTTCTTCTG 
ACAAATCTTC 
GTTACGACGC 
GTCAATCAGA 
ACCTCTCTTC 
GTTCTTCAAA 
GAAGGTCTAG 
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ACCACTTGAT 
TGATAAAGAT 
TCAATATTTA 
AACAAAGGAT 
AACTCAGCAA 
TGCTGCCAAC 
AGTTTCTGAT 
AGAACAAACA 
CACTGCAACT 
AGATGACGCT 
TACAATTATC 
TAAAAACAAC 
CAATGCAGAA 
CTCTTGTATC 
TGCAGAACTA 
TATGCAAGTT 
ATTCGTGGGA 
TTGAGAATCA 
TCCAGACGCA 
TAGAGCCTTT 
TAAAGGTGGA 
CAAGACATGT 
GTCAAGTAGG 
CTCTTGGGAA 
TGGATACGGA 
AATTACCACG 
AAGGGGCAAT 
ATGGTAAGGA 
AGTATGGGAC 



CCAACAGAGC 
AATCGTTTAG 
GCACGTAAAG 
GCTTTAGATA 
GAAAAAGAGA 
AATGCATTTG 
GCACAAGGTC 
AAACAGCCTG 
TCTTATTCAG 
ACAAAAGTAG 
TTTGCTGTAG 
AAAGATGAGG 
AATGCAGAAA 
GGGTGCACAT 
TCAGGAGGTG 
GGATGATTCG 
TCAGAATAAA 
GATTGAAGTA 
TCCGGTTTCT 
GGTCATTCTA 
TCAAGACCAC 
TTCTGAAAAA 
GAG AACTCTC 
CTATCGTTTC 
TGTCCAGCTG 
TGGCAATGTT 
GTTC AAAGTC 
AGT AGTTGT A 
ATACTATTTA 



CAAAAGTTGT 
CTGGGGCAGA 
CAGATAAAGT 
GAGCAGTTGC 
AACTTGACAA 
AATGCGTGGC 
CCTTTGAAAT 
CTGGTTATCC 
CGACTGGACA 
TCAACAAAAA 
CGCGGGCTGC 
ATCAACTTGC 
ATGATTAGTC 
GCAGTCCAAG 
GTTAGTCAAT 
TATTCCTATG 
CTTTCTTCTT 
TCTCATATTC 
TATCCAGCTC 
GCCAAAAAAA 
AATCCCTTGG 
GAGCTTCCCT 
TATACTGATA 
AAGGACGTGG 
GTAGATCATC 
GACTTTATGA 
ATGAAAGAAG 
ACATCAGGGA 
TGGGAGCTCC 
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18960 

19020 
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19260 

19320 

19380 

19440 
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AAGCTCCAAC TGGTTATGTT CAATTAACAT CGCCTGTTTC CTTTACAATC GGGAAAGATA 
CTCGTAAGGA ACTCGTAACA CTGCTTAAAA ATAACAAGCG ACCACGGATT GATGTGCCAG 
ATACAGGGGA ACAAACCCTT GTATAT C TTC ATGCTTGTTG CCATTTTCTT GTTTGGTAGT 
GGTTATTGTC TTACGAAAAA ACCAAATAAC TG AT ATT CAA TGT A CATC AT TATGAATAGG 
ATAGCAGGCT GAAGGGAAGA CCAGAGTACT CTGACCTGAT GTTAATCAGG AATCATGGTG 
ATGTGGCATG AATCATCAAT AACGGATATC ACCCTCGGCA GATTGTGCCA GCCTCATTGT 
GGGTTATTGT TTGTAAAACC ATAGGACTGC TCTCGTAATC ATTTTA 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21040 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CCCAGCAAAA AGCCATCCGA AGATCACTTT TTTGCTATTT AATTTCTGTA TAAGTTACTT 60 

CCAAGCCACG CTTAACAGCT GGACGATTGG CAATTTTTTC TGCCCATTTT ACTAGATTTT 120 

GATAACTTGA GGCATCCAAG AATTTTG C AG AACCTTGGTA AAGATTTCCT TCAACTAACT 180 

GTCCATACCA AGACCAGATA GCAATATCTG CAATCGTATA CTCATTGCCT GCAATATAAG 240 

CTTTCTGAGC CAATTCCTTA TCCAATAAAT CCAACTGGCG TTTCACTTCC ATCCTAAAAC 300 

CCTTAATACC ATATTCCAAT TTTTCAGGAG CATAATTGAA GAAATGTCCA AATCCCCCAC 360 

CTAGAAAAGG TGCTCCACCT GCTTGCCACA ATACCCAATT CAAAACTTCT ACCTTTTCCA 420 

CAGCATTACT TGGTAAAAAG GCTCCAAATT TCTCAGCAAG GTAAAGAAGA ATATGAGCAG 480 

ACTCAAAGAC TCTTACGTTT TCAGTACCTG ACTGGTCCAA TAAGGCTGGA ATCTTGGAAT 54 0 

TTGGATTGAG CTTCACAAAG TCTGATCCCA ATTGATCCCC ATCCATGATA GCAATCTTAT 600 

ACAAGTCGTA AGCCGCTTCC TTAAAACCAG CTTCTAGTAA TTCTTCCAAT AAGATAGTAA 660 

CCTTCACACC ATTTGGTGTT CCCAGTGAAT AAACCTGAAA AGCTTGTTCT CCTTTTGGCA 720 

AGTTTTGTTC GAAACGGGCA CCTGCTGTTG GTCTGTTTAG CCCCGTAAAA CCTCCTTGAT 780 

TACTAGCTTC ATCCTGCCAT ACGGTCGGTA ATTGATATGC TGACATCCGA AACCTCCCTT 840 

AAATCGCATT CTTGTCAAAA CCCAGTTTGC GTTGAATAAA CTTAACGATT TCGACGATGA 900 

TAATCATTGA GAAGCTTCCA GCCATAACAA TTCCCCATTC TCACAACTCT AGTTTGGTTA 960 

CGTGGAAGAT TCCTTCAAGC GGTTCTACAA CGATTGTTGC CATGAGAAGG ATAAAGGATA 1020 



20640 
20700 
20760 
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20880 
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CCAAGATGGA 


A 1 'I'll' IV k K ^ 

CCAGTTAAAG 




TGAATflfiCCC 


AACTGTCAAG 


ATGGATTGGT 


1080 


AGACAGACTT 


CACATTGTAG 






ACCAAGGGTT CCAAAGGCCA 


1140 


TCGTTAGGCC 


ATCTGCATGA 


AT ACCAT\»A 1 




ATCAACTGGG 


TAAGCAATCG 


1200 


CAAGGCC ATA 


AACACTCATA 


ACAAGAGCTO 


v.1 1 uunu A Aw 


ACCTTGATAA 


ATGATAGAAC 


1260 


TCAAAACACC 


ACCTGAGAAG 


AAGCTTGCCT 




TGGTTTATGA 


TTCATGACAC 


1320 


CAGGTTCCGC 


AGGTTCAACA 


CCAAGAGCGA 


TAGV-TGGQ»AA 


GGTATCCGTT 


ACCAAGTTGA 


1380 


TCCACAAAAG 


ATGAACCGGC 


TGTAAGACAT 


CCCAnLLAAA 


CAAGGTTGAT 


AGGAAGATGG 


1440 


TTAATACTTC 


AGCAGTATTA 


GCAGAAAGTA 


GGTACTGAAT 


AGTCTTTTGA 


ATGTTTGAGA 


1500 


AGACCTTACG 


TCCTTCTTCC 


ACTGCGACGA 


TAATACi T \- ttU 


AAAGTTATCA 


TCTGCAAGAA 


1560 


TCATATCAGA 


AGCCCCCTTA 


GAAACCTCTG 


TACCAGTGAT 


TCCCATACCG 


ATACCGATAT 


1620 


CGCCTGTTTT 


CAGAGCTGGC 


GCCTCATTGA 


CACCGTCACC 


TGTCATGGCA 


ACGACTTTAC 


1680 


CTTGTTTTTG 


CCAAGCCTTG 


ACGATACGAA 


CCTTGTGTTC 


TGGAGACACA 


C GGGC AT AAA 


1**40 


CAGACTATTG 


ACCAACGACT 


TTTTCAAATT 


CTTCATCTGA 


CAGTTCATTC 


AGTTCAGCAC 


1800 


CAGTTAAAAC 


GTGACCTTCT 


GTATCGTTTG 


CGTCAATGAT 


TCCCAAACGT 


TTGGCAATGG 


1860 


CTTCCGCTGT 


GTCTTGGTCG 


TCACCTGTAA 


TC AT AATTGG 


ACGGATTCCC 


GCTTCCTTAG 


1920 


CCACACGAAC 


AGCCTCAGCG 


G CTTCAGG AC 


GTTCAGGCTW 


AATCATCCCA 


ATCAAACCAG 


1980 


TAAAAATTAA 


ATCATTTTCA 


AGCTCTTCAG 


AAGTGAGATT 


TTCTGGAATA 


CTATCGATAA 


2040 


TCTTATAACC 


ACCTGCAACG 


ACACGCAAGG 


j ft rats* fk 

Li iCATCAGC 


CATTTCAGAA 


TTGTTTGTAC 


2100 


GAATGAGATT TGTAACCTTC TCATCAATCG 


GAGCAATA 


CCCACCCTTA 


TCACGAACAA 


2160 


GACAACGTTT 


TAACACTTGG 


TCTGGCGCAC 


CCiTGALTGU 


TACAAGGAAA 


CGACCATCTG 


2220 


GCAATGGGTG 


AACTGTTGAC 


ATGAGCTTAC 




AAATGGCAAT 


TCAGCTACAC 


2280 


GAGGATATTT 


CTCTAAGAAA 


CCTTTGACAT 




GTCCAAGGCA 


TATTCGATAA 


2340 


AGGCTGTTTC 


GGTTCGGTCA 


CCAATCAAGT 


TACCTTCCAC 


ATCGATTTTC 


GTATCATTGC 


2 400 


CCAAGACAAC 


TGAACGAACT 


AGTGGCATTT 


CAAGACCTAG 


TTCAATATCA 


TC ACCTGAGT 


2460 


CATGTAGAAC 


CGCATCGTAG 


AAGACTTTTT 


CGACTGTCAT 


CTTGTTCATA 


GTCAGCGTAC 


2520 


CAGTCTTATC 


AGAAGCGATG 


ATTTCAGTTG 


AACCAAGTGT 


TTC AACTGCT 


GGCAACTTAC 


2580 


GAACGATGGA 


ATGTCGTTTG 


GCCAAAACTT 


GAGTACCAAG 


AGAAACAACG 


ATGGTAACGA 


2640 


TAGCACGAAG 


TCCTTCTGGA 


ATGGCTGCAA 


CGGCAAGGGC 


AACAGAAGTC 


AACAACTCAC 


2700 


CAAGTGGATT 


TTTCCCTTGA 


ATGAAGACAC 


CCACTACAAA 


AGTAACAAGG 


CCAATGACCA 


2760 
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AGATAGCATA 


GGTCAAGACC 


TTAGAAAGGT 


TGTTCAAATT 


TTGTTTGAGT 


CGTGTATCAG 


2820 


TCTCATCCGC 


ATCTTCAAGC 


ATACCAGCAA 


TATGACCAAC 


TTCAGTCTAC 


AT AC CTGT AT 


2B80 


TGACAACAAC 


ACCCATCCCA 


CGACCATAGG 


TTACGTTTGA 


GTTTTGGAAG 


GCCATGTTGA 


2940 


CACGGTCACC 


AATACCAGCA 


TCTGTCGCAA 


GCTCGACTGA 


CAAGTCTTTT 


TCGACTCGTA 


3000 


CAGATTCACC 


TGTCAAGGCT 


GCTTCTTCAA 


TTTTAAGAGA 


GTTGGCTTCT 


ATCAAACGTA 


3060 


GGTCCGCTGG 


TACCACGTCA 


CCTGCTTCAA 


GGGCAACGAT 


ATCGCCTGCT 


ACCAATTCTT 


3120 


TAGACTCAAT 


CTCTGCCATG 


TCTCCATCAC 


GAAGAACGCG 


GGCAACTGGA 


CTAGACATGG 


3180 


ATTTGAGGGC 


TTCAATAGCT 


TCTTCACCTT 


TTCCTTCTTG 


GTAAACACCA 


AAGGCAGCGT 


3240 


TGATGATAAC 


CACAGCTAGG 


ATGATAATGG 


CATCTCCGAT 


ATCTTCCCCA 


CCAGAAGTCA 


3300 


CGACTGACAA 


GATTGctGCC 


GCAACTAGGA 


TGATAATCAT 


CAAATCCTTA 


AATTGCTCGA 


3360 


TGAATTTGAC 


CAAGATTGAT 


CCTTTCTCGC 


CTTCTTCGAG 


TTCATTGTGC 


CCAAATTCCG 


3420 


CAAGGCGCTT 


TTCCGCCTCA 


CTTGATGACA 


AACCTTGCTC 


CGTCGCATCC 


ACAGCCTGCA 


3480 


ACACCTCTTC 


AGGGCTCTGA 


GTATAAAACG 


CTTGGCGTTT 


TTGTTCTTTT 


GACATGTCTC 


3S40 


TCCTCCTTGA 


CATTGTCTGC 


AAAACAGACT 


CTCTTTCTCT 


CATACCTTTT 


CACGACAAAC 


3600 


AAAAAGAAAC 


CTGTTAATCA 


TAACAAGTCT 


CGCTGTTTAA 


CATAGGGCCG 


GAAAGCATAC 


3660 


TTTTCAGCAT 


AAAATTCGGA 


ATCACGACAC 


TATCACAGGT 


TTCTGCC AGC 


TACTCCCTTG 


3720 


ACT AGT ACCA 


TTATACCAAA 


TTTTGCGGAC 


TTTTCAAACA 


CTAAAAACTC 


CCTTATTTGA 


3780 


ATTTTTCCTT 


GAAAACCAGT 


ATAATGGTAG 


AATGCTATCT 


GACTAGAAAG 


GAACTTGAAT 


3840 


GAAGCAATCT 


ATCTCAAATC 


TCAAGTTAGC 


TGACCCTGGA 


GCCATTATCA 


GTATTTCGAC 


3900 


CTATTTGATC 


TTGTCTGCAG 


CCAAATTACC 


AGCTGGTCAT 


CTCCtTCATT 


CATCCACTTT 


3960 


GGTGGCCGAT 


CGTTTTAATA 


ACGTATCGGA 


CATCATTGCA 


AATCTGGCCC 


TCTTAATCGG 


4020 


GATTCGGATG 


GCGCCCCACC 


TGCAGACCGT 


GACCACCGTT 


TTGGTCATTG 


GAAGATTGAA 


4080 


GATTTGGCAA 


GCTTGATCAC 


TTCTATCATC 


ATGTTCTATG 


TCGGTTTCGA 


TGTTCTAAGA 


4140 


GATACCATTC 


AAAACATTCT 


CAGTCGGGAA 


GAAACCGTCA 


TTCATCCTCT 


TGGTGCAACT 


4200 


CTAGGAATCA 


TTTCTGCAGC 


GATTATGTTT 


GTCGTCTATC 


TCTACAATAC 


TCGCCTCAGT 


4260 


AAGAAATCCA 


ACTCCAATGC 


GCTGAAGGCA 


GCTGCTAAGG 


ACAATCTTTC 


TGACGCTGTT 


4320 


ACCTCACTTG 


GAACCGCCAT 


TGCCATCCTA 


CCTAGTAGTT 


TCAATTATCC 


GATTGTGGAT 


4380 


AAACTGGTTG 


CTATCATCAT 


CACTTTCTTT 


ATCTTGAAGA 


CTGCCTATGA 


TATCTTCATC 


4440 


GAGTCTTCCT 


TTAGTCTTTC 


AGATGGCTTT 


GACGACCGCC 


TGCTCGAGGA 


CTACCAAAAG 


4S00 


GCTATCATGG 


AAATTCCCAA 


AATCAGCAAG 


GTCAAATCCC 


AAAGAGGTCG 


CACCTACGGT 


4560 
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AGCAACATCT 


ACCTGGATAT 


TACACTAGAG 


ATGAATCCTG 


ACTTGTCTGT 


TTTTGAAAGC 


4620 


CATGAAATCG 


CGGATCAGGT 


CGACTCTATG 


CTGGAGGAGC 


GTTTTGGCGT 


CTTTGATACC 


4680 


GATGTCCATA 


TCGAACCAGC 


ACCTATCCCT 


GACGATGAAA 


TTTTAGACAA 


TGTCTATAAA 


4740 


AAATTGCTTA 


TCCGTGAACA 


ATTGATTGAC 


CAAGGAAACC 


AACTAGAAGA 


ACTCTTGACT 


4800 


\J#^ 4 \JW\ 4 4 4 4 VJ 


TCTATATTCG 


CCAAGATGGA 


GAGCAGATGG 


ATAAAGAGGC 


TTATAAGACC 


4860 


AJUVAAAGAGT 


TAAATTCTGC 


TATCAAGGAC 


ATTCAAATTA 


CTTCCATCAG 


TCAAAAAACC 


4920 


A. AA.PTC ATCT 


GCTATGACTT 


AGATGGTATC 


ATCCATACCA 


GTATCTGGCG 


TCGCCACGAA 


4980 




n 4 i \m k * 4 Wn 


TCAAGAAACC 


AAAAAAGAAT 


AGAGAAATCC 


TTTCATGAGA 


5040 


CGGGATTTTT 


^ i #W iWl 1 A 1 


ATACTCAATA 


AAAATCAAAG 


TGCAAATTAG 


GAAGCCGGTC 


S10O 




(TTf*:A.GTCGG 


CAATGTGAAG 


CCGACATAGT 


TTGCACTTTG 


ATTTTCCAAT 


S160 


AGTCTTAACT 




CTGAGATACT 


CATAGCCTTC 


GTATTTTTCA 


AGGAGTGCTT 


S220 


CATTTTTCTC 




TTTTCfiAGAG 
1 1 1 <! uunvmw 


TAGCCAGCTT 


ACCAAAGTCA 


GAGCCGTTAG 


S280 


CCTGUATTi (. 


LiWi i UAn 1 *\ 


CU-AGCGATAC 


GTTTTTCCAA 


GCTTTCAATA 


TCACCTTCAA 


S340 


i /\\_ 1 IwLL^A 


Li(.Ll V»V_ 111 


TCTTGGTAGG 


TCATGCGTTT 


CTTGTCTT CT 


CGAACCTTGA 


S400 


V_/\V_ 1 1 J 1 IV. 


V. 1 1 1 1WWVJ>_V> 


TTTTGCACTT 


GATTGGCCAT 


ATCTCTTTCA 


AAAGCTTTTT 


5460 






TGACCAAAGA 


AAGGACGAAT 


CTTGCCATCC 


TCAAAAGCGA 


5520 


uA/\ i L 1 1 uVj 4 


4 W 4 A /» 


TCCAAGAAAT 


AGCCGTCGTG 


ACTGACTGTT 


AAAACGGGAC 


5580 




TTGCAAGAAA 


TTCTCTAAGA 


CTGTCAAAGT 


TGCAATATCT 


AGGTCATTCG 


5640 




TAAAACAAGA 


ACATTTCGTT 


TTTCCAAAAG 


CAGTTTGAGC 


AGATAAAGAC 


5700 




ACC CCCTG AC 


AATTTCTCAA 


TCAAACTCCC 


ATGCCTCGAA 


CCTGGGAAGA 


5760 


GGAATTGCTC 


CAGCAACTCA 


GCGATGGAAG 


TCCTAGAACC 


ACCACTGGTC 


TTGACCTCCT 


5820 


CTGCCACTTC 


CTGCAGGTAA 


TTGATCACAC 


GCTTGCTTTC 


ATCCAAACCC 


TCAATTTGTT 


5880 


GAGAGAAATA 


GGCGATGCGA 


ACAGTTTCCC 




1 4 w 1 1 4 


GTCGGCTCAA 


5940 


GACTTCCTGC 


AATCAGGTTA 


AGTAGGGTTG 


ATTTTCCAAC 


ACCATTGTCC 


CCAACAATTC 


6000 


CAATACCGTC 


TTTAGCCTGA 


ACTAAGAGAT 


TAAAATTTTG 


CAAAATGGGC 


TTATTTTCAT 


6060 


AGG CAAAGGA 


AACATCCTGA 


AACTCGATGA 


CTTTCTTCCC 


AATCCGACTG 


GTTTCAAACT 


6120 


TCATAGTCAA 


GTCTGTCTCA 


GCACTACTGC 


CTGAAACTTC 


CTTTTTCAGA 


TCATGGAAAC 


6180 


GATTGATACG 


AGCTTGTTGC 


TTGGTCGCAC 


GCGCCTCCGG 


TTGTCTGCGC 


ATCCAGGCCA 


6240 


ATTCTTGTTT 


GTAGAGTTGT 


TCTTTTTTGT 


GAAGAAGAGC 


CGCGTCCCGC 


TCATCCTCTT 


6300 
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CCGCCTTTAC 


GCGAACATAG 


TCCTGGTAAT 


TTCCCTGGTA 


CTCGGTCAAG 


CCTGCACGAT 


c t £ n 


CCAACTCGAA 


AATCCGTGTT 


GACAAAGCGT 


CTAAGAAATA 


ACGATCGTGA 


GTGATAAAAA 


6420 


CGACGGTCTT 


CTTAGAATTT 


TTCAAAAAGA 


GGGTCAGCCA 


CTCAATAATC 


GCAATATCCA 


64 80 


GATGCTTGGT 


CGGCTCATCC 


AAAAGCAAGA 


GGTCGTGGTT 


GCCAACTAAG 


ACTTGTGCCA 


6540 


ACTGTACCCG 


TCTTCTCAGA 


CCACCTGACA 


ATTCCCCAAC 


AGGAGTAGAT 


AAGTCTTGAA 


6600 


TGCCCAATTT 


GCTAAGAACG 


GTCTTGACCT 


GACTTTCGAT 


TTCCCAAGCT 


TGGAGAGACT 


6660 


CCATCTCTGC 


CATCACACGT 


TCCAAACGCG 


CCTGCTTGTC 


CTCACTATAG 


TCGAGCATAA 


6720 


TCAATTCATA 


CTCACGAATG 


AGCTGGATTT 


CCTTGACTTC 


ACTAGATAGA 


ACCCTATCCA 


6780 


AAACTGTCTT 


TCTATCATCA 


AAATCAGGAT 


CCTGAGTCAA 


GTAACCAATC 


TGGTAATCAT 


6840 


TTTTAGCTGA 


AAAAGGACTG 


ACATCCCCAT 


CAAATCCAGA 


AACACCAGAA 


AGGACGTCCA 


6900 


AAAGGGTGCT 


CTTGCCAGTC 


CCATTCACAC 


CGATTAAACC 


AATTCTGTCT 


AAGTCATGGA 


6960 


TAATAAAGCA 


AATATCCCTA 


AAAACGGTCT 


TGTCACCAAC 


GGATTTACTT 


AGTTTTTCAA 


7020 


CGATAAAATC 


ACTCATTTTT 


TCTCCCTCAG 


GTAAGCATGG 


ATGGCTTCAC 


G ATT ATTCTC 


7080 


CAATTCTCCA 


TCGACAATGG 


CAAACTCAAT 


CTCTCTTAAA 


ATCTCTCCCA 


ACTCTGGGCC 


7140 


TGGCTGATAG 


CCATATTCCT 


TGATCAAAAT 


ACCGCCATTA 


ATCTCAATCT 


CTTTCTTGTC 


7200 


ATGGATAGTC 


AACCTTTCGT 


ATTTTTCTCT 


GATGGCTTGT 


GGGTTGACTT 


CTT TTCCTTC 


7260 


ACCTTGACGA 


AGATTTTCAG 


CCTGTAAAAG 


CAAATCTATG 


TCAAAGCCAT 


AACAATCTCG 


7320 


CTTGCTCAAT 


TCTCCATTTT 


CACGCAGAGC 


CAAAATAATC 


AGCAAATCCT 


CAACTTGCTT 


7380 


GGCAAACTCC 


CCTGAGCTCT 


TCCAAGATTT 


CAAAAATGAC 


TGCCCATTTT 


CAATCTCCAA 


7440 


AGCCCATACT 


AAAGCCGCCC 


AGCCTTGTTC 


ACAGGATTCA 


AAAGTAAAAT 


CACTCTCCAA 


7500 


ATCAAACAGT 


CTGTTGAGCT 


TGTCCTGCCT 


AGATGCCATA 


TCAGCGACAT 


ACTCATAAGC 


7560 


TTGACTCTCA 


ATCATGGAAG 


CCAAGCCCCT 


TCTCCAAAAT 


GGAGCCAGCA 


AGAGTTTATC 


7620 


AAACTCGACG 


AAGGTACGCT 


CTACACAAAT 


TTTCTCCAAA 


AGCGGCGTCA 


AGGTCTTCAT 


7 680 


AGCTTTAAAT 


GTTTCTGGCT 


CAAGTGCAAA 


ACCAAGACTA 


GCCTGAAAAC 


GGAAACCACG 


7740 


CATAATCCGT 


AAAGCATCTT 


CGTTGAAACG 


CTCACTAGCC 


ACTCCAACTG 


CTCGCAAGAC 


7800 


TTGCTTTTCC 


AAATCTTCTA 


AACCATGGAA 


CAAGTCAACG 




TCTCATCCAA 


7860 


GGCAAAGGCG 


TTGACTGTGA 


AATCACGGCG 


TTTGAGGTCT 


TCTTCTAGCG 


ATCGTACAAA 


7920 


GGAAACCGCA 


CTGGGTCTGC 


GATAGTCCAC 


ATAGACATCC 


TCTGTCCGAA 


AGGTTGTTAC 


7980 


CTCATACTCC 


TCATCCCCAT 


CTAAGACCAA 


CACGGTTCCA 


TGCTCGATTC 


CGATATCGGC 


8040 


TGTTCGCGCA 


AAAATCTGCT 


TGGTCTCTTC 


TGGATAAGAA 


GACGTCGCAA 


TATCCACATC 


8100 
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GTGGATAGGG 


CTATGGAGAA 


OUvLa 1 n_ X \~\* 


AACAGAGCCC CCAACAAAAT 


AAGCCTCAAA 


8160 


CCCTGCTTCT 


TTAA'rri'rri 


CTAAi At. 1 v*t» 


TAAAGCCTTC 


TG AAATTC AG 


AAGGCATTTG 


8220 


CGTTAATCTC 


AT AAT AAGTG 




TAGACAAGCT 


CATGACGCTT 


GACAACTTCT 


8280 


TTAATTCCCA 


AATTGACTCC 


TGTCATGAAG 


GAGATGCGAT 


CATAGGAGTC 


ATGACGGAGG 


8340 


GTCAACCCTT 


CTC CCTGATT 


GCCAAAGA j\j 


ACTTCCTGAT 


CACCTACCAA 


CCCTGGCAAA 


8400 


CGAACTGAGT 


GGATGCGCAT 


ACCATCAAAC 


TCAGCACCAC 


GAGCACCAGC 


AATCAGCTCT 


8460 


TCCTCATCTG 


CTGCACCTTG 


CTGAATTGAC 


TCTCGAACCT 


CTGCCATCAA 


CTCAGCTGTT 


8S20 


TTAATCGCTG 


TTCCACTCGG 


AGCATCCTTT 


TTCTTGTC AT 


GATGGAGCTC 


AATAATCTCC 


BS80 


ACATTTGGGA 


AATATTTGGC 


AGCCTGCGTC 


CCAAATTGCA 


TGAGTAAGAC 


AGCACCCAAG 


8640 


GCAAAGTTAG 


GCGCAATCAG 


GCCACCCAAG 


TCTTCCGCAC 


GAGAAAATTC 


TTTTfcCPTCT 


8700 


GCAATTTCTT 


CACTCGTGAA 


ACCAGTCGTT 


CC AACTTACTG 


GAGCAAAGCC 




8760 


GCAAAACGTG 


TATTTTCGTA 


CGCAACAGCT 




AATCTACCCA 




8820 


TCAAAACCAG 


CTAAATCAGC 


CTTATCCTTG 


AAAACAGGAA 


TACCCTGCCA 




8880 


GACTCAAAAG 


GATCCAAAAC 


TGCCACCAAG 


TCCAAGTCTG 


GATCAGTCAA 


TACCATCTGA 


8940 


CAAGCAGCCT 


GGCCCATCTT 


TCCCTTAAAA 


CCGGCAATAA 


TTACTCGAAT 


ACTC AT CTCT 


9C00 


ACTCCTGTCT 


AAGATACAAA 


GTCCCTAAGA 


ACACAAAGTG 


AAAATAGGAA 


TTCCAATCAA 


9060 


GAAGTGTCTA 


CTTCTTGGAA 


GAACTATCTT 


TTTCACACAG 


GGTTCCAGCC 


GTGTTCAATT 


9120 


ATCAAGATAC 


AAAGGACCTT 


AGCTGCCTCT 


GAAAAATAGG 


GAATGGCACT 


GACTTTCCAC 


9180 


GAAAGGCAAG 


ACAGGCATCT 


TTTTTCAAGA 


CGCAGCTACT 


CCCTGTTCAA 


TTTCTAAGAT 


9240 


ACAAGGCATC 


TTAACTAGCC 




AACTAAATCA 


CTGGAATATA 


ACCCAGAGCA 


9300 


ATACTTCCTG 


CTCCTAGGTG 


CClltCAATC 


AC ACT At- LAft 


ATfTTACCAAG 


TCAAACATCC 


9360 


CAACCCAAGC 


CAAAATCAAG 


C AAGTGC TG A 


CGCAATTCTT 


CACCCTTTTC 


AGGACCATTC 




CCATGAATCA 


CAATCACCCG 


GTATTGACCT 


GAAGCCGTTG 


TTTCCTTGAT 


AATTTCAATT 


9480 


AAGCGCTTGG 


TGGCCTTCTT 


TTCAGTACGA 


ACTTTTTCGT 


AAACTTCAAT 


CACACCTTGA 


9540 


TCGTTAAAAT 


AAAGGATTCG 


CTTAATGCTA 


AGCAAATTGC 


CCAAAATGGC 


AGCCCCATTT 


9600 


GAAAGGCGTC 


CACCTTTTAC 


CAAATGATCC 


AAGTCATCTA 


CCATGATAAA 


GGCTGACGTA 


9660 


CGGCTGATTT 


GAATGGCTAG 


CTTATCCTGA 


ATGCTGCCAA 


AATC ATCGCC 


CTGATCACGC 


9720 


CAATTAAACA 


CGCTTTCAAC 


CATGATGCCT 


AGGGGAGCAC 


TTGTAATCAA 


AGTGTCTGGG 


9780 


AAAGCAATGG 


TTAAGCCCTC 


ATACTCATCG 


ACCATATACT 


GGATATTTTG 


GTAAAAACCT 


9840 



WO 98/18931 



PCT/US97/19588 



496 

CAAATTCCAG AAGATAGCAA AAGCCCCAAC CCATGTGTAT AGCCTTGTTC TTTGAGCGAA 9900 

GTTAAGATCT CATCTAACTT GGCAATACTT GGTTGACTGC TCTTAGGCAA TTCAGAACCC 9960 

TGAGCCATTT TTTGGTAAAA TTCCTCAGCA GACAGATTGA TGCCTTCGAC AT AT TCCTC A 10020 

CCATCAATAT TGACAGGAAT ATCCAAGACA AACAAGTCTT CTCTTTGCAA G AT CTCTGC A 10080 

CTGAGATAAG CAGAGGAATC TGTGAAAACA GCTAATTTCA TATTAGAACT CCAAATTAAT 10140 

TCCTGGTAAG TCTAATCCAA TTTCAGTCAC TTCGTAAGTC AAACGATTGA GCATGTTCAA 10200 

ACATCGACGA GCCAAGCTTT CCACCTCTTC TTCGTTCAAT TCACTTGGTT CATTGACAAT 10260 

ACGGCCATCG ATATGGTTTA CTTCTGACAT TGTTCCACTA ATGACAAACT TATCAAATAC 10320 

AATCATAAAG CTCAAGATGA CAATCAAGGA AGTCACTTGA TTTTCTTGGT CATGTTCGAG 10380 

CAATTGGAAA TTCACATCCA CCTTGCTTTC AGGAGCTCCA TTTTCATTTT CCCATTCAAA 10440 

ATTACGCCCA TCAAAATGAT ACTGACTAAC AAATTCTTCT TCACGTTTAA GATTCATCTC 10500 

TTTCTCCATC GGCTACAATA TTATAAGCTA TTGTACCATA ATTTTTTATT TTCATCTAGT 10560 



TTTCTAGGAT 


1 TTAGTCAATC 


CCAATTTCAC 


CACGAACTAC 


ATCTGTGATG 


GTATCAACAT 


10620 


AGTAGTTTAC TTCTTCTGTT 


GTAGCCGCTT 


CTGCCATAAC 


ACGCAAGAGG 


GGTTCTGTTC 


10680 


CACTTGGACG 


AACAAGGATA 


CGGCCGTTCC 


CCGCCATTTC 


TTCTTCCATC 


TTCTCGATGA 


10740 


TGGCCTTCAT 


AGCTGGCACT 


TCCATGGCCT 


TTTCCTTCAT 


GACCTTTTCC 


ACTCGGATAT 


10800 


TAACTAATTT 


TTGTGGATAA 


ATCGTTACTT 


CTGCCGCCAA 


CTCTGATAAG 


CTCTTACCAC 


10860 


TTTCCTTCAT 


GATTTTAGTC 


AATTCAACTG 


CTCATAATTG 


ACCATCACCT 


GTGGTATTGT 


10920 


AATCCATCAA 


GATAACGTCA 


CCAGACTGTT 


CACCACCAAG 


CTTGTAGCCT 


GATTTTCTCA 


10980 


TTTCTTCAAC 


AACGTACCGG 


TCACCAACTG 


CAGTAACTGC 


CTTCTTAATA 


CCTTCGCCAT 


11040 


TCAAGCCCTT 


GTCGAAACCA 


AGGTTAGACA 


TAACAGTTGT 


CACAATTCTA 


TTTTGAGCCA 


11100 


ATTGTCCTTT 


TTCAGAAAGG 


TATTTTCCGA 


TGATGTACAT 


AATCTTGTCA 


CCATCAACGA 


11160 


TGTCACCATT 


CTCATCAACA 


GCAATCAAGC 


GGTCACTGTC 


TCCATCAAAG 


GCCAAACCAA 


11220 


TAGCTGACCC 


ACTTTCTTTG 


ACCACTTCTT 


GAAGGGCTTC 


TGGATGTGTT 


GAACCAACAT 


11280 


TAACGTTGAT 


GTTAAGACCG 


TCTGGTGTTT 


CCCCCATAAC 


CGTCAATTGG 


GCACCAAGGT 


11340 


CTGCAAAGAT 


TTGACGCGCA 


CTGGTAGAAG 


CTGCTCCATT 


AGCTGTATCC 


AAGGCAACCT 


11400 


TCATTCCATC 


AAGAGGACTT 


CCAGTTGAAA 


CAAGGTATCC 


TTCATACTTA 


CGCArGCtTC 


11460 


TCGATAATCT 


ACCAAAATTC 


CTAAGCCTTC 


TGCACTTGGA 


CGAGGAAGAG 


TGTCTTCCTC 


11520 


AGCATCTAGC 


AAGGCTTCAA 


TTTCTGCTTC 


TTTTTCATCA 


TCTAGTTTGA 


AGCCATCACC 


11580 


CCCAAAGAAC 


TTGATTCCGT 


TATCAAGGGC 


TGGGTTGTGG 


CTAGCAGAAA 


TCATCACACC 


11640 



WO 98/18931 



PCT/US97/19588 



497 



rr*?* ft /- r*T 


P*P*^P*T*P* A^^^P^PT 
Ww i 1 LHvJ lit 


f*AAf"f*AAGTA 


agctactgct 


CCTGTTCCAA 


GGACACCAAG 


11700 


TTTU1 ATALo 


Tr.AATrrrTA 




ACCTGCCACC 


AAGGCCGATT 


CCAACATTTC 


11760 


CCCTGAAATA 


* \# 111 1 i— 




* * * 


GCtTCCGtTT 


CATGTTGACT 


11820 


AAG AA CAT Afc» 


1 wLAAAAL 


GTC CT ACTTT 


AAAGGCTAAT 


xct ggtg tt a 


CTTCTAGGTT 


11880 


AGCTTCTCCA 


LOtjACTtLA I 




/\ ft n ft i ft/%www 


attgttataa 


AATCCTTTTC 


11940 


TATTTTTAAT 


TCGTTTTTGA 


ACTAwnwT 


1 1 4Va/\W.w 


nf\\Jr% ft v 4 W 4 W 


CGATGAACTG 


12000 


CTTGTACTTG 


AATTTGATGT 


GCTTGAACTT 


rVITfifTAPTG 
UU 1 UW 1 />V» A U 


p*I**I*T TftTART 

VJ 4 4 ft 4 w 4 r%\9 4 


CACCTTCATT 


12060 


ATTGTATCAA 


ACGGAGTG AT 


AACTGCCGuT 


ft ftPAfAAPAP* 

AAun I— /w\ w /\w 


4 4 vViUV • w 


GATTGCCTGC 


12120 


AAACGTACTG 


AACCACTGTA 


ATTACCTGTT 






AACAGCGATA 


12180 


ATCTTATCAA 


TTCTATCCAA 


TGTCTCTTGG 


TUAL 1 1 AA 


1 A\inL Aw 1 IV. 




12240 


ACCATGACAT 


TTTCAATTTG 


TACCCGACTA 


*i»r* ft & n*nTr & P* 
TCAAl I l GAL 




PTCTGCTACA 

V. ft W 4 w\J ft 


12300 


ATCTTTACCT 


TATCCTTCTG 


AGCCTTCTTA 


LLAA1\. 1 IbA 


CTGTAATTTT 




12360 


GCCACAGCGG 


TCAGCCCATT 


gggtaaatct 


TCAATCCTCA 


ft A^T" ft APTTP 


AATFCTTCCA 


12420 


ACACCGGCAT 


CTGTT AGGTC 


nL»L AU i AAH- 


TTP" ft iTT^aP 


U i U 1 rtl* 4 1 4 V_ 


TTCCATTTCA 


12480 


CTAGCTAGCG 


ATAGCCGATT 


TGCACCAGTQ. 


AAunLLnl. 1 Vi 


M 1 xVV— 4 1 \— 1 \ln 


AGCAAAACCC 


12540 


CTAATAAAAT 


ACTTATCACT 


ATT AT AGCGT 


AT i* I vAA I t\\j 






12600 


GTATAGGTTT 


CCGTTTTTAC 


LiCkWTAuLA 


* ^* ^ ^™ / ft /*^'P^T 
w 1 Vjv 1 /Vw 1 vi 1 


TTTCAAAATT 


CGTCGCCGTA 


12660 


GCATAGACAA 


ATAAGACACA 


ftPP*ft ft & ft ft ft/"! 
AoLAAAAAAW 




TP, AT AT AT AA 


actatttttt 


12720 


TTCATGTTTC 


f ft Tf*f*T r*r*T* ft 

CATCCTCCTA 


ULAn I wu 1 1L 


TTTAAAACTA 


AGACCCACTT 


CCTCTTTTGG 


12760 


AAGT AAG ATT 


TCACuTAATT 


W lu ill L/v\A 


TTCATCAAGT 

1 A W% 4 W#V%»*J * 


GTTAGGTTGT 


GCTTAAACCT 


12840 


K 1—1* ft »^ ftp 

TCCATTATAG 


U 1 1 A 1 vuAnn 


4 1 \- 1 V> Www 4 


TTCCTCTGAT 

4 4 W W 4 W 4 VJ*» 4 


ACGACAAAAG 


TCAAGGCATC 


12900 


TGAGACTTCT 


CATAAACCGA 


TAGCCGCCCG 


CTGTCTGGTC 


CCAAATTCCT 


TGGAAATCCC 


12960 




CTCAAGGGCA 


CATAGGCAGA 


CGTCACAGCG 


ATACCiTlX-TI' 


CTTTGATAAT 


13020 


CACCGCACCA 


TCATGTAGGG 


GAGTGTTGGG 


AATAAAAATG 


TTAATGAGAA 


CTTCTGCAGA 


13080 


AATCTtAGCA 


TCCAAGGGAA 


TTCCTGTCGA 


AATATACTCC 


TGCAAGGTAC 


GTACACGCTC 


13140 


AATAGCAACC 


AAGGCCCCGA 


TTTTACGAGG 


ACTCATCTAT 


TCAACAGACT 


TAACAAAGGC 


13200 


ACGAATCATC 


TGTTCCTCAG 


CACTAATAGG 


GGCATTGGAA 


AAGAAATCTG 


TCGCTCTTCC 


13260 


CAAACGTTCC 


AAACCAGTCC 


GAATCTCTGG 


AGAGAAGATA 


ACAACCGCCG 


CAATAACCCC 


13320 


ATAAGTAATA 


ATTTGATTGA 


TTAACCAAGA 


AATCCTAGTC 


AAACCAATCA 


tatttgcaag 


13380 
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GATTTGAGCT 


AAAATAAACA 


CCAAAACTCC 


ACGTACCAAA 


ATCATAATCT 


TGGTTCCTGC 


13440 


AATAGCTTTT 


CTAAAATGCT 


ATAAAATATA 


AGCAACAATC 


AAAATATCAA 


TCAGATTGAT 


IJjUU 


AGCTATCGTC 


CATGGACTTG 


CAAACAAACT 


GGTCCAATAT 


TGCAGATTGG 


ATAATTGTTG 




AAAATTCATC 


CCTGATATCC 


TCCCTATCAA 


AACACTTTCC 


TCCTATTATA 


CCATTTTCTG 




GCATTTTTTT 


CCCTATCCTA 


GTCCATTTTA 


CATTGAACAA 


AAftTATGATA 


AAATAAACTG 


1 i (re n 


ACTAAAAAAA 


ACAAAGGAGA 


AACTATGTCT 


CAACTCTATG 


ATATTACCAT 


TGTGGGTGGT 


i j in y 


GGTCCTGTCG 


GGCTTTTTGC 


AGCCTTTTAT 


GCCCACCTAC 


GCCAAGCCAA 


GGTTCAAATC 


13800 


ATCGACTCTC 


TTCCCCAGCT 


AGGTGGACAA 


CCTGCTATTC 


TCTACCCTGA 


AAAGGAAATC 




CTAGACGTAC 


CAGGCTTCCC 


AAACCTGACT 


GGAGAACAGT 


TGACTAACCC 


CTTGATTGAA 


13920 


CAGCTAAATG 


GATTTGATAC 


CCCTATTCAT 


CTCAATGAAA 


CGGTTCTTGA 


GATTGACAAA 


13980 


CAAGAAGAAT 


TTGCCATCAC 


AACTTCTAAA 


GGAAGTCACC 


TGACTAAAAC 


AGTTATCATC 


14040 


GCTATGGGTG 


GCGGTGCCTT 


CAAACCACGT 


CCGCTCGAAC 


TTGAAGGGGT 


TGAGGGCTAT 


14100 


GAAAATATCC 


ACTACCACGT 


TTCTAACATT 


CAGCAATACG 


CTGGTAAGAA 


AGTCACGATT 


14160 


CTTCGTGGGG 


GAGACTCGGC 


TCTGGATTGG 


GCTTTCGCTT 


TTGAAAAAAT 


CGCACCAACT 


14220 


ACCCTTGTTC 


ACCGCAGAGA 


TAATTTCCGT 


GCCTTGCAAC 


AC AGTCTT C A 


AGCCTTGCAA 


14280 


GAATCATCTG 


TAACCATCAA 


CACACCATTC 


GCCCCTAGCC 


AACTCCTTGG 


AAATGGAAAA 


14340 


ACACTTGATA 


AACTTGAAAT 


CACAAAAGTC 


AAATCTGATG 


AAACTGAAAC 


CATTCACCTA 


14400 


GACCACCTCT 


TTGTCAACTA 


TGGTTTCAAA 


TCTTCTGTCG 


GTAACCTTAA 


AAACTGGGGG 


14460 


CTCGACCTCA 


ACCCTCACAA 


GATTATCGTC 


AACAGCAAAC 


AGGAATCCAC 


CCAAGCAGCT 


14520 


ATCTATGCTA 


TCGGTGACTG 


CTGCTACTAT 


GACGGAAAAA 


TTCATCTGAT 


TGCGACAGGC 


14S80 


CTCGGAGAAG 


CTCCAACTCC 


TCTCAACAAC 


GCTATCAACT 


ACATTGACCC 


TGAACAAAAA 


14640 


GTACAACCAA 


AACACTCTAC 


TAGTTTATAA 


AAAAGAACCA 


CGACTCACAT 


AGGATTCGTG 


14700 


GTTTT AT AAT 


TCATCCGCTA 


TCTTATTGAT 


TTTTCTGAGT 


CTGTGATTGA 


CACCACTTTT 


t A "1 £.f\ 

X4/DU 


CCTCAGAGGG 


GTGCTG AG AC 


TATCTGCTAA 


CTGCTGGATA 


GAGTAGTCTC 


GGTGCTGAAT 


14820 


CCTCAGTTGC 


GCCACTTCCT 


GCAAATCTAC 


TGGCAAATTT 


TCT AAGCCC A 


TGATATCTTT 


14880 


GATTTTACTG 


ATATTC TT AA 


TGGTCTTCAT 


GCTGGCAGAA 


ACTGTCCCAG 


CGATATTAGC 


14940 


TGTCTCGGCA 


TTATTAGCCC 


GATTGAGGTC 


GTTACGGGTT 


TCTCGCAAAA 


TCTTAACCCG 


15000 


CTCAAAATCA 


TCACCTGCCT 


GCATGGCTCC 


TATTACTATC 


AAGAAGTCCA 


TAATGTCTTC 


15060 


TGCTCGCTGG 


AGATACGTCA 


CAGCCCCCTT 


CTTGCGCTCA 


AGCACCTTGG 


CATCCAGTAA 


15120 


AAACTGTTGG 


AGAAGGGAGG 


CAATTCCTTG 


CGCGTGGTCC 


AGATAAACAC 


AACTGATTTC 


15180 
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CAACTCGTAC TTCCCTGACT CAGGCTCACG AATGCTCCCA TTTGCCAAGA AAGCGCCACA 
GAGATAGGCA CGACCTGCTT CCTCATCCGA TAAAATCGCC TCATCAATAC CTGTTTCCAG 
GCCAAACAAA GACTCTGCCA AGTGCAAATC ACTTAACAAA TCCTCCACCT TTTCATCTGT 
AAAAACGGTA TAGACGCGAT TCTTGCGAAG ATTGCTCCGT TGGTGGTGAC GAATTTCAGA 
TTTGATTTCA TAGAGATGGA GAAAGGACTC ATAGAGGTGA CGGGCCAGTT TGGCATTTTC 
TGTCACAACT CACAAACTCA AGCCCGAAGT CGAGAGACCG ATGCTACCAG ACATTTTGAT 
AATGGCAGAT AATTCATGCC AGCTCAGATG GTGTTGGCCC AGGATTTCTT CTTTTACTGC 
TACTGTGAAA CTCATTTTTT CACCTGTATA ATGCGCATCA ACTCCTCCAC AATCAAATCT 
CCATCGTGGA AGGCACCGCC ATTTTCCAGA CGAAGGAAGT TAGATGAAAT CACGCGCGAA 
ACTTGCTTAC AAAGACCTAC AAAATCGTGT TCCACTTCCA CTAAGTATTC ATCAAAACGG 
TTGGAATTCA TGTATTCCTG AGGCACTTTT TCAATATTCA CCAAGACACT GTCGATAAAA 
CGGCCACCAA CGTGACGATG CAAGACTTCC ACGTGGTCGC TATCTGTAAA GTGTTCCGTC 
TCCCCACGTT GGGTCATGAT ATTGCAGACA TACGCAATTT CTGCCTTGGT TTCCAAAAGA 
GCCCGCCCAA TTTCCTTAAT CACCATATTG GGCAAAATAG AGGTAAAGAG GGAACCTGGC 
CCTAGGACAA TC AT CTC ACT TTCAAGGATG GTCTGCACTA CTCCACGGCT GGCCAGAGGC 
GTATCATCGT TTAGGGCATT GGTCACATAG ACATTGTCAA TTATCCCTCG ATGGTCTACA 
ATATGACTCT CTCCACCCAC TTCTGTCCCA TCCTGAAAGA CTGCATGAAG GGTCAAAGGA 
TGCTCACTGG AAGGATAAAT TTTCCCTGTT GTATGGAAAA ATTTGCTCAA TAACTGCATG 
GCATTATAGG TTGAACCCTG CATTTCTGAC AAGCCAGCAA TGATGAGATT TCCCAATGGA 
TGGCCAGCAA AGGCTCCGGC ATCCTCAGAG AACCGATACT GAAAGACCTT CTCATAAAAC 
TTAGGCATAT CCGACATGGC CACAAGGACA TTACCAAGAT CACCTGGCGG TCTCAACTCT 
TGCATATTTT TTCGGAGTTC ACCTGAAGAA CCACCATCAT CTGCCACCGT CACGATAGCT 
CCGATTTCCA CATCTTTTTC CCCCAGACTT TTTAGAATGA CGGGACTTCC AGTCCCTCCA 
CCAATCACCG TTATCTTTGG TTTTCTCATG AACGGTTTAC CGTTTCCTTT CTGCGGTCTT 
TGTCGCGATG CCCTTCATTA ACAGACCAAT TCTTCGATAA^ ^TF^^^F? />J ^ CCTT J^ C . 
CAAATGCCAC ACTACGGTGT TGTCCACCCG TACATCCCAT GGCAATGGTC AAAACGGACT 
TACCTTCCTT TTGGTAACTT GGCAGAATCG GCTCAATCAA GGCCAATAAA TGTTGATAAA 
AGTCTTCTGA CTCAGGATGG TTCATGACAT AATCATAAAC AGGTTCATCC ACACCCGTTT 
GGTTTCTCAC TTCTGGTAAA TAATAGGGAT TTGGCAAGAA ACCGACATCA AAGACCAAGT 



15240 

15300 

1S360 

IS420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 
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CCGCATCAAT 


CGGGATTCCA 


TACTTAAATC 


CGAAAGACAT 


CACTTCGATA 


CGGAAAGACT 


16980 


GGGCTTGTTC 


TTGGTCTGAA 


AACTGCTCTG CAAGGGTTTT GCGCAgCTCA 


CGTGGAGTCA 


17040 


GTTCAGTCGT 


ATCCACCACA 


TTTTGGCTCA 


TATTTTTCAA 


ACGTGCCAAG 


AGTTCACGTT 


17100 


CCAACTTGAT 


TCCATCTAAA 


ATACGACCGT 


CTGCTGCTAG 


TGGGTGACTC 


CGTCTGGTTT 


17160 


CCTTGTAACC 


AGCGACCAAT 


TCCTTATCAG 


CCGCATCCAA 


AAAGAGGATT 


TTGAAATCCA 


17220 


AACCATCTTG 


ATTTTCCAAC 


TCATCCAAAA 


CACCTTGAAT 


CTCTGAAAAG 


AAAGAACGGC 


17280 


TACGCATATC 


CACTACCAAG 


GCCAACTTAG 


GATTGTCTTC 


CTTAATTTCA 


ACCAGCTGCA 


17340 


AAAACTTAGG 


CAAGAGAGCT 


GGCGGCATAT 


TATCAATGCT 


GAAATAACCT 


AGATCCTCGA 


17400 


AGGACTGAAT 


GCCTACAGTT 


TTCCCTCCGC 


CACTCATCCC 


TGTCACAATC 


ACCAAGTGAA 


17460 


GTTGTTTCTT 


TGTCATCTTT 


TTCTCCTTAT 


ATCAAAAGAA 


GTTTGGCAAC 


ACCAAACTTC 


17520 


AACTACCTTA 


TCCAATCTCT 


GCGATGACTT 


CAATTTCGAC 


TTTTACATCA 


CGAGGAAGAC 


17S80 


GAGCTACCTC 


CACAGCTGAA 


CGAGCTGGGA 


ATTCCTCTTT 


GAAGGCCGTT 


TGGTAAACCT 


17640 


CATTAAAAGG 


AACAAAGTCG 


TTCATATCGC 


TCAAGAAGCA 


ACTTGTTTTG 


ACAACATCGT 


17700 


CAAAGTCTGT 


TCCTGCTT CT 


GCCAAAATAG 


CACCGATGTT 


TTTCAACACT 


TGCTCTGTCT 


17760 


GTTCTTGGAT 


ATTCTCTCCT 


ACAATTTCCC 


CACTTTCACG 


GGATAGGGGA 


ACTTGACCGC 


17820 


TAGCAAACAA 


AAGGTTGC C A 


ACGATTTTTC 


CTTGAACATA 


GGGTCCGATA 


CCCTTTGGGG 


17880 


CCTTATCTGT 


ATCAATTCTT 


TTTGCCATTT 


TCTTTTCCTC 


ACAATTTTTC 


TAAGATTGCA 


17940 


TCCCAAGCCT 


CATCCATCCC 


TGCCTTACTG 


ACAGATGAAA 


AGAGGATGAA 


ATCGTCACTC 


18000 


GGCTCAAAGT 


TTAATTTCTT 


TTTGATTGCT 


GATTCATCCT 


TGTTCCATTT 


ACCACGACGA 


18060 


ATCTTGTCCG 


CCTTGGTCGC 


CACAATGATG 


ACTGGAATCT 


CATAATACTT 


GAGAAATTCG 


18120 


TACATCTGCA 


CATCATCTGC 


TGACGGGTCA 


TGACGAAGGT 


C AA CT AG A CT 


GACAACCGCA 


18180 


CGGAGATTTT 


CCCGAGTCGT 


TAAGTACTCC 


TCAATC ATGC 


ACCCCCACTT 


TTCACGTTCC 


18240 


TTTTTAGAAA 


CACGACCATA 


GCCATAACCA 


GGCACATCCA 


CAAAGCGCAT 


CTTCTCATCA 


18300 


ATGTTAAAAA 


AGTTCAGGAG 


CTGGGTTTTA 


CCAGGTTTTC 


CTCATGTACC 


GGCGAGATTC 


18360 


TTACCCTTCA 


ACATAGTGTT 


GATAAAGCTG 


GATTTACCAA 


CATTTGAACG 


CCCTCCTAGG 


18420 


GCAATCTCTG 


GCAGTTCATC 


CTCCGGATAG 


TGGGACTTAT 


TAGCTGCACT 


GAGCAAGATT 


18480 


TCAGCATTGT 


GTGTATTAAG 


TTCCATAGTC 


ACCTCTAGGC 


TGTTTCTAGG 


ATCGGTTTAT 


18540 


CCGTTCCATC 


TACAGTTTCT 


TTAGTGATGC 


GAACCAATTT 


CACATTTTCC 


TGACTCGGCA 


18600 


CCTCAAACAT 


GACATCTAGC 


ATGGTTTCTT 


CG ATGATGG A 


GCGAAGTCCA 


CGCGCCCCTG 


18660 


TCTTCCGTTC 


GATTGCTTTA 


TTAGCAATCT 


CTTGAAGGGC 


TTCGTCGTCA 


AATTCCAACT 


18720 
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ATAAGAAAGC 


AAGGTTTGGT 


ATTGTTTCAC 


CAAGGCATTT 


CTTGGCTCTT 


18780 


T PAACATG CG 


AACCAAGTCA 


TCAACGGTCA 


ATTGCTCAAG 


AGCCGCAAAA 


ACAGGCAAGC 


18840 


("TTrAATCAA 


CTCAGGGATA 


ATACCAAATT 


TTTGAATCTC 


TTCAGCGATG 


ATTTCTTGCA 


18900 




RTTTTfCTCA 

1st 1 X 1 » w\J » V«n 


ATCGCCTTAT 


TATTTTGACC 


AAATCCGATG 


ACTTTTTCAC 


18960 




*r*TTGACAATT 


TCTTCAATAC 


CATCAAAAGC 


ACCACCCACC 


ATGAAGAGGA 


19020 


TATTTTTTGT 




ATCATCTCTT 


GTTGTGGATG 


TTTGCGTCCA 


CCTTGAGCCG 


19080 




ft ft f ArVTTfCf* 


TCAATAATCT 


TGAGAAGGCC 


TTGTTGCACC 


CCTTCACCAG 


19140 


AAAV.A I LnLu 


♦PfiTT ATACAC 


ACATTCTCAC 


TCTTCTTGGC 


AATCTTGTCA 


ATTTCATCCA 


19200 


CATAGATAAT 


f rfArnrTCT 


GCACCTTCGA 


TGTTAAAGTC 


AGCAACCTGC 


AAGAGTTTCA 


19260 


GGAGGATATT 


qMpp » ft 

1 IULALA1LL 


TTArCCACAT 


AACCAGCCTC 


CGTCAGAGCT 


CTCGCATCCG 


19320 


*™ ft IV *P A ff* ft. ft. ft. 

CAATAGLAAA 


/\V* w 1 fti 1V» 


AAGCTCTTAG 


CCAAGCTCTG 


GGCAAGGAAA 


GTTTTCCCTG 


19380 


ft ft ft / '^Mf^^ 

AACCAt* i luG 


OLLAnl LnH. 


AAA ATCTTTG 


ACTTCTGCAA 


ATCCACATCT 


TCTCACTCTT 


19440 


CGCCTGTATC 


/*HV*r & & & *T**TYT 


ATCrGTTTCT 


AGTGGTTATA 


AACCGCCACT 


GCCAAGGCAC 


19SO0 






ATTACATAGT 


GGTTCAAGAT 


ATGGAGGAGT 


TCAATTGGTT 


19560 






GCCAAGACTT 


CCTCAACCAA 


TTCTTCTCGA 


ATGATTTCCT 


19620 


CiAULi AAL I t. 


rirrraTtrA 


TTACAAATAA 


AAGCATTGTT 


GCCAGCAATT 


ATTTTTTGTA 


19680 


CTTC*TTCTTG 


U 1 1 1 1 tU^dA 


CAAAATGAGC 

^ #%^%#W^ » WJ#"»hW^* 


AATAAACCAT 


CATATCATTT 


TTTCTATTTG 


19740 


TACAwATuAT 


i 1 \m\m 1 A V»^\ • 


TCTATACTGT 


CAT TCTATCT 


AAAATAAGGT 


CATGTAAAAA 


19800 




TATTCACCAG 


ATTGGTAAAG 


GCATTTAACC 


AAAGGAGGAT 


AGAAAGCCCG 


19860 




TACCAAAAGC 


TTGTGCTCCT 


GCCAGAAAGC 


AGATCAAACA 


CAGAAAAGCC 


19920 


fTTfIAATAf!Ar 


CAAATAAACT 

W*W^#» 4 rWi\i * 


CCGTTCCATT 


AGACTTCCTT 


TCTCTTGCCG 


TATTGGATGG 


19980 


TAAAATCATA 


AGGATTCTTC 


TCATCTTTGG 


iv ft ft ft 'T^P^ 1 


f^f-TTflAAACT 


GTCTCAAAAA 


20040 


GAGACAAGTC 


AAGTTCTTCA 


GGGAAATACG 


TATCTCCTTC 


CACCCGAGCA 


TGAATGTGAG 


20100 


TGACAATCAC 


TTCATCAAGG 


TAAGGTTCAA 


AAGCCTGAAA 


AATTTGCTTC 


CCACCGATAA 


20160 


TGTAGAGATT 




GCCTGATACC 


ACTCAAGAAC 


AGACTGGACG 


TCCTGAAAAG 


20220 


TAGCAACCCC 


ATCTATCTTT 


TCTTCCGGAT 


TACGCGTCAA 


AATCAAGGTT 


TCCCGTTTTG 


20280 


GAAGCAAGCG 


ACGCCCCATC 


CCATCAAAGG 


TCACACGCCC 


CATCAAGATA 


GCATGATTCA 


20340 


GAGTTGTTTC 


TTTAAACTGC 


TGCAATTCTG 


CTGGCAAATG 


CCAAGGCAGA 


CGATTTTCCT 


20400 


TACCAATCAC 


ACCCTCTTCA 


TCCTGGGCCC 


AAATAGCTAC 


GATTTTCTTA 


GTCATGCTTC 


20460 
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WO 98/18931 



PCT/US97/19588 



502 

CATCCTTTTC ACTGATAGTA CTATTTTATC AAAAAACTCA AAAAAACACT GGTTTGGAAT 20S20 

AGCTTACAAA ATACAAAAAA TCTGTAAGAA ATTTCCTACA GATTTATCTA TCTTTCCTTA 20580 

TTTCTTACAA ACCAGGTGCT TGTCCAAGTT CGGCTGCAAG CATCCAAATT GTTTTATCTG 20640 

TTTCAGTTTT AGCGCCTGCA AAGATACCGT TTGTCACATC GTCACCTTCT TCATCAGTGA 20700 

CATCCAAACC TTTTTGGAAA AGTTCTGACA ACTAACGCTA GATAACAAGA ACACGTTCCA 20760 

AGCTTTCTTC AACATTACGG TATTCACCAG CTTCTTCTTC GATTTCACTA TTTTGAAGGA 20820 

ACTCTGTCAA TGTAGAGAAT GGCCTTCCAC CGAGTGTAAT CAAGCGTTCA CTGATTTCAT 20880 

CCAATTGACC GTCAAGAGCT TC CATGT ACT CATCCATTTT TGGATGCCAT ACAAGGAAAC 20940 

CACGACCATG CATATACCAC TGCACTTGGT GCAAACCAAC GTGACCTACA TACAAATCAG 21000 

CAACAGCTTG GTTCAAGACT TCCTTT G TTT TTGCCAATGC 21040 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2387 base pairs 
(8) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



ATTCTTAATA 


CGATTAAAAG 


CCTTATTACT 


AAAAGAAAAT 


TTCAGTTACA 


TGAACTAAAC 


60 


TTGCTCGTCA 


AATCCCGATT 


TAACGAGATG 


TTTGGGGAAA 


ATAAAATATT 


TGAAAGCATT 


120 


GATAACTTAT 


TTGATATTAT 


AGATGGTGAT 


AGGGCCAAAA 


ATTATCCTAA 


ATCAGATGAG 


180 


TTGTTTAGTG 


AGGAGTACTG 


TTTATTTTTA 


AATACAAAGA 


ATGTTACTAA 


AAACGGATTT 


240 


TCATTCGATA 


CAAAGCAATT 


TATCACTAAA 


ACAAAGGATA 


AATTACTTCG 


AAAAGGCAAA 


300 


CTTGAGCGTT 


ATGATATAGT 


CTTGACAACA 


AGAGGTACTG 


TTGGAAATGT 


AGCGTACTAC 


360 


GATGAATTAA 


TAAAATATAA 


ACATTTACCT 


ATAAATTCAG 


GTATGGTAAT 


ATTACGTCCC 


420 


AAGACACCAA 


ATCTAAATCA 


GAAATTTATT 


ATCCATGTTT 


TAAGGAATAA 


TAATTATAGT 


480 


CGAGTGATAT 


CAGGAAGTGC 


TCAGCCTCAG 


TTACCAATTA 


CAAAATTAAA 


AAAAATACTT 


540 


CTCCCCCTCC 


CCCCACTAGC 


CCTCCAAAAT 


GAGTTCGCAG 


ACTTTGTAGT 


CCAGGTCGAC 


600 


AAATCACAAT 


TGGCAATCCA 


AAAATCTCTG 


GAAGAACTTG 


AAACTTTGAA 


GAAATCTCTG 


660 


ATGCAGGAGT 


ATTTTGCCTC 


ATATTCTGCC 


ATTGTAATTA 


CGGTAATGAT 


TTGTTATAAT 


720 


ACTTCAAAGG 


AGG AAATC AG 


ATGGTAGTAA 


AAACAAGAAA 


ACAAGGAAAT 


TCAATCACCA 


780 


TTACGATTCC 


AAGTGAATTT 


AATATTCCAA 


GTCGTGTTAA 


ATACGAAGCG 


AAATTGTTAC 


840 
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C AAGTGGTG A 


GATTATCTTT 


ACTCCTGAAG 


AATTGGGGCA 


GCAGGTTTCT TATGTATCTG 


900 


ATGATGCCTT 


TGACTTAAAT 


TTAGATAAAA 


TATTTGACGA 


ATACGACGAT 


GTTTTCAAAG 


9 60 


W 1 A 1 XJVJ A 


AAAATCACAA TCTATTTGAC AGAAAAGCAA 


ATTGAAAAAA 


TAAATGCTTT 


1020 


AGCLAATTCAA 


CGGTATTCTC 


CAAATGAGAA 


AATTCAAACA 


GTTAGTCCTT 


CTGCCTTAAA 


1080 


TATC&TTGTG 


AACTTACCAG 


AACAATTTGT 


CTTTGGGAAG 


CCTCTTTATC 


CAACAATTTT 


1140 




ACGATACTAT 


TTGTCCAATT GATAAAGAAG 


CATGTTTTTG 


CTAATGCTAA 


1200 




g ctttctt cg 


TTTTGGTCAA 


ATTTTTACAA 


TTAAACGGCT 


ATCGTTITTC 


1260 




GAAGAAGCAG 


TAAAAATGTG 


TGTAACCATC 


GCAGTAGAAG 


CTTTAACTGA 


1320 


TV™ & & & A & A f 
TunAMVAnlu 


ACAAGCTACT 


CCAAATGGAT 


TTCTGAACAT 


TCTGTTAGAG 


AAAAGGTCAA 


1380 


AAAGTAACCT 


AGTATGCTGG 


ATTTGAATGA 


GCACAAGAAA 


ATAAATGAAC 


AGACAATATT 


1440 


AGAATTCTviT 


AATGCAGAAA 


CTGATATTGT 


CTCTTTTTAT 


TGATGAATAA CAAAGTGAGA 


1500 


AATTATGGAA 


TCAAAAGTTA 


CAATTATCAT 


GCAAGAAATG 


TTACCTCTTT TAAATAATGA 


1560 


ACAATTACTA 


GCGTTGAGAG 


AGAGTTTAGA 


ACATCATCTA 


GTAGACGGAA 


AAAAGCAGCA 


1620 




AATAATAACC 


TGTTGCAACT 


ATTTATTACC 


GCCAAGCACG 


TAGAGGCCTG 


1680 




ACAATTCCTT 


ATTATCAGAC 


GACCATTGAA 


AACTTCTTTA 


ATCCTATTAA 


1740 




ACACAACTCA 


CAACAGATGA 


TTTAAGGAGT 


TATTTAGCAA 


ATTACCAGTC 


1800 


TV* AAA irflfiT 


TGTAGTAAGG 


CAAATTTACA 


CAATATTAGG 


CCTATATTGT 


CTTCTTTTTT 


1860 


TGCTTGGCTT 


GAGCAAGAGG 


ATATATCATT 


AAAATTCCCA 


TTCGACCGAT 


ACAGAAAATT 


1920 


AAGACTCAGC 


AAAATGTGAA 


CCAAACTTAT 


ACTGATGAAC 


ATTTGGAAAT 


TATGCGTGAT 


1980 


AACTGTGAAA 


ATTTGAGAGA TTTGGCAATA 


ATACACCTAC 


TACCATCCAC 


AGCTATGCGT 


2040 


GTAGGGGAGC 


TTGTACAGTT 


GAATCGTTCA 


GATATTGATT 


TTGAAAACAG 


AGAGTGTGTT 


2100 


GTCTTTGGTA 


AAGGAAACAA 


GGAGAGACCA 


GTATATTTTG 


ACGCTCCTAC 


GAAAATTCAT 


2160 


TTAAGAAATT 


ATCTTAACGA CAGAAAAGAT 


AGTCACCCTG 


CTCTTTTTGT 


AACCCTAGTT 


2220 


GGAAAAGTCC 


AGAGGCTTGG 


AATTGCTGGT 


GTACAGATTC 


GCTTAAGAAA GTTAGGAGAC 


2280 


AAACTCCGCA 


TACAAAAGGT 


TCACCCACAT 


AAGTTCAGAA 


GAACTTTAGC 


GACTAAGGCA 


2340 


ATTGATAAAG 


GTATGCCTAT 


CGAACAAGTC 


CAAAAACTGC 


TAGCTCA 




2387 


(2) INFORMATION FOR SEQ ID NO: 57: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



Qitcrwin- ^i«n 
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(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATATTAAAGC CACTTTCTGT CCGCTAGGGA AAAATGTTCC TGGGAATGAG GACTTGGTGA 60 

AGAGGATAAA ATCTGAAGCT CATGTTCTTG GAAACCATAG CTCCAGCCAT CCGATTCTCT 120 

CGCAACTCTC TCTTCATGAA CCTAAAAACC AGATTACTGA TACTGAGGAT CTGCTAACTA 180 

AAGTGCTGGG TTCTAGTTCT AAACTCATGC GTCCACCTTA TGGTGCTATT ACAGATGATA 240 

TTCGCAATAG CTTGGATTTG AGCTTTATCA TCTGGGATGT CGATAGTCTG GACTGGAAGA 300 

GTAAAAATGA AGCATCTATT TTGACAGAAA TTCAGTATCA AGTAGCTAAT GGCTCTATCG 360 

TTTTGATGCA TGATATTCAC AGTCCGACAG TCAATGCCTT GCCAACGGTC ATTGAGTATT 420 

TGAAAAATCA AGGTTATACC TTTGTGACCA TACCAGAGAT GCTCAATACT CGCCTAAAAG 480 

CTCATGAGCT GTACTATACT CGTGATGAAT AAGCAAGAAA AAATAGGTCT CTTAGATATT 540 

TGACAGACTT AT TT TTT ACA GAATATACTA CTACTTAAAA AATGTTTTAT GCTATAATTG 600 

ATGAATAAAA TAGAAGGAGA AGCATATGAA TACCTATCAA TTAAATAATG GAGTAGAAAT 660 

TCCAGTATTG GGATTTGGAA CTTTTAAGGC TAAGGATGGA GAAGAACCCT ATCCTCCACT 720 

GTTACAAGCC TTCAACCCTC GTTATCGTCA TATTCATACG GCGGCCATTT ATCAGAATGA 780 

AGAAAGTGTT GGTCAAGCAA T CAAAG AT AG CGCAGTTCCA CCTGAACAAA TCTTCGTAAC 840 

TACCAAGCTT TGGAATAGTC AGCAAACCTA TGACCAAACT CGTCAAGCTT TGGAAAAATC 900 

TATAGAAAAA CTGGGCTTCC ATTATTTGGA TTTGTATTTG ATTCATTGGC CCAACCCAAA 960 

ACCCCTCAGA GAAAATGACG CATCGAAAAC TCGCAATGCG GAAGTTTGGA GAGCGATGGA 1020 

AGACCTCTAT CAAGAAGGGA AAATCCGTGC TATCGGCGTT ACCAATTTTC TTCCCCATCA 1080 

TTTGGATGCC TTGCTTGAAA CTGCAACTAT CCTTCCTGCG GTCAATCAAC TTCGCTTGGC 1140 

GCCAGGTCTG TATCAAGATC AAGTCGTACC TTACTGTCGT GAAAAGGGAA TTTTATTGGA 1200 

AGCTTGGGGG CCTTTT G GAC AAGGAGAACT CTTTGATACC AAGCAAGTCC AAGAAATAGC 1260 

AGCAAATCAC GGAAAATCGG TTGCTCAGAT AGCCTTGGCC TGGAGCTTGG CAGAAGGATT 1320 

TTTACCACTT CCAAAATCTG TCACAACCTC TCGTATTCAA GCTAATCTTG ATTGCTTTGG 1380 

AATTGAACTC AGTCATGAGG AGAGAGAAAC CTTAAAAACC ATTGCTGTTC AATCGGGTGC 1440 

TCCACGAGTT GATGATGTGG ATTTCTAGAA AATC AT AAAA AGAATTGTAC ATTATTCTAA 1500 

TTTTTGATAT AATAGTCAGC AGGAAAGAAA CTCTTATGGC GTTCTTCAAG CGAGCTTGCG 1560 

ATAGTGGGAG CCAAGTAGGG CAAAATAAAG CGCTGGCGCT TTCTGTAGTA TTTTCAAAAA 1620 
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ATAAATTAGC 


GTGGAACCGC 


GTTTCTGACG 


CCCCTAGGTT 


AAATCAACCT 


1680 


AGGATTGTCA 


GATGTCGTTC 

wn 1 V»9 4 WVJ 4 4 w 


TTTTGCTTAT 


TCAGTCTATT 


GTGTGAAAGA 


AAGGAGAGCC 


1740 




^*f**P A*T*P* TP*I*PIT 

1 I A W a 


AAAAGACGAT 


AGTCAACTAG 


CTACATTTCG 


TGATTTTGTA 


1800 


CTAACAAATA 

W A W^W% A « 


PTf.AAAACTT 
v. 1 WAAAAW 4 1 


G AAA C ATT AT 


CAATCTTTTT 


TAAAGAATGA 


ACTTGCAGTC 


1860 


TCTGATTTAC? 

4 W A W A A t r\W 


CCTAAGCTCT 

V.V^./V\w\> A W 4 


T ATTTGGT C A 


GATTTTAATG 


CTGCTACACA 


GATTATTAGG • 


1920 


wAAAW 1 wL A w 


*r**rr*r m a a rcr^ 

4 1 WWAAWV. 1 A 


TAPAAATAAT 

A /%W/W% * /W» A 


AGACGAGTGC 


TTATCACGCC 


TGATTTAGCT 


1980 


GTTTGGAAAG 


AATTGTATTT 


ulnI\Aui lu 


A l vunV. a nv.w 


nu A \M 4 A \» 4 


GCAAACTCAA 


2040 


/""•f ft ft T ft/""" ft A & 


GTCACTATCA 


TTCJTTTATCT* 


L>AAAA 1 1 ILL 


TfTTACACAT 

A L 4 L ALAwA A 


TCTAGGACAT 


2100 


W Aw n niif. 1 v. 


ATTGGTUjOA 


CATTTTTTAG 


ATflATTTTCA 


TCCTTATCAC 

I 4 A #% A wnv 


TCTTATATCT 


2160 


ww 1 IWCAAGA 


******** n*r%* m *^*j*fr* 

GGGGATGGTT 


& &n* at at*fa 
uAATAIAi In 


L> I LuLnnu 1 A 


A A 1 L 1 A UnLn 


CAACAGGAAT 


2220 


»T»*¥V*ft. ft ******* iv 


AAAAATTTGT 


AA1LAA 1 L JL 


X L\j 1 nunAL 1 


TTTTCACAAfi 
111] Lrtunnu 


AACTATACTT 

A/Wj AAA AW A 1 


2260 


GGCATTCATT 


GAATGATTTT 


Gb l 1\. 1 X LGA 


L I 1 A I\lA J AA 


unAL A A I wLA 


nu 4 A 1 1 AAA t 


2340 


A *TV~ ft h «V !V < *■« IV 

ATwaATAltG 


******** ft i 't "|M|IH» 

GCGCAGCTTT 


TTGACAGTAG 


nlAAu 1 I uu 1 


nviMAnn a a i A 


GGT AGTCIT AC 

WJ A AVi 4 V> A AL 


2400 


AACCGb l L IT 


X ^ It ***1 > if* A ^ 

AGATTCTTAT 


LA n I A 1 WOO 


L AAA 4 AL AVjA 


A A A A ArTTTT 


V_ 4 A V* A 4 «^ w 


2 4 60 


ATTGGTTTGT 


T C AGC AGAAA 


11 AAT 1 \jAAA 


ry ft ft A AT1TA 
AAuAAA 1 A 1 A 


AliirTAAAfl 
/WW>L I nnnu 


r.A^TAAACAA 

\J*\\J 4 fW^^.f>/\ 


2520 


'T'/**^ ft ft /~" ft ft 


ATTAACATTT 


LAt 1VUA1 1— A 


L> IvOLAuAwn 


LL A LL A A ALA 


U 4 ^ WWW A WW 


2580 


rr****+ « r^********* k 


**** ft / VII » ^ K 

wlAlTAGawT 


uLL I uAuV. 1 A 


unLvLAU l AL 


1 /V/\^ A ^ w A V. 4 


A WWW A WW A *\ k 


2640 


OA 1 V.viAWuAb 




Kj I LUUAAu 1 A 


n A A i\ i 4 k * 4 A 


ATT AACCACT 




2700 


A nn^nftA I In 


ALA 1 1 IUA\. 1 




CAGAAACCTC 


CTTACAGTCG 


GACTGCCCTA 


2760 




T Aft AfTTftPPT 
1 AWAW A WLL * 


CAnrTACAfC 


CAGTACTAAC 


TCGTCTTGCC 


TCGTATAATC 


2820 




AL 1 L W X V* 4 L W 


CAACAAATTA 




ACGAGTATTC 


AATGTCTAAG 


2880 


AAATTAACAT 


TTCAAGAAAT 


TATTTTGACT 


TTGCAACAAT 


TTTGGAATGA 


CCAAGATTCT 


2940 


ATGCTTATGC 


AGCCTTATGA 


TAATGAAAAA 


GGTGCGGGGA 


CAATGAGTCC 


TTACACTTTC 


3000 


CTTCGTGCTA 


TCGGACCTGA 


GCCATGGAAT 


GCAGCTTATG 


TAGAGCCATC 


ACGTCGTCCT 


3060 


GCTGACGGTC 


GTTATGGGGA 


AAACCCTAAC 


CGTCTCTACC 


AACACCACCA 


ATTCCAGGTG 


3120 


GTCATGAAGC 


CTTCTCCATC 


AAATATCCAA 


GAACTTTACC 


TTGAGTCTTT 


GGAAAAATTG 


3180 


GCAATCAATC 


CTTTGGAGCA 


CCATATTCCT 


TTTGTTGAGG 


ACAACTGGGA 


AAACCCATCA 


3240 


ACTGGTTCAG 


CTGGTCTTGG 


TTGCGAAGTT 


TGGCTTGACG 


GAATGGAAAT 


CACTCAGTTC 


3300 


ACTTATTTCC 


AACAAGTCGG 


TGGATTGGCA 


ACTGGCCCTC 


TGACTGCGGA 


AGTTACCTAT 


3360 
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GGTTTGCAGC 


GCTTGGC TTC 


TTACAnCAA 




fTCTCTATCA 


TATCGAGTGG 


3420 


CCTGATGGTG 


TAAAATACGG 


AGAAATCTTT 


» ftv*/"* f**rv? 




rTCAAAAT AT 


3480 


TCATTTGAAA 


TTTCGGACCA 


AGAAATGTTG 


CTTGAAAACT 


l 1 vjn I /W\w 1 i 


TCAAAAAGAA 


3540 


GCTGGTCGTG 


CATTAGAAGA 


AGGCTTGGTA 


CACCCTGCCT 


ATuACTATUT 


TTTCAAATCT 


3600 


TCACATACCT 


TTAATCTGCT 


TGACGCGCGT 






RCCTGCAGGC 


3660 


TATATCGCTC 


GTATCCGTAA 


CTTGGCCCGT 


^»#a>^^»w *l ****** A 

GTCGTAGCCA 


AAALI.1 I i VI i 


1_ w AVliW\UU^ 


3720 


AAACGCCTAG 


GCTACCCACT 


TTTGGATGAA 


ik h ft ft ft ft ^ 

GAAACAAGAG 


CTAAACTCCT 




J ' Dv 


GCAGAATAAA 


GAGAGTGACA 


AATTACGAAA 


ATCGGCGAAC 


AGAGTCACwt 


U iuAuLUnu A 


1 AAA 


TCCCGCAGTG 


ATGAAGGTAT 


CCTTAGTGAA 


ACTAAGGATA 


% ****** Ift ft \ ft 

CTAGGCAAAA 


TTuOAuAL i 1 


i onn 

J 7UU 


TTGGCTCCAA 


TTTTAGCAAT 


GAAACAACGA 


AGTTGGTTGC 


TTGCGTGCCA 


ATCACATAAG 


J70U 


GCAAACTGGA 


AAATAAAAAG 


ATACTTTTCG 


GAGAAAAAAC 


ATGACAAAAA 


ACTTATTAGT 


4UlU 


ACAACTCGCT 


CTTGAAGAAT 


TACCAGCCTA 


TCTTCTTACG 


CCAAGTGAAA 


ftt 1ft 1ft ftV yfttftft %. ***** 

AACAACTAGG 


4080 


CGAAAAAATC 


GCACCCTTCC 


TCAAGGGAAA 


ACCCCTGTCT 


TTTGAAGCCA 


mmw^ ft ft ft ^WftJMM 

TTCAAACTTT 


4 140 


CTCAACACCA 


CGTCGTTTGG 


CTGTTCGTGT 


AACTGGTCTT 


GCAGACAAAC 


ACTCTGATTT 


4200 


AACAGAAGAT 


TTCAAGGCTC 


CAGCAAAGAA 


AATTGCCTTA 


GATAGTGATG 


GAAACTTCAC 


4 260 


CAAAGCAGCT 


CAAGGATTTC 


TCCGTGCCAA 


AGGTTTGACT 


GTTGAAGATA 


TCGAATTCCG 


4 3 20 


TGAAATCAAG 


CGTGAAGAAT 


ATCTCTATGT 


CACTAAGCAA 


GAAATTGGTC 


ft ft ^+0+ ft MfVffe^ ft 

AAGCACjITCA 


4 JBU 


AGCCATTGTT 


CCAGGCATTG 


TGGATGTCTT 


GAAGTCACTG 


ACTTTCCCTG 


TCAGCATGCA 


4440 


CTGGGCGCGA 


AATAGCTTTG 


AATACATCCG 


CCCTGTTCAC 


ACTTTAACTG 


TTCTCTTGGA 


j4 C Art 


TGAGCAAGAG 


TTTCACTTGG 


ATTTCCTTCA 


TATCAAGGGA 


AGTCGTGTGA 


GTCGTGGC.L.A 


4 SOU 


TCGTTTTTTG 


GGACAAGAAA 


CCAAGATTCA 


GTCAGCATTG 


AG CT ATG AAG 


AAGACCi ICC 




TAAGCAGTTT 


GTAATCGCAG 


ATCCATCTGA 


ACGTGAGCAA 


ATGATTA#i AG 


ALLAAAi LAA 


4 ODV 


GGAAATTGAG 


GCAAAACATG 


GTGTACCTAT 


CCAAATTCAT 


CCGGAl 1 lOL 


1 UnA 1 uMu A 


4740 


CTTGAATTTG 


GTTGAATACC 


CAACTGCCTT 


CATGGGAAGT 


TTTGATGCTA 


AATACCTTGA 


4800 


AGTTCCAGAA 


GAAGTCTTGC 


TGACTTCTAT 


GAAGGAACAC 


CAGCGTTACT 


TTGTTGTTCG 


4860 


TGATCAAGAT 


GGAAAACTCT 


TGCCAAACTT 


CATTTCTGTT 


CGTAACGGAA 


ACGCAGAGCG 


4920 


TTTGAAAAAT 


GTCATCAAAG 


GAAATGAAAA 


AGTCTTGGTA 


GCCCGCTTGG 


AAGACGGAGA 


4980 


ATT CTTCTGG 


CGTGAAGACC 


AAAAATTGGT 


GATTTCAGAT 


CTTGTTGAAA 


AATTAAACAA 


5040 


TGTCACCTTC 


CATGAGAAGA 


TTGGTTCTCT 


TCGTGAACAC 


ATGATTCGTA 


CGGGTCAAAT 


5100 


CACTGTACTT 


TTGGCAGAAA 


AAGCTAGTTT 


GTCAGTGGAT 


GAAACAGTTG 


ACCTTGCTCG 


5160 
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TGCACCAGCC ATTTACAAGT TTGACTTGTT GACACGTATC GTTGGTGAAT TTGACGAACT 
CCAAGCAATT ATGGCTGAAA AATACACCCT TCTTGCTGGT GAAACTCCAG CGGTGGCAGC 
TGCTArrCGT GAACACTACA TGCCTACATC AGCTGAAGGA GAACTTCCAG AGAGCAAGGT 
CCGCGCAGTT CTAGCCATTG CAGACAAATT GGATACGATT TTGAGTTTCT TCTCAGTAGG 
ATTGATTCCA TCAGGTTCTA ATGACCCTTA TGCCCTTCCT CGTGCAACTC AAGGTGTGGT 
TCGTATCTTC GATGCCTTTG GTTGGCACAT TGCTATGGAT GAGCTGATTG ATAGCCTTTA 
TGCATTGAAA TTTGACAGTT TGACTTATGA AAATAAAGCA GAGGTTATGG ACTTTATCAA 
GGCTCGTGTT GATAAGATGA TGGGCTCTAC TCCAAAAGAT ATCAAGGAAC CAGTTCTTGC 
AGGTTCAAAC TTTGTTGTGG CAGATATGTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG 
CAAGGAAGAA GATTTTAAAC CATCTGTTGA ATCACTTTCT CCTCCCTTTA ACCTGGCCGA 
GAAGGCAGAA GGGCTTGCTA CGGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC 
TTTGGCAGAA GCACTAGAAA CACTCATTTT ATCAGGACCT GCAAGTCAGC AATTGAAACA 
ACTTTTTGCG CTTAGCCCAG TCATTGATGC TTTCTTTGAA AATACTATGG TAATCGCTGA 
AG AT C AGGCT GTCCCTCAAA ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC 
TAAGTTTGCT TGTTTTAACC AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT 
TATTACAAAG GAGAAGAAAT GGATCCGAAA AAAATTGCTC CTATCAATGA GCTTGCTAAA 
AAGAAAAAAA CAGAAGGCTT AACACCACAA GAAAAAGTGG AACAAGCCAA ACTACGTGAG 
GAGTACATCG AAGGTTATCG CCGCGCTGTT CGTCACCACA TTGAAGGAAT CAAAATTGTG 
GACGAAGAAG GAAACCATGT TACACCAGAA AAACTACGCC AACTACAACG TGAAAAAGGA 
TTACATGGCC GTAGTCTTGA TGATCCAAAT TCATAATAAT ACTCTTCGAA AATCAAATTC 
AAACCACGTC AGCTTCACCT TGCCGTACTT AAGTACAGCC TGCGGCTAGC TTCCTAGTTT 
GCTCTTTGAT TTTCATTGAG TATATGTATT CTTTCTTTTA ACAAAGATAG ATGAAACGAT 
AACAAAGAGA CTAGCAGTTT GTGTTTGCTA GTCTTTTTTC GCTAAAAAAG GAACCATAAT 
GGTTCCTAAA AACTATCATT AGTAACTTGC ACCGGCTGTA GCGTCTGCGT CACCACCCTG 
GCCTCCAGCA TCCCCTGAAT CAGAAGCGCC AGAAGTAGCA TCGGCGTCTC CATGACCTCC 
GGCAGCAGGA GCAAATGGTC CGCTACCACC CACCAAACGT TGACCAGTCT CTTTTAGCTA 
CCAGTCAAGC CATGGTTGGA AGTTAAAGAC GATTTCATTC ATACCAGCGT ATGATCCATC 
AGGATAGTAC ATTGCTTGGT AGTTGTGAGT GTTGATAACA CCTGCAGGAG AACCTGGAAC 
GATCGTACGG ACGTATTCTT GGTTTCCGTT GCGAAGTGTT CCCATAACCC ACTCTACGTT 



5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 
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CTTCATACGT 


GCTGGTGGAA 


CAGAACCATG 


AACAGTCGAC 


AT ACGGCT AC 


CTGATTGAGG 




TCGTACACGT 


TTAGCGAACA 


TAGTGTCTGG 


ATCTTGGTGA 


GCCTTCTTGT 


AGTAGAGGAA 


7020 


TTGGTTGTTG 


TCCTCAGCGT 


ATGTCAATTC 


AAATGCCATA 


CCTTTCAAGA 


ACATATCAAT 


7080 


TTGGTTAACT 


GTTACCATAC 


CGTGGTCCAA 


TTTGACATAG 


CTATCACCAG 


AAACACCACC 


7140 


AGTGAATGCT 


GCAACTTTTT 


CTACCCATTC 


TGGATCGTCA 


GGGTCAACTT 


CTGTGATGGT 


7200 


TGTAGCGATT 


GCTTTTCCAC 


AATCCAAGTC 


TTCTGATTCG 


ATTGCTTTTG 


GTTTTTTCAA 


7260 


TTTCGAAACG 


ACTCCTACGT 


ATTTAACAAA 


GTTATCTAAG 


CAACTTTCAA 


GGAATTTAAC 


7320 


AGTCCCTTCG 


TTCGTGATAT 


TTCCGTTGTT 


ATCAAAAGCT 


TCCTTAGCTT 


TACCAAGAAG 


7380 


GAATTCGTTA 


CCTGGAAGCG 


TGTAGGCATT 


AACACCTGGA 


GCATCAAGGA 


TTTTACGAAG 


7440 


GTGAACTTGA 


GCACGTCATG 


TTCCTTGGTC 


ATAGTATGAT 


GCACCCACAA 


TCATAACAGG 


7500 


CTTGTTTTCA 


AATGGATGAA 


CTTCGTATGA 


AAGCCATTCA 


AGTACAGATT 


TGAGTGAAGC 


7560 


TGAGATACTG 


TGGTTATGCT 


CAGGAGTAGC 


AATGATAACA 


CCATCTGCAC 


CAGTAATTTT 


7620 


GTTATATAAA 


TAACGTAATT 


GCAAACTTTC 


ATCCCATTTT 


TCATCTTGGT 


TAAACATTGG 


7680 


AACTTCGTCA 


ATTTCAAGAA 


CTTCTAATTC 


AAATTTGAGT 


TTGAAGTAGC 


GACGGATAAA 


7740 


TTCCAAGAGC 


TTACGCTTAT 


ATGATTGATC 


CTAGTTTGAT 


CCAACAAGTC 


CAACAAATTT 


7800 


CATTCTTTTT 


GGTCTCCTAT 


CTTACAAATT 


TTCCCAGTCA 


AAGTCTT C AG 


CATCTTTCCC 


7860 


AAGTAATTCT 


TGTGCATTAC 


GTAATTTTTC 


TCTCATTTTT 


ACAAAGATAC 


GGAAGTCATC 


7920 


AAAGATGGCA 


TCCAATTTCT 


TGATAACATC 


AAGGTCAACC 


AACTCGCCAC 


TTGCGTTAAA 


7980 


TGCTTGAAGA 


GACTGTCAGA 


GCAAGAATTC 


ATCTGCAACA 


ACATTTGCCT 


TGATTTCAGG 


8040 


AGCATTCAAG 


ATTTGACGAA 


GTTGCAATTG 


CGCACGAGAT 


GAACCAAGCG 


TACCCTAAGA 


8100 


AGCACCTGTA 


ATCATGATTC 


CTTTGTTCAA 


AAGTGGGTAA 


ATACCATAAG 


ACAACCAAGC 


8160 


AAGAGCGCTC 


ATCAAAACAG 


CTGGAATAGA 


CTCATCATAC 


TCAGGAGTAC 


CGATAATAAC 


8220 


GCCATCTCCC 


TCTTCGATTT 


TAGCAGCAAT 


TTCCAATATT 


TCAGCACGTA 


CTTGCTTGTC 


8280 


AGCTGGTTTG 


TTGAAGACAG 


GAATGGCCTT 


CATTTCAACA 


AGTTCAATTT 


CAGCTTTGTC 


8340 


AGTAAAGTGT 


TTTTGCATGT 


ATTGAAGCAA 


TTCACGCTTT 


CTAGAACGTT 


TTGAATTTGT 


8400 


TCCAACAATA 


GCAATAACT7 


TTAACATGAG 


ATTTCCTTTC 


TCTTTTTACA 


TAATACAATT 


8460 


TTAAAATTCC 


ATTGAAACAG 


TTGTCTCTAT 


AGAGTAGGAA 


TTCCTGAAGA 


ACAGCTTAGG 


8520 


TGGCCTTCTT 


TATCGATGAG 


GATGACTTCG 


ATGCCCTCCA 


AACTTTCGAC 


TTGCCAGAGG 


8580 


ATAGAAGCAG 


GTCTTTCTCC 


AAAGAGTCGA 


GTCCTCCAGA 


TTTCGCCATC 


GACTGATTTA 


8640 


TCAGAGATGA 


TTGTTACACT 


CGCTAGTTCC 


GTTTCAACAG 


GATATCCTGT 


TTGACTGTCA 


8700 
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AAAATCTGAT 


GGTAATCTTG 


TCCATCGACG 


GTCAGGTGAC 


GTTCATAAAT 


GCCTGAAGTC 


B /Ov 


ACGACAGATT 


TATTCACAAC 


AGCGATGGTC 


ATTAAATGAT 


TTCCCCTAGG 


ATTGGCTGGG 


aaon 
oo«u 


TCTTGAATCC 


CGATTTCCCA 


TGGCTTATCC 


CCTCTTGCCT 


GATTTTTTCC 


AATGGTCAGG 


O Q O A 


ATATTCCCTC 


CCAGATTGAT 


CAAGGCAGAA 


CTCACCCCCT 


CTTTCCTAAC 


AAATTGGGCA 


o Oil rt 
079U 


ACCTTATCCG 


CACTCTATCC 


TTTGGCTAAA 


CAACCTAGAT 


CGATCTTCAT 


lXILTTTCTGT 


9000 


TTTAAAAACA 


CAGTAGAAGT 


AGAAGAATCT 


AACTCGATAC 


CATGAGCATT 


GATTAGAGGC 


9060 


AGCACCCATT 


CAATTTCTTG 


AGGCTGGGCG 


ACCTTGGCAT 


CTGAAAAACC 


GATACCCCAG 


9120 


GTTTGAATTA 


AGGGACCAAT 


CCTGATATTG 


AGCTGGCTAG 


AGAGCCCTAG 


GCTATCCTCT 


9180 


AAC CCAAGTG 


AAATC AG CTC 


AAACAGGTCT 


GGATGAACCG 


TGACGGGGGC 


TATTCCTGCT 


9240 


TGATAATTGA 


TTTCCATCAA 


CTCAGATTCT 


TGACTATTGG 


CGTTCAAGCG 


GTATTCAAGT 


9300 


TCTTTGAGCA 


AGTCAAAGCA 


TTTTTGGAGA 


AAGATATCGG 


CTTGCTCATC 


CACTAATGAA 


9360 


ATAGTGATAG 


TAGTCCCCAT 


T AG CCGTTCA 


GAATGTGAAC 


GAAGAGTCAA 


GCTACCAACT 


9420 


CCTTTCTCTT 


ATAGAAAATA 


AGTTGTAATA 


TCAAATAATC 


ATCTAAATTG 


AAGCCCTTAC 


9480 


ATTTCATTTT 


CATGTTATTA 


TAATACCATA 


AAGTTAGAAT 


TTTCACAAAC 


AAAATTTGGA 


9540 


AAAACTCAAC 


AAATATGCTC 


ATAAAATTCA 


TCAGCCTTGA 


AAACAGGATA 


AATGGGGAAT 


9600 


TATTTTTGAT 


AAAAAATGCT 


GAAATAATAG 


TACCCCCCTT 


GTAAACCCTA 


ACGGTAAATG 


9660 


CTATACTAGT 


AAGGTAAATT 


TAGAATGAAG 


GCAGGAAATT 


TTTATGAGTA 


AAATCGTTGT 


9720 


AGTCGGTGCT 


AACCACCCTC 


CTACACCATC 


TATCAATACC 


ATGTTGGATA 


ATTTTGGAAA 


9780 


TGAGAACGAA 


ATTGTTCTAT 


TTGACCAAAA 


CTCTAACATC 


TCTTTCCTAG 


GATGTGGAAT 


9840 


GGCTCTTTGG 


ATTGGTGAAC 


AAATTGACCG 


TGCTCAAGGC 


TTGTTCTATT 


CTGATAAAGA 


9900 


AAAATTGGAA 


GCTAAAGCTG 


CTAAAGTTTA 


CATGAACTCA 


CCTGTTCTTT 


CAATCGACTA 


9960 


TGATAACAAA 


GTACTTACAG 


CGGAAGTTGA 


AGGAAAAGAG 


CACAAAGAAT 


CATACCAA/A 


10020 


ATTGATTTTC 


GCTACAGGCT 


CTACACCAAT 


CTTGCCACCA 


ATCGAAGGTG 


TTGAAATTGT 


30080 


TAAAGGAAAC 


CGCGAATTTA 


AAGCAACTCT 


TGAAAACCTA 


CAATTCCTGA 


AATTGTACCA 


10140 


AAATGCTGAA 


GAAGTTATCA 


ATAAACTTTC, TGACAAGACC 


CAACACCTCG 


ACCCTATCGC 


10200 


CGTTGTTGGT 


GGTGGTTACA 


TCGGTGTTGA 


ACTTGCTGAA 


GCCTTTGAAC 


GTCTTGGAAA 


10260 


ACAACTTCTC 


CTTGTTGATA 


TCGTTGATAC 


TGTCTTGAAC 


GGTTACTATG 


ACAAAGACTT 


10320 


CACACAAATG 


ATGGCGAAGA 


ACTTGGAAGA 


TCACAACATC 


CCCTTGGCTC 


TAGGTCAAAC 


10380 


TGTTAAAGCA 


ATCGAAGGTG 


ACCGTAAACT 


TGAACGCTTG 


ATTACTGACA 


AAGAAAGCTT 


10440 
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TGACGTCGAT ATGGTTATCC TTGCAGTTGG TTTCCGTCCA AACACAGCCC TTGCAGGTGG 10500 

TAACATCGAA CTCTTCCGCA ACGGTGCCTT CCTTCTAGAC AAGAAACAAG AAACATCTAT 10560 

CCCAGACCTT TACGCTGTTG GTGACTGTGC GACT C TTTAT GACAATGCTC GTAAAGATAC 10620 

AAGCTATATC GCTCTTGCTT CAAATGCTGT GCGCACTGCT AACGTTGCT 10669 

(2) INFORMATION FOR SEQ ID NO: 5B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 2 base pairs 
( B J TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



CCCGCTAATA 


GATACTTTAT 


GATAGAATAA 


AGAACAAGAT 


TGACAAGTAA 


GAGGAAACAT 


60 


TATGCAAAAT 


CAAACACTCA 


TGCAATACTT 


TGAATGGTAT 


CTCCCCCACG 


ACGGTCAACA 


120 


CTGGACGCGT 


CTCGCTGAAA 


ATGCTCCACA 


CCTAGCTCAT 


CTGGGGATCA 


CTCACGTCTG 


180 


GATGCCACCA 


GCCTTCAAGG 


CAACCAACGA 


AAAAGATGTC 


GCCTATGGGG 


TCTATCACTT 


240 


ATTTGACTTA 


GGAGAGTTCA 


ACCAAAAAGG 


GACTGTCCGC 


ACCAAGTATG 


GTTTCAAAGA 


300 


AGACTATCTT 


CAAGCCATTC 


AAGCCCTTAA 


ACCACACGGA 


ATTCAACCTA 


TGGCCGATGT 


360 


AGTTCTCAAC 


CACAAGGCTG 


CTGCCGATCA 


CACGGAAGCC 


TTTCAGCTTA 


TCGAACTTGA 


420 


TCCTGTAGAC 


CGTACAGTTG 


AACTTGGAGA 


ACCCTTCACC 


ATCAATGGCT 


GGACTAGTTT 


480 


TACCTTCGAT 


GGTCGCCAAG 


ATACCTATAA 


TGGCTTCCAC 


TGGCATTGGT 


ACCACTTCAC 


540 


CGGTACAGAC 


TACGATGCCA 


AACCCAGTAA 


ATCTGGGATT 


TATCTGATCC 


AAGGGGACAA 


600 


CAAGGGCTGG 


GCCAACGAGG 


AATTGGTCGA 


TAACGAAAAC 


GGAAACTACC 


ACTACCTCAT 


660 


GTATGCCGAC 


CTAGACTTTA 


AACATCCTGA 


AGTCATCCAA 


AACATCTATG 


ACTGGGCTGA 


720 


TTGGTTCATG 


GAAACGACTG 


GTGTACCTGG 


TTTCCGTTTG 


GATGCCGTTA 


ACCATATTGA 


780 


CTCTTTCTTT 


ATGCGCAACT 


TCATCCGCGA 


TATGAAGGAA 


AAATACGGTG 


ACGATTTCTA 


840 


TGTTTTTCGT . GAATTTTGGA 


ACCCAGACAA 


CGAAGCCAAT 


CTGGACTATC 


TCGAAAAAAC 


900 


CGAAGAACAC 


TTTGACCTTG 


TCGATGTTCG 


TCTCCACCAG 


AATCTCTTTG 


AAGCCAGTCA 


960 


AGCTGGCCCA 


AACTATGACC 


TTCCTGGCAT 


TTTC A C AG AT 


AGCCTGGTTG 


AACTCAAGCC 


1020 


TGACAAGGCT 


GTGACTTTTG 


TCGACAACCA 


CGATACCCAA 


CGAGGACAAG 


CCCTTCAGTC 


1080 


TACCGTTGAA 


GAATGGTTCA 


AGCCAGCAGC 


CTATGCCCTC 


ATTTTGTTAC 


GCCAAGACGG 


1140 


CCTTCCATGT 


GTCTTTTACG 


GAGACTACTA 


TGGGATTTCA 


GGGCAGTATG 


CTCAAGAAGA 


1200 
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TTTCAAACAA 


ATCCTTGACC 


GCCTCCTAGC 


CATCCGAAAA 


CATTTGGCCT 


ATGCAGAACA 


1260 


AAATGACTAC 


TTTGACCATG 


CTAACTGTAT 


CGGTTGGCTA 


CGTTCAGGTG 


CTGAAAATCA 


1320 


ATCCCCAATC 


GCAGTCCTTA 


TCTCAAATGA 


CCAAGAAAAC 


AGCAACTCAA 


TCTTTGTCGG 


1380 


TCAAGAATGG 


ACTAATCAAA 


CCTTTGTAGA 


TTTACTTGGT 


AACCACCAAG 


GTCAAGTTAC 


1440 


AATTGATCAG 


GAAGGTTATG 


GACAATTCCC 


TGTCTCAGCT AGATCCGTAA GTGTCTGGGC 


1500 


AGTCAATACC 


ATCTAATAGC 


TCATAATAAC 


CAAGCTAGGT 


CCAAGCGGAT 


TTCGcrrrrr 


1560 


TGTATTCACA 


AAAAGACCTA 


CCCAAATGGA 


TAGATCTTTA 


CTTGATTACA 


ATTTACCTGC 


1620 


TACTGCATCC 


AACAATTCTT 


GGATCTTAGG 


TTGGTTGCTT 


CCTCCTGCCA 


TGGCCATATC 


1680 


TGGTTTACCA 


CCACCACGTC 


CATCGATGAT 


TGGTGCTAAT 


TCTTTGACAA GGTTTCCTGC 


1740 


ATGAAGGTCT 


TTTGTCTTGC 


TTGCTACAAG 


GACATTGACT 


TTGTCACCGA TAGCGGCAAC 


1800 


TAGGACAAGA 


AGATCAGAGT 


AGTCTTTTTG 


TTTCCAGTTA 


TCTGCAAAAG 


TACGAAGGGC 


1860 


ArCCCCATCC 


GATACAGACA 


CTTGACTAGC 


AATCTAACGA 


TGACCGTTGA 


CTTCCTTAAC 


1920 




ATATCGCCTC 


CGGCTGCAGC 


TGCGGCTTTT 


TCTTTCAACT 


CAGCATTTTC 


1980 


TTTTTGAACT 


TGACGAAGTT 


GTTCTTGAAG 


TCCTTCTACC 


TTGTGAGCTA 


CTTCCTTGAC 


2040 


TTCAGGTGCT 


TTCAAGGTTC 


CTGCGATACC 


TTTAAGAGCA 


TC CTCTTGTT 


CACGATAGGC 


2100 


TTCAAAGCCT 


XCCTTACCAG 


TCACTGCCAA 


GATACGGCGA 


CTTCCTGAAC 


CGATTCCTTC 


2160 


TTCTTTGACA 


ATTTTGAAGA 


GACCAATCTC 


ACAACTGTTG 


TCAACATGAG 


TACCACCACA 


2220 


AAGTTCAATA 


GAGTACTCAC 


CGATAGTCAC 


GACACGAACT 


TCCTTGCCGT 


ATTTCTCACC 


2280 


AAAGAGGGCC 


ATAGCTCCCA 


TTTCTTTAGC 


ACTGTCAATA 


TCCGTTTCAA 


CTCTCTTCAC 


2340 


TTCAAGTGCT 


TCCCAAATTT 


TCTCGTTAAC 


TTGCTGTTCA 


ATCGCACGAA 


GTTCCTCAGC 


2400 


AGTTACTGCT 


TGGAAGTGGG 


TAAAGTCAAA 


GCGAAGGAAT 


TCAACTTCCT 


TAAGAGATCC 


2460 


TGCCTGTGTT 




CAAGGATATT 


GTGAAGGGCA 


GCGTGAAGCA 


AATGAGTCGC 


2520 


ACTGTGGTTT 


TTCATGACAC 


GGTGACGGCG 


ATTGCTATCA 


ATTGCCAAGG 


TATATTC TTG 


2580 


GTTCAAGGCA 


AGCGGTGCAA 


GCACTTCAAC 


TGTATGAAGG 


GCTTGACCAT 


TTGGGGCTTT 


2640 


CTGAACATTG 


GTCACAGTAG 


CCACAACCTT 


ACCTCACTCA 


TCCAAGATTT 


GTCCGTAGTC 


2700 


AGCTACCTGT 


CCACCCATTT 


CAGCATAAAA 


TGACGTTTCC 


GCAAAGATAA 


GAGAGGCAGT 


2760 


TCCTTCTGAA 


ACAGCTCCTA 


CTTCTGCATT 


GTCAGCAACG 


ATAGCTACCA 


ATTTAGAAGA 


2820 


CAATTGGCTA 


GCATTGTAGT 


TGAAGACACT 


TTCTACACTG 


ATGTTTTGAA 


G AGTTTC ATT 


2880 


TTCCATACCC 


ATTGAGCCAC 


CCTTGACAGC 


TGACGCACCC 


GCGCGTTCTT 


GCTGTTCTTT 


2940 
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CATGGCTGCT 


TCAAAACCTT 


CACGGTCTAC 


AGTCATACCA 


GCTTCTTCAG 


CGATTTCTTC 


3000 


ACTCAATTCA 


ACTGGGAACC 


CATAAGTATC 


ATAGAGTTTG 


AAGACATCTG 


AACCACCGAT 


3060 


AACAGATTGA 


CCTTTTTCTT 


TCAAGTCTGC 


TACAATGCCT 


TGGGCAAAGT 


GTTGACCTGA 


3120 


GTCAAGGGTA 


CGGGCAAATG 


ATTCTTCTTC 


GCTCTTAACG 


ATTTTCTCAA 


TAAAGTCACG 


31B0 


TTTCTCAAGC 


ACTTCTGGGT 


AGTAGCTTTC 


CATGATTTTT 


CCAACAGTTG 


GAACCAATTT 


3240 


GTAAACGAAA 


GGCTCCTTGA 


TACCCAATTT 


TTGACCATGC 


ATAGAAGCAC 


GACGGAGAAG 


3300 


ACGACGAAGA 


ACATAACCAC 


GACCTTCATT 


TCCTGGAAGG 


GCACCATCAC 


CGATAGCAAA 


3360 


TGAAAGAGAA 


CCAATGTGGT 


CTGCGATAAC 


CTTGAAGCTC 


ATGTTCTCGC 


CATCTTGGTC 


3420 


ATAAACCTTA 


CCAGACAATT 


TCTCGACTTC 


ACGGATAATC 


GGCATGAAGA 


GGTCCGTTTC 


3480 


AAAGTTGGTC 


TTAGCCCCTT 


CGATAACGCC 


CACCAAACCC 


TCCAAACCAG 


CGCCCCTATC 


3S40 


AATGTTCTTA 


TGTGGCAATT 


CCTTCTATTC 


CCTACGAGGA 


ACAGCACGCT 


CTGCCTTAAA 


3600 


TTGTGACAAA 


ACGATGTTCC 


AGATTTCAAT 


ATAACGGTCG 


TTTTCAATAT 


CTTCTGCAAG 


3660 


CAGGCGAAGA 


CCGATATTTT 


CTGGGTCAAA 


GGCTTCCCCA 


CGGTCAAAGA 


AGATTTCTGT 


3720 


ATCTGGTCCA 


CAAGGTCCCG 


CACCGATTTC 


CCAGAACTTG 


TCCTCAATTG 


GAATCAAGTG 


3780 


ACTTGGATCC 


ACTCCCACTT 


CAATCCAGCG 


CTTCTAAGAA 


TCTTTATCGT 


CTGGATACTA 


3840 


GGTCATGTAA 


agtttitcag 


CAGGCAAATC 


AAACCATTCA 


GGGCTTGTCA 


AAAGCTCATA 


3900 


ACCCCAAGTG 


ATAGCTTCGT 


CACGCAAGTA 


ATCCCCGATA 


CAGAAGTTCC 


CCAGCATTTC 


3960 


AAACATGGTA 


TGCTGACGCG 


CGGTCTTCCC 


TACGTTTTCG 


ATGTCGTTGG 


TACGGATACC 


4020 


CTTTTGGGCA 


TTGCTAATAC 


CTGGATTTTC 


AGGGATAATG 


GTCCCGTCAA 


AGTATTTCTT 


4080 


AAGGGTTGCT 


ACCCCAGAGT 


TGATCCACAA 


AAGACTTGGG 


TCATTTACAG 


GAACCAAACT 


4140 


TACTGATGGT 


TCTACTGAGT 


GACCTTTGGT 


CGCCCAGAAA 


TCAAG CCACA 


TTTGGCGTAC 


4200 


TTGTGCACTA 


GATAGTTGTT 


TCATATTGTC 


TCCTTATTCA 


CTTGTTTAAT 


GTCATTGCCT 


4260 


TTCCAGCATT 


TCCACATAGT 


CAATCGCGAC 


ACAGAGGGAA 


ATGACTAGGT 


CTGCATAAGC 


4320 


GTCTTCAAGA 


ACCGTTACGG 


TATAGGTAGA 


AGTCACATGG 


AAGAGTTCCT 


TCTTAATTTC 


4380 


CGCAATCAAC 


TGATCCCCAT 


CATCCAGCAA 


TTTGAAATTC 


AAATCCCAGA 


TATTGCCCTC 


4440 


GATACGAAGA 


C CTAGATT AT 


CAAACTCATA 


CTTATCTCGC 


CAGAAGGTCA 


ACTTCTT ACG 


4500 


AATGACAAAA 


CTCGAGCCAT 


CCCGAAGCTG 


AATTTCAAAA 


CGAGGAAGCA 


ACGTCAAGAT 


4560 


TTCTTTACTA 


ATCTCACTGA 


CTTGTTCACC 


AGCCGCATCA 


TAGATGGTAA 


AGGTTTTAGG 


4620 


AATCTTAAAA 


AATGATCCCT 


CCACCTGATA 


GGCAATTTCT 


CCCCTGTCAT 


CCTTCATAGC 


4680 


GAAGCGTTCG 


CCTCCAAGAC 


GAAACTTTTG 


TTTGACAAGA 


AATGTTTTCA 


TCAACACCTC 


4740 
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4 



LAAAnnl LAA 




CATATCACGA 


AGGGCGAAAA 


ACCCCGCTAC 


CACCTTCATT 


4800 


CAATGAACTT 


GTCATTCTCT 




CAATTGTATC 

W A A w A * w 


ATTGAGTAGC 


ATCACTTCCT 


4860 


AGCTTAGATG 




rrr.rrATTTC 


TCTGGACTAA 


CACAAGTGAA 


AATCAATTCT 


4920 


CAACTTTCTT 


ATTATAACGT 




TTfirftTCAAC 


TGGAAATGAT 


CTCCGTTGAA 


4980 


TTAGACCAAT 


TCL.L/av*A.A 1 \- 


1 1. Iwn 1 1 nv» a 


'f*! 1 ^^*^ Ai^U"! AT* 
Alt* WrtVJVWA 


ATATTTTTTC 


TTACTGCCAT 


5040 


TTTTCTTTTT 


ATCCCAAAi I 


1 L V»~A Inl i nV> 


TAAACACAGC 


TACTAGAATA 


TTTCCAAATA 


5100 


TAAACGTCCC 


TATCACCCAA 


I A 1 A 1 L»VjAV_ 1 




fiTATTRTCfiA 
u a n a iwi wwi 


TCCAAGOCAT 


5160 


CCTTTAAATG 


GAATAGTATA 


GCACHlwT 


TA1(*A ATrAT 


A AA fSfTTTGGP 


CAGAAACTTT 


5220 


TTTTGAAAAA 


AGTAGACATT 


TTCATTATTT 




TCTCTAACGT 

A ^ A «J A *mr\\j\j A 


TAATACTCAA 


5280 


TAAAAATCAA 


AAAGCAAACT 


AGGAAGC ~l AG 


V. V- 1 Uv\uL 1 \j 


1 AL 1 l\inU 1 A 




5340 


ACGCTGACGT 


GGTTTGAAGA 


GTATAGG LIT 


arTiTArTAr 
l /vl av. i nv. 


T A^T"/" A Aftf* A 


AATAAAfAAA 


5400 


TAAACAACTA 


GAATAGAAAA 


AGATAGGGCT 


f*P A & & A Af**Ff* 


ACTTCTATTC 


HTAAAAAPfi 


5460 


AACCACCTTG 


ACTCATTCGT 


CTTCTTACGT 


TTATCTCCTA 


l. i ILUViAlAL 


ATTTTA^^^CT 


5520 


CTAGGAAGAG 


GTCGCTATAT 


TTCCCTOTLt 


All IA lUu I Km 


AAA 1 i \- A 


T AAACTTCT A 


5580 


GGTGTTTCAT 


GGTTTCAACA 




AOut. V— 1 a A l \m. 


• t l~ i i I VI 1 i 


TCCTCTGGGA 


5640 


GCAATTCCTT 


CGCTGGTAGG 


ii 1 uu lu 1 I 


AATi<7f(*CAr 


ATACTCCGCA 


TTTTGCAGAG 


5700 


CATTTTCAGG 


TTTCAACATA 


AAu 1 1 a*A 1 nA 


Ar/:rATAnnr 


TR A( '.TTT TRC 

A A A A A VJVI 


TTTTTAACTG 


5760 


TTTTGGCAAT 


GACCATAVi I* 




GATTGCTGGC 


CTCTCTCGGT 


ACCACATAAC 


5820 


CTAGATTTTC 


ATTTTTTTCT 


A Af* A*TT"W:f^f* 




AGAGAAGGTC 


ACGCCGATTG 


5880 


CAACA I I Ai i 


L. 4. jAA . w A 1 A 


1 Awl^Ww » i 


TCTCGTCCGC 

^\A9 4 W^W\#W 


AACGATAGCC 


TTGATATTTG 


5940 


GAGTCAGTTT 


Vj 1 Aunvi^ I In 




CTTCCAACTG 


CTGCAGATCC 


TTGGACTTGA 


6000 


GGCTGTAGCC 


GAGGGAATTG 


AGTCCTAGTC 


CCAGCACCTC 


ACGCCCCCCA 


TCAAAGAGCA 


6060 


TGATAGAATT 


CTTATACTCC 


GGCTTCCAAA 


GGTCATCCCA 


ATGCTCAGGC 


GCTTCATCTA 


6120 


CCATGGTTTC 


GTTCTAGACA 


ATTCCTAAGG 


TTCCCCAGAA 


GTAAGGGATG 


CACAATTTAT 


6180 


TACCTGCCTC 


AAAGCACTGG 


TTGAGAAACT 


CTGCTCCGAT 


ATTTTCGATT 


CCTTCAATTT 


6240 


TTCAATAATC 


AAGCGGAACC 


AAGAGGTCTT 


CGTCCTTCAT 


CTTGTTAATC 


ATGTATTCAC 


6300 


TTGG AATGG C 


AATATCGTAG 


GTCCTTCCAC 


C CTGCTTT AT 


CTTAGTGTAC 


ATGGCTTCGT 


6360 


TGGAGTCAAA 


AGTCTCGTAC 


TGAACTTGAA 


TTCCTGTTTC 


TTCTGTAAAC 


TGAGTCAACA 


6420 


GTTCAGGATC 


GATATACTCT 


CCCCAGTTAT 


AGATAACCAA 


TTTTTGACTA 


TCTCGACTAT 


6480 
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TGATTTTACT ATCTAAATGA GTCGCAATTC CCCACAAGAC 
CTGCTAAAAA TGAATACATT TTTTTCATCC TTGCTCCTCC 
ATAACCTACA ACTAGGATAA TACTAAAGAC AAAGACTAGA 
TAAGGAAATC CCCTTGCGAG CACGAGAGTA AATCTCGACT 
TCCTGTTACA AAGAACGTCA CGGCAAAGTC ATCTAACGAA 
ACCAGTAATG ATAGACGGAG TCAGGTAAGG AAGCATCATT 
ACTAGCTCCC AAGTCATAGG CCGCATGAAT CATCTCGCCA 
CAAGACCATC AAGACCACGA TAGGAATGGA GAAGGCCACG 
AAAGCCAAGT GAAAACTTGA CTTGGGTAAA GAGAATCAAG 
GTCAGGCGCA ACCATGAGGA TATTATTGAG TGATAGAAAC 
AGACTGGTAG ATGTAAATGG CACCAAAAGT CCCGATAATG 
GAACGCCAAG AAAAATGTCT GAGCCAAAAT CAGCATGAGT 
TTCAAAGTGA CTCCAGCTAA AACCTCTAAA GCTATTCATA 
ATAGCCAATC AAGTAAAAGA TAGGCAGGTA GAGGACCAGA 
GTTGGCAAAT TTTTTCATCC TTCTCTCCTT TCCTTAGTCA 
GTCAGGATGA CAATCACACC GATGGTTGAA CCCATACCAT 
TTCTGCTCAA TAGCCCTCCC CAAGGTGATA ACGCGTTCCC 
TG AAG AG ACT CAAACTTGGG ATAAAGACCC ACTGAACCCC 
(2) INFORMATION FOR SEQ ID NO*. 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9223 base pairs 

( B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
ID) TOPOLOGY: linear 



AAGGATAATC 

TTCTCACGAG 

GCAGACAGGG 

GATAGCCTTG 

TAGCTGAAGG 

TCCTTGAACA 

TTCATTTCCT 

TGACTACATA 

AACCTACCAC 

GCTTCTTGGT 

GTCGCTATCA 

CTCCCATCTC 

TCATCACCAG 

AAGACCAGTC 

CCCACATGCT 

ACTTCTCATT 

ACCAATCAAA 

GG 



GCTCCAATTC 
AGATAAAGTA 
CATTGATTTC 
AAAAGCCATT 
CCATGAAATA 
TCTGAAATTG 
TGAGTCGAGG 
GAACGGTCAA 
CAATCATAAC 
ATTTCTTACG 
AGGCTGATAG 
CAAACATGGT 
CATTAAAGGC 
CCAGATAAAG 
GATGAACATG 
GCTTAGAAAA 
CGGGTCAGCA 



6540 
6600 
6660 
6-720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7542 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: S9 : 
AAAACCAAAT TCCGGTATTT TAACCTATGC TGTAAATACC ATGAAGTCTC 
TCAGGTCTAT AACATTAAGG TTGAGACAGA AAATGGAAAT TATCTTGGTG 
TGTTTTGGTC CTTTTGACAA ATTACTTCGC T G AT AAGAAA ATCTTTGAAG 
CGGCTATGCC AACATTTTGA TTCTGAAAGA TGCCTCTATA TTCTCCAAAT 
TCCTGATTTA TTAAAAGGGG ATGTTGTCGC AAATGATAAT ATCG AGT AT A 
TAATATTAAA ATCTCTTCAG ATAGTGAATT GGAGTCAGAT GTTGACGGAG 



TCATGACAGA 
AAGCTAGCCA 
AAAACAAGGA 
TATCCGTCAT 
TCAAAGCGCG 
ATAAATCAGA 



60 
120 
180 
240 
300 
360 
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TAACCTACCT 


o 1 AbAAA 1 1.A 


i AftlYVTAGC 
nnu 4 *— W i /«j*— 


TCAGGCAGTA 


GAAGTATTTT 


CAAAACCGAA 


420 


ft #*■ TV /^r* ft 1»f ft 

AGAGGATTAU 


ff»ft*T*ft*T'ftf > ft/"*ft 
TATATAbAGA 


AAGccrrrrr 


X MAUU«> 1 i 1 I 




AAAAGATAGT 


480 


»^J^fP* ft ft ^ ft 

TCCTTTAACA 


ALWiALA I 11. 


W 1 4\^^/WA » M 


U A 1 A A AvrWIA 


AATACTATAC 


TGGATT C ATT 


540 


GAGTTTGAAA 


ACGTTTGCGT 


ft ft ft k<iwny;Kt 
AAAA* I lw/w\ 


ft ft f^ ft ^^P^P*17 




TTGATGGAAT 


600 


TGAGTGCTAT 


TTACCATAGG 


CCTGAGTuuu 


Au 1A1 uAW 1 /% 


TPTTT1T1 ft f j 
1 \» 1 1 4 Al *W\v» 


CATAAGAAAC 


660 


TCCATATTCG 


AATTCGAACT 


ft ft ft ft ft r^t^j^F^. 
AAGAAAQuGu 


ir*W!llifi 


V. f% 1 LAaL 1 1 V* 




720 


ACCCTTTTAT 


CTTTATCGAG 


GAGTTTTATC 


ft ft f» ft ft ft ft 
AGGATACAAA 


ft /"» ft ft ft fl*f /*» *pf* 

AGAAATGuTL 


AAVfn 1 /WIW 1 1 


780 


CTGGTACCTT 


ATTTGACCAT 


TGGCAGGTTG 


AAGTGTCAGT 


TGACTTTGCA 


1 /V iUwAu 1 


840 


ATCTCTTTGA 


GCTCAGAGAT 


ACAGAAGGTC 


AAAATATTTT 


/ '»«»■ ft ifvypr ftl* 
UiA IvrOCuA 1 


AAnoLrO lulu 


900 


TGGAAAATTC 


TCTAGAAAAT 


CTTCATGCAA 


TTGGGAATGG 


ATTTAAGTTG 


CCTTAGCTTC 


70V 


ATCAGATTGA 


TGCCTGCAAG 


gTTCCTGACT 


GGGTTTCAAA 


TACGGTATGG 


«*» ax ffV^ ft ft T* >V*1*- 

TATCAGATAT 


4U1U 


TTCCTGAAAG 


ATTTGCCAAT 


GGCAATGCTC 


TATTAAACCC 


AGAAGGGACT 


f|*fT* ft/^ ft /*T^flP 


lUDv 


ATTCATCTGT 


CACACCTAAG 


AGCGATGATT 


TCTTTGGTGG 


TGATTTACAG 


OuuATTA liu 


ii JA 


ATCATATGAA 


TTACTTGCAA 


GACTTGGGTA 


TTACTGGACT 


ATATCTTTGT 


r*r*r* ft T*f**f r F**TV* 
(.LwA H.1 1 lu 


i *> a n 


AATCTACAAG 


CAATCACAAC 


TACAATACGA 


*"* ftf* ft f**T* ft y «iwn 

CAGATTACTT 


1 V* AAA 1 1 UAL 


^_ vj l v_/\ i i i 


12 60 


CAGACAAGGA 


GACCTTTCGG 


GAACTGGTGG 


ATCAAGCGCA 


TCATCblGGC 


ft IV" & & if"*T l #** A 
A IGaAAUIVA 


i lift 
UlU 


TGCTGGATGC 


GGTATTTAAT 


CATATTGGTT 


CGCAATCTCT 


TwAATGGAAA 


/vn lu 1 1 


1380 


AAAATGGTGA 


AC ACTCTGCT 


ft* ft ^ S ft ft •¥*•*• 

TATAAGGATT 


Mm#f>n#< ft*** A T* 


ipp ft ft O i AT" 1 ^ 




1440 


CTGAAAAGCT 


AGTTAATAAG 


AGAGACTTAC 


CCTATCATtiT 


TTTTGCfTTTC 


ft ACfi AfTATA 
unwuA^ 1 /> 4 r\ 


1500 


* CCCTAAGCT 


AAs\T AC AtJ I. C 


AA i LL/\UAI#W 




■ wi 1 • 1 /wv\u 


GTTGCCACTT 


1560 


ATTGGATTGA 


fti^*y»i h^k ftf* 
AGAGTTTAAT 




Wl?*»V* 1 I 1 Vji_»/\ 


TRTGCIf^TAAT 
1 Vf 1 1 1 


GAGATTGACC 


1620 


ATCACTTCTG 


GAAGGATTTT 


CCTAAGGCAG 


TTTTAGCTAA 


AAATCCTGAT 


CTTTATATCC 


1680 


TAGGAGAAGT 


CTGGCATACA 


TCTCAGCCTT 


GGCTAAATGG 


AGATCAGTTC 


CATGCCGTCA 


1740 


TGAATTATCC 


TTTATCTGAT 


AGTATCAAGG 


ACTATTTCTT 


ACGACGAATT 


AAGAAGACAG 


1800 


ACCAGTTCAT 


CCATGAAATC 


AATGGAGAGT 


CTATGTATTA 


CAAGCAGCAG 


ATTTCAGAGG 


1860 


TCATGTTTAA 


TCTCTTGGAT 


TCACATGATA 


CAGAGCGAAT 


CCTGTGGACG 


GCCAATGAAG 


1920 


ATGTTCAACT 


GGTTAAATCA 


GCCTTAGCCT 


TTCTCTTTTT 


ACAAAAAGGA 


ACACCGTGCA 


1980 


TTTATTACGG 


AACCGAGCTA 


GCCTTGACTG 


GAGGACCAGA 


TCCAGATTGT 


CGTCGTTGTA 


2040 


TGCCTTGGGA 


ACGTGT ATCA 


AGTGACAATG 


ATATGCTGAA 


CTTTATGAAG 


AGGCTGATTA 


2100 
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AAATTCCGAA 


ATACGCGTCA 


GTAATCATTT 


CGCATGGCAA 


GTATAGCCTT 


CAAGAAATCA 


2160 


ACTCTGATCT 


AGTAGCTCTG 


GAATGGAAAT 


ACGAAGGACG 


GATCCTCAAA 


GCAATATTCA 


2220 


ACCAATCAAC 


AGAAGATTAT 


CTTTTAGAGA 


AAGAAGCAGT 


AGCACTAGCA 


AGCAATTGCC 


2280 


AAGAATTGGA 


TAATCAGCTT 


GTCATCTCTC 


CAGATGGATT 


TATGATTTTC 


TAAAAACTAG 


2340 


TTGATGAAGA 


TTATGGTACA 


TTTCATACCT 


TATATACTAT 


AATAACGCTA 


GTTACTAAAC 


2400 


TTGTAAAGGA 


GAACTTAAAT 


GAATTGTAGA 


GGACATGAAA 


CAACACAAAC 


AATTGTTAGA 


2460 


CATTTTGAAG 


TTCAGCCTAA 


AGCACATATT 


AAGCTGTTAG 


CAAATCAACA 


AAAACATAGT 


2520 


GATGCAGGAG 


CAACTATTGA 


AGATGAATAT 


TATGTATTTA 


TCGCTGAGAG 


TAAAATTGAT 


2580 


GGCAAGAAGG 


AAGTTATTCA 


GTGTTGCATG 


GGTGCGGCAA 


GGGATTTTTT 


AGAACTAATT 


2640 


AATCACAAAG 


GCCTACCTCT 


TTTTAATCCG 


CTTGTAGGTG 


ATTCTCATGT 


AAATAATAGA 


2700 


CAACAATATG 


ACAATACAGG 


GAGTGGAAAT 


TTATAACCTG 


AAAAGTGGAA 


TGAAACTGCA 


2760 


AAGCAGCTTT 


ATAATGCTAT 


AATCTGGTTC 


ATTATTTTAT 


GGAATGCTAA 


GCCGG AT ACA 


2820 


C CTTTATTTA 


ATTTTAAAGA 


CGAACTAATT 


AAGTATAAAA 


CATATGAGCC 


TTTTGAAAGC 


2880 


AGTATAAAAA 


GAGTAAATAC 


TACTATAAAG 


AATGGTACTA 


AAGGGAAAAC 


TCTGACTGAC 


2940 


ATGATTAATG 


GCTACAGAGC 


GGATAACGAT 


ATTAGACATG 


AAATTTGTAA 


CTTTAATATT 


3000 


CTGAAAAATA 


AAATTCCTGA 


TATGAAAAAC 


CAACAAGGAA 


ATACAATGGA 


ATCTTACTTT 


3060 


TAGTTATTGT 


TGAATTTTGG 


GTATTCTATA 


AAATATCCTA 


ATTCAGATTT 


AAATAGTAGA 


3120 


CTATACAATA 


TAGTTAAAAT 


ATCAGTAAAA 


ACAACACTTT 


ATTCAGGTAT 


TGGATACGCT 


3180 


TTGCTAATAG 


CCTAATAATC 


ACATCTGGAG 


TGTTGCTACA 


ACGAAAAAGG 


TGATAATCCT 


3240 


TGATTTCAAG 


CT ATTTT AT A 


AGCATTTTGT 


CTTTGTAGAT 


AAAGGCAATT 


TTGACAATAA 


3300 


AAATCCTAAA 


AGCTGAATCG 


TTATAGATGT 


ATTTGTAGAT 


ATCGTTTGCG 


CATCGAAAAA 


3360 


ATTAATACAA 


GAATAAATAT 


TTATAGCTCT 


TTAGGTGACT 


TTTATAGAAG 


TAAAGTTTAG 


3420 


CATAGAAAAA 


CAAGAAATAA 


CGCACCATTT 


TTGCTGCGTT 


ATCCTTTTTT 


ATGCTATAAT 


3480 


GCATTTATAA 


AAATAAAGGA 


GTTTCCTATG 


ATTGGAAAGA 


ACATAAAATC 


CTTGCGTAAA 


3540 


ACACATGACT 


TAACACAACT 


CGAATTTGCA 


CGGATTGTAG 


GTATTTCACG 


AAATACTCTG 


3600 


AGTCGTTATG 


AAAATGGAAC 


GAGTTCAGTC 


TCTACCGAAT 


T AAT AG AC AT 


CATTTGTCAG 


3660 


AAGTTTAATG 


TATCTTATGT 


CGATATTGTA 


GGAGAAGATA 


AAATGCTCAA 


TCCTCTTGAA 


3720 


GATTATCAAT 


TGACTTTAAA 


AATTGAAATT 


GTGAAAGAAA 


GAGGTGCTAA 


TCTATTATCT 


37B0 


CGACTCTATC 


CTTATCAAGA 


TAGTCAGGGA 


ATTAGCATTG 


ATGATGAGTC 


TAATCCTTGG 


3840 


ATTTTAATGA 


CTGATGATCT 


ATCTGATTTG 


ATTCATACGA 


ATATCTATCT 


AGTAGAAACT 


3900 
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TTTCATGAAA 


TACAGACATA 


TAGTGGCTAT 


TTGGATGGAA 


TTGAACCTAT 


GTTAGACATA 


3960 


TCTCAAAAAC 


GGATGCTGGC 


CTAATGGAAA 


TCCAAGATTA 


TACTGATAGT 


GAATTCAAAC 


4020 


ATCCTTTAGC 


AAGGAATCTT 


CGTTCACTGA 


CAAGAGGAAA 


AAACTCCAGT 


AAGCAACCTA 


4080 


TAGCCATTTT 


GCTTCGAGGG 


CAAACTGGTG 


CCGGTAAGAC 


TACAATTCAT 


CGTATTAAAC 


4140 


AGAAAGAATT 


TCAAGGAAAT 


ATTGTTATCA 


TAGATGCTGA 


TAGTTTTCGT 


TCTCAGCATC 


4200 


CACACTATTT 


AGAACTGCAG 


CAAGAATATG 


GCAAAGACAG 


TGTAGAATAT 


ACCAAAGATT 


4260 


TTGCAGGAAA 


AATGGTAGAG 


TCTTTAGTAA 


CAAAATTGAG 


TAGTTTGAGA 


TACAATCTTT 


4320 


TGATAGAGGG 


AACTTTACCA 


ACAGTTGATG 


TTCCAAAGAA 


AACAGCACAA 


CTCTTGAAAA 


4380 


ATAAGGGATA 


TGAAGTACAA 


TTGGCCTTAA 


TTGCGACAAA 


CCCTGAATTG 


TCCTATCTAA 


4440 


CTACTCTTAT 

\J A A ^ A A r\ A 


CCGTTATGAA 


GAACTGTACA 


TTATCAATCC 


AAATCAAGCA 


CGCGCAACTC 


4 500 


CAAAACAACA 


TCATGATTTC 


ATTGTAAATC 


ATCTACTTGA 


TAACACACGA 


AAATTGGAAC 


4560 


AACTAGCTAT 


CTTTCAAAGA 


ATTCAAATTT 


ACCAACGAGA 


TAGAAGTTGT 


GTATATGATT 


4620 




TACAACTTCA 


GCAGCAGATG 


TTCTTCAAGA 


GTTACTCTTT 


GGGGACTGGA 


4680 


CTCAGCTAGA 


CAAGGAGATG 


TTGCAGGTGG 


GGGAAAAGAG 


ACTTAATGAA 


TTACTTGAAA 


4740 


AATAAACAAT 


TGATATTTTT 


AGGAGAATAG 


AAATGAGAGG 


GTTTAATAAC 


AAG AT AAAGT 


4800 


CTGTTT ATC A 


AGAACTAACA 


AATTCCAAAG 


ACAAATTCGC 


TACCTTTCAC 


AAGACTTTAA 


4860 


TTCATTTCCA 


TACACCTGTT 


TCTTATGATT 


ACAAGCTATT 


TTCTAATTGC 


ACTGCAACGA 


4920 


AATATAGAAA 


AATTACTGAA 


GATGAACTAT 


ATGATATATT 


TTTTGAAAAT 


AAGAAAATAA 


4980 


AAGTTGATAA 


GACAATTTTT 


TTTAGTAATT 


TTGATAAGGT 


TGTTTTTTCT 


ACTTCAAAAG 


5040 


AATATATTAC 


TTTTCTTATC 


TTAGCACAGG 


CAATCATAAA 


AAATGGAATA 


GAAATAGTTG 


5100 


TAGTAACTGA 


TCATAATACT 


ACCAAAGGTA 


TTAAAAAGTT 


ACAAATGGCA 


GTCTCAATCA 


5160 


TAATGAAAAA 


TTATCCGATT 


TATGATATAC 


ATCCTCATAT 


TTTACATCGA 


GTAGAAATTA 




GTGCAGCAGA 


TAAATTGCAT 


ATTGTATGTA 


TATATGATTA 


TGAACAAGAA 


TCATCGGTTA 


5280 


ATCAATGGTT 


AAGTGAAAAT 


ATTATAAGTG 


AGAAAGATGG 


AAGTTATCAA 


CATTCACTGA 


5340 


CTATAATGAA 


GGATTTCAAT 


AATCAAAAAA 


TAGTTAACTA 


7 ATTGCTC AT 


TTCAATAGTT 


5400 


ATGACATTTT 


GAAAAAAGGT 


TCTCACTTAT 


CAGGTGCATA 


TAAACGAAAA 


ATTTTTTCTA 


5460 


AAGAAAATAC 


ACGATTTTGG 


AGTTTAATAT 


TAACTCGAAA 


GAATCTTCGC 


AACAACTTGA 


5520 


TATTCTCTAT 


AAAGAAGTTG 


GTGTATTAAG 


TTTGGGACAA 


AAAGTTGTAG 


CCATCCTTGA 


5580 


TTTTTTATTA 


GCATATAGTG 


ATTATTCTAA 


AGACTTCAGA 


CCATTGATTA 


TTGATCACCC 


5640 
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GTTATATTTA 


518 
CAGGCATTTA 


GTTCACCACT 


TTAGAGATGT 


5700 




CGTCAAATTA 


TTTTAGCAAC 


ACATAATCCT 


ACAATTGTAA 


CAAATTCTAT 


5760 






TGGAGTCAGA 


TGGAGTTAAC CGATGGATTG 


AATCACAGGG 


5820 


AT ATGTTAvj 1 


Ci&AAATATA 


TAAAAAATCA 


TATCATCAAT 


CAATTAGAGG 


GAGGAAAAGA 


5680 


TTCCTTCAAG 




CTATATATGA 


GACGGCTTTA 


TCAGAGTAGA 


GTCAGAAAAA 


5940 


s • ft HOW ft ^ & 

CTACGTTAGA 


W\i 1 IMVjLLJ 


Hf^TTTTTTCT 


TTGTCCGACA 


GGCATAGTGT 


ACATCTGAGG 


6000 


TCCAAGTCCT 




TTRfTCCAGA 


TGAAACCAAT 


AGCGACTCCT 


AACCCTGAAT 


6060 


ATCGTGAGCT 






AGCGAAATCA 


AGGTTCTACA 


AACAGAATCG 


6120 


TGACTTGAAG 


C CAT AT AT At* 




ACTCTAAAAT 


CCAAATAGGT 


GTCCTAACCT 


6180 


ATATACGTAA 


* MM* ft f+f* ft ^ ft ^ 

ATTACGAGAu 


TAAAw 1 nww/\ 


AAGATGTACG 


GCTTATTCCG 


TGAGCCTTTA 


6240 


GGACGTAGTA 


* ft f*^* ft ft ft 

CAACGAATwA 


rrYVV AtTTf" AG 


CTGAACACAT 


AGTATTGAAG 


AAATTTCTGT 


6300 


AATGGAAATG 


*» 9k ft ft^ ft ft 

GAGCGAAGAA 




AAATGAATAC 


CTCTCTAATT 


AAATTTGTCA 


6360 


ATTCTAATTC 


CTCCTATGAA 


ft ft ft f» ft fWt\ A 
AAuALAu 1 vj/\ 


CCTGAAAATG 


TAAACGATGG 


GACCTGATCA 


6420 


TAAATATAGG 


ACGCTACATG 


UAuTuu Ivi 1 1 


AGAGATTAGT 


CCTTACTTGA 


TTTCTGATAA 


6480 


CTTCCCCAAA 


TTTCTTCTGC 


T AT AC TTTTC 


TCAACTTTTA 


AAAATCCAAC 


TAAGAATTTT 


6540 


ACCTGGGGGT 


TTGGGGGC GG 


ti r*r* ft. r"T & Af**T 


TATCTTATCG 


TTAGCTCTCA 


AAACTGGTAG 


6600 


GTTTTGATAG 


GCTGCCGATA 


I Vj/V 11111 V*U 


GATATTCTCG 


ACACAATATC 


TGAGCTCGCA 


6660 


AACCCTTACA 


AGAATGAAAA 


TLAul lul iv* 


GAAAAGTGTA 


CTGACATTCT 


ATGGTACCTC 


6720 


ACATTCTCAG 


TACAAGTATT 


TTr/*i & Ann A 


AGTAGCAGTA 


TGAAACGAGA 


TCTGCGTGAT 


6780 


ATTCGGAAAC 


AATTTCCTTT 


aACACAAGCA 


GAAGAAAAGC 


AAATTCTAGC 


TTTCATGAGA 


6840 


G AGCGGGGAG 




C^fTGATTTT 

*u * W * Ur% a * * * 


CTTCGTAAAA 


GTTTACTTTC 


CTCTCATTTA 


6900 


CAAAAACAGA 


■iw^ ft ft ^ ft *T*f* 

TGGAGACATw 


RTTTCCCCTC 
^ 111 uw^w * 


TGGCAATCCC 


AAAAACTACA 


ACAAATCAGT 


6960 


CGTGACGTTC 


ATGAAGTTTT 


AATCTTGGCA 


CAGTCAGAAC 


GTCAAGTCAC 


CCAAGAGCAT 


7020 


GTATCTATTC 


TCTTAACGTG 


f+ ft /»»r*» ft ft 

CGTGCACUAA 


TTGATTCAAG 


AGGTTGCAAA 


CACCATACCC 


7080 


CTCAGTAAAG 


AATTTCGTGA 


GAAGTACATG 


AGGTAAGCAC 


ATGGAACATC 


GTTACCGAAC 


7140 


CAATCTCAAG 


AAAGTCTTTT 


TGTCTGATAG 


TGAGTTGAAC 


CAACTAAATA 


TAAATATCGA 


7200 


TCAAAGTGGT 


TGTAAATCCT 


tttctgaata 


TGCGAGACGA 


ACTCTACTCG 


ATCCTGGTAT 


7260 


GAATTTTATC 


ACGATTGACA 


CAAACGGTTA 


CCAAGATTTA 


CTCTTTGAGT 


TAAACAGGAT 


7320 


TGCCAATAAT 


ATCAACCAGA 


TTGCTCGAAG 


TGTTAATCAA 


TCTCAGTTAA 


, TTTCTGGTGA 


7380 


AG AATTGC AG 


GAGTTGAAAA 


. AAGCAATTGG 


TGAATTGATA 


AAAGAAGTTG 


I ATAAGGAATT 


7440 
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TAATCTGCAA 


GCCCAGAAGC 


TAAAGGAGTT 


CCATv»GTv.A\i 




TTTGCCATTC 


7 500 


ACGGAAAGAG 


TTACCGCAGA 


AAGCTTATCA 


ACTACATTCT 




AAAACCAATA 


7560 


ATCTTGCCTT 


GGTGTCGGAC 


TATGGCATGA 


AGAATTTTCT 


GGACTTTLL I 




/ Div 


AAATGGTGCA 


GATGTATCAT 


GAAAATTTCA 


TCAGCAACGA 


TACGCTTI Av. 




7CDA 


ACGACAGGAT 


GGAAGAAAAT 


CAACGAAAAA 


TACACGCTCA 


ft ^ ft it 
CCACATCATT 


cagtctttct? 


* 9 4U 


CGCCAGAGGA 


TCATATCACT 


CCTGAACAAA 


TCAATCGCAT 


ft ^^^VWW ft Ik ^ 

AGO TT AT GAG 


AliuiuAAGG 


7800 


AATTAACTGG 


TGGCAAATTT 


CGTTTTATCG 


TTGCGACCCA 


TGTTGATAAA 


GACCACCTGC 


*i o en 
/oou 


ACAATCACAT 


CATTATCAAT 


TCAGTAGATA 


GCAATTCTGA 


CAAAAAGCTC 


AAGTGGGACT 




ACAAGGTGGA 


GCGAAATCTT 


CGCATGATTT 


CTGACCGTTT 


TTCTAAAATC 


GCAGGTGCTA 


7980 


AAATCATTGA 


GAACCGCTAT 


TCTCACCAGC 


GGTATGAAGT 


CTATCGTAAG 


ft s^+ls ft m #v%^^ ft ft 

ACTAATCACA 


8040 


AGTATGAACT 


CAAGCAGCGA 


CTCTATTTTT 


TGATGGAACA 


TTCTAGCGAC 


ffeuMM/* ft ft AMI 

TTTGAGGATT 


8100 


TCAAAAAGAA 


TGCTCCGCTA 


CTACATGTGG 


AGATGGATTT 


CCGTCACAAG 


CATGCCACCT 


8160 


TTTTTATTAC 


GGACTCAACT 


ATGAAACAGG 


TGGTGCGTGG 


CAAGCAACTC 


AATCGCAAGC 


8220 


AGCCTTACAC 


AGAAGAATTT 


TTTAAGAACT 


ACTTTGCCAA 


AAGAGAAATA 


^ ft ft ft / ^f'/™* ft 

gaaagtctca 


o^on 

D«OU 


TGGAATTTTT 


ATTGCTGAAA 


GTTGAGAATA 


TGGATGATTT 


ACTTCAGAAA 


GCAAAACTTT 


834 0 


TTGGACTAAC 


TATCAATCCT 


AAACAAAAGC 


ATGTTTCTTT 


TCAATTTGCA 


ft **0m+f* ft 

GGAGTGGAGG 


8400 


TAAAGGAGAC 


AGAGCTAGAC 


CAGAAAAATC 


TTTATGATGT 


AGAGTTTTTC 


CAAGAli ATT 


B40U 


TTAAAAATAG 


AAAAGATTGC 


CAAGCTCCAG 


AAACTGAGGA 


TTTCGTTCAA 


CTTTATCAAG 


DC") A 


AAGAAAAGTT 


ATCCAAAGAA 


AAACAACTTC 


CAAGCGATGA 


GAAGTTCTGG 


GAGTCCTATC 


OCBA 


AAGACTTCAA 


GACTAACAGA 


GATGCCCTTC 


ATGAA i ; iGA 






O 04 w 


AAATTGAAAA 


ACTAGTGGAT 


GATGGAATTT 


ACGTCAAGGT 


CAAU i i i\tvT 




8700 


AGGGACTTAT 


CTTTGTCCCG 


AACATGCAGC 


TTGATATGGA 


AGAGGATAAG 


CTCAAGGTTT 


8760 


TCATCACGGA 


AACCAGCTCC 


TACTATGTCT 


ACCACAAAGA 


CGCTGCCGAG 


AAAAATTGTT 


8820 


ATATGAAAGG 


TCGAACCTTA 


ATTAGACAGT 


TCAGCTATGA 


AAATCAAACC 


ATTCCATTAC 


8880 


CCAGAAAACC 


GACAGTCCAT 


ATGATTAAAG 


AGAAGATTGC 


GGAAGTGGAT 


GCTTTGATTG 


8940 


AACTGGAAGT 


AGAAAATCAA 


TCTTATGTCA 


CGATTAAAGA 


TGAGTTAGTG 


CATGAACTAC 


9000 


CAG CGTCTGA 


ATTGAGAATC 


AATGAGTTGC 


AAGAACGAAT 


GTCAACCTTG 


AATCAAGTAG 


9060 


CAGAATATCT 


ACTGGCTTCA 


GTTCAAAGTA 


AGCAAGAAAT 


GAAATTAAAT 


CTTTCAAAAC 


9120 


TGAATATAAC 


TGAGAATATC 


AGTGCTAATA 


TTGTTGAGAA 


AAAATTGAAG 


AGCCTGGGGA 


9180 
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ATCAACTGGA ATTGGAAAGG GGCAGGTATG AAAAGATGGT ACT 9223 
<2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



TCTGCTGGCT 


ACCATCATCT 


GACTTGGGCA 


AGACCAAACT 


CTTAGTTACA 


ACTGTATTCT 


60 


TCTCAGCATT 


TTCAATAACT 


GGCAATGCCG 


ACTGAACCGT 


ATCTTTTTCT 


GTTTTTGTAG 


120 


CTGGTCCAGT 


TTCTTTTTTC 


TGTCCGCAAC 


CAACCAGGAC 


AAAAAGGAAA 


GCTAGACTAA 


180 


CAAGAACTAT 


TTTTTTCATT 


TCTTTCTTCT 


TTCTTTTTGA 


AATTAAAATA 


GAATAAGACT 


240 


GGGAAGTGCT 


CCCAGCCTTG 


ATGTTTATAG 


AGCTGCACGC 


AAACGTGCTT 


CTGCATTTTC 


300 


TACATTACGG 


ACAGAGCGTG 


GTACGAAGGC 


ACGAATATCG 


TCTTCCTTGT 


AGCCAACTTG 


360 


CAGGCGTTTT 


TCATCTACAA 


GGATTGGGCT 


CTTTAAAATT 


CTCGGTGTTT 


CCATAATCAG 


420 


ATTGAGAACT 


TCATTGACAC 


TCAAATCTTC 


AATATCCACT 


CCAAGGGCTT 


TGGCATAGCG 


480 






THflfTATTCC 

X OV*V. k ft 1 I ^ ^» 


CTTATCTGTT 


TTGCTTAGAA 


TATCCACTAA 


540 


TTCTTCTCTC 


GTAATTCCTT 


CTTTACCAAG 


GTTTTGTTCT 


TTATAACTTA 


ACTGGTGGGC 


600 


ATTGAGCCAC 


GTTTTTCCTT 


TTTTACAGCT 


AGTACAACTT 


GAGACTGTAT 


AAATTTTAAT 


660 


CATGTACCTA 


CCCCTTTCCC 


TACATGTTAC 


TATCAGTTTA 


GTCTATTATA 


CCATAAAAAA 


720 


CATCCGACTT 


GCGACCTATT 


TTTAATTTTT 


TTTGACTTTT 


TTCCTCATTT 


TCGTACTTTT 


780 


TTCTTGACAA 


ACAACTAAAT 


GACTATCAAC 


TCTTTTGGAG 


CTAGGCTCAA 


TAATTCACAA 


840 


CCTGTCTCTG 


TAATCAGGAT 


ATCATCCTCG 


ATACGAACGC 


CATATTTCCC 


TTCGATATAC 


900 


ATACCTGGTT 


CATCCCTCAA 


GGCCATACCT 


CTCTTAATAG 


TTTCTGTAGA 


AGTCTGACTA 


960 


AAGTAGCCTT 


CCTCATGGAT 


ATCCAGACCA 


ATACCCTGCC 


CAATGCCGTG 


AGTAAAGTAG 


1020 


TCACCATAAC 


CTGCCTCAAT 


GATAATATCA 


CGAGGGATTT 


TGTCAAAGTC 


ACGGAAACCT 


1080 


AAGCCTGCCT 


TAGCTTGGTC 


AATCAAGGCT 


TGGTTAGCTT 


TTAGAACCCT 


ATTGT AAATC 


1140 


TCTGCCTGCT 


CATCGCTAAC 


ATGCCCTAGA 


TAGATAGTCC 


GGGTCATATC 


ACTGACATAG 


1200 


TGGTCATAGA 


GACAGCCGAA 


GTCCATGGTG 


ATGCCTTCTC 


CCAACTCCAC 


TGCTTTGTGC 


1260 


ATTGGATGGG 


CATGGGGTTT 


AGAAGAATTG 


ATACCGCTAC 


CTAGGATCGT 


ATCAAAAGAT 


1320 


AAGCCAGATG 


CTCCCAACTC 


ACGCATGCGG 


AAATCAAGGA 


AGTTGGCAAT 


CTCAATTTCA 


1380 
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GTTTTTCCTC 


Gil I GAT AAA 


GTCAAGCGCA 


T(.\«V.\rGAAAG 


i »|W|'/V ,l TP^*A 


On A AuAAww 


1440 


CCCTTGCGAA 


TCGCTGCAAT 


CTCTGCCTCA 


TCCTTAATCA 


TACGAAWAWW 


TTtCACAAAC 


A 3Uv 


TGAGTTTGTG 


GAAGCAAGTT 


CAAACCTGCA 


AAAGCTGCCT 


CCATACGGTG 


UTAATAACAW 


1 SOU 


ACTGAAATCT 


CATCTTCAAA 


ACCGATACGA 


GTCAAGCCCA 


TGTCCTTAAC 


AAl I LLITjCA 




ATGACAGCCA 


ATTCATCACG 


ATCAGCCACA 


ATCTCAAAAC 


CACTGGTTTC 


.TTGCTTAGCT 


i can 


GCGATGATAT 


AGCGAGAGTC 


TGTCACTAAG 


ACCTGACGGT 


CACGACTGAT 


AAAGACTGTT 


1740 


CCGTTTCAGC 


CCCAAAAACC 


ACTCAAATAA 


TAGACGTTTT 


TAAGATTGTT 


GATGATGATA 


lbUU 


CCATCTAGTT 


CTTTTTCTTG 


CATTTTAGCT 


AGAAATGCTT 


GTACGCGTTT 


ATTCATGATG 


1 obu 


TAACTTTCCT 


TTCAAATAGT 


GT CCTGT AT A 


GCTGGCTTCG 


TTGGCAGCTA 


CTTCTTCTGG 


1920 


AGTTCCTCTT 


ACGATGATGG 


TTCCACCACC 


GACACCGCCC 


TCAGGTCCCA 


AG TC AATGAT 


1980 


ATGCTCTGCC 


CTCTTGATAA 


CATCCAGATT 


GTGCTCCATC 


ACGAGGACTG 


TATTGCCATC 


2040 


CTCTACAAAG 


CGAGCTAAAA 


CCTTGAGCAG 


GCGAGCAATG 


TCCTCTGTAT 


CAAGCCCTGT 


2100 


CGTCGGCTCA 


TCCAGAATCT 


AGAAAGATTT 


TCCTGTCGAT 


CCTTTGTGGA 


GTTCGCTAGC 


2160 


T AACTTC AT A 


CCTTGGGCTT 


CTCCCCCAGA 


AAGGGTGGTA 


CCTGGCTGTC 


CCAAGGTCAC 


2220 


ATAGCCTAGC 


CCTACATCCT 


TGATGGTCTG 


GACTTTGCCT 


TGAATTTTCC 


CAATGTGTTG 


22 80 


GAAAAATTCT 


ACCGCATCGT 


TGACCGTCAT 


ATCCAAGACC 


TGCGAAAT AT 


TCTTTTCCTT 


2340 


GTAGTGAACT 


TCTAGGGTTT 


CACTGTTATA 


GCGGGTTCCG 


TGCCAAACTT 


CACAAGCCAC 




ATAAACATC7 


GGCAAGAAGT 


GCATCTCAAT 


CTTGATAATC 


CCGTCACCTG 


AGCAAGCTTC 


2460 


ACAGCGACCT 


CCCTTGACGT 


TGAAACTGAA 


GCGCCCCTTC 


TTGTAGCCTC 


GAATCTTGGC 




TTCATTTCTC 


TGAGCAAAAA 


GGTCACGTAT 


ATCGTCAAAA 


ACTCCTGTAT 


% <-i » i—f*»p/™/^ 
AGGTAUCTGU 


■jean 


GTTAGACCTC 


GGCGTCCGTC 


CGATAGGGCT 


CTGGTCAATA 


TCAATCAAAC 


GGTCOACATaj 




CTCAATCCCT 


GTAATACTCT 


TAAACTTACC 


AGGTTTGTCT 


GAATTACGGT 


TGAGCTTCTG 


2700 


GGCAATGGCT 


TTTTTGAGAA 


TGCTGTTGAT 


TAGAGTCGAT 


TTCCCTCAAC 


CCGACACACC 


2760 


TCTCACTGCG 


ATAAATTTTC 


CTAGTGGAAA 


GCGAGCCGTG 


ACATTTTGCA 


AGTTGTT CTC 


2820 


ACGCGCTCCT 


ATCACTTCAA 


TAAAACGACC 


ATTTCCGACA 


CGGCGCTCTT 


CTGGTACTCG 


2880 


GATGACACGT 


TTGCCTGACA 


AGTACTCACC 


TGTGATAGAC 




GACCCACTTG 


2940 


CTTAGGTCTA 


CCTGCTGCAA 


CAATCTCACC 


ACCAAAAACA 


CCGCCACCAG 


GACCAACGTC 


3000 


AATCAGATAA 


TCAGCCTCAC 


GCATGGTATC 


TTCGTCGTGT 


TCCACCACGA 


TAAGAGTATT 


3060 


GCCCAAGTCA 


CGCATCTTTT 


TCAGACTGGC 


AATCAGGCGA 


TCATTGTCCC 


TCTGCTGAAC 


3120 
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ACCGATTGAC GCCTCGTCTA GGATATACAG GACACCTGAT AGGTTGGAAC CAATCTGGGT 3180 

TGCCAAACGA ATGCGCTGAC TTTCCCCACC TGAAAGGGTT CCTGCTGAAC CTGACAGGGT 3240 

TAGATAGTTA AGACCCACAT TATTAAGGAA GGTCAAACGA TCCTTGATTT CCTTGAGAAT 3300 

GGGACGAGCA ATGAfGGCTT CATTTTCACA CAAAGTTAAC TGGCTCACCA AGTCCAAGTC 3360 

GTCAGCGATA GACAGGTCTG AGATTTCTCC AATATGTGGC CCTTGCTGGC CGCCCACACG 3420 

GACAGACAAG GCCTGGTCAT TGAGACGATA GCCTTGACAG CTTCCGCAGG TCAGCTCATT 3480 

CATCTAGAGA CGCATCTGAG TGCGAGTGTA ATCGCTATTG CTTTCATGGT AACGACGTTT 3540 

CATATTATTG ATAACTCCCT CAAACGGAAT GTCGATATCG CCCACGCCAC CAAATTCATT 3600 

CTCATAGTGG AAATGGAATT CCTTACCATC TGACCCATAG AGAATCAAGT TCTTATCTTC 3660 

TTCTGACAGG TCCTCAAAAG GCTTATCCAT AGCCACTCCA AAGACTTTCA TGGCCTGCTC 3720 

TAACATGTTT GGATAGTAGT TGGATGAGAT ACGATTCCAA GGTGCTAGCG CTCCCTCACG 3780 

TAAGGTTTTG CTAGCATCTG GCACTACCAA ATCAGTATCC ACCTCCAGCT TGATGCCCAA 3840 

CCCGTCACAC TCACTACAAG AGCCAAAAGG AGCATTGAAA GAAAAGAGAC GAGGCTCTAA 3900 

CTCTGGGACA CTAAAACCAC AAACTGGACA GGCATAATGC TCAGAGAACA ACAACTCCGA 3960 

GTCGTCCATG GTCTCCATAA TGACATAACC TTCTGCAATA CGAAGGGCAG CCTCAATGGA 4020 

ATCAAAGAGA CCACTACCAA TGCCCTCCTT GATAACAATA CGGTCAACCA CGACATCGAT 4080 

ATTCTCTTGC TTGCTCTTAG ACAACTCTCG CACTTCGGTC ACATCATACA CTTCCCCATC 4140 

CACACGGACA CGAACATACC CCTCTTTCTG AACCTTCTCC ATAACACTCT TATGTTGGCC 4200 

TTTTTTCTTC CGCATCACAG CAGCCAACAT CTCCAAGCGC TGGCGTTCAG GTAACTCCAA 4260 

AACCTTATCA ACGATTTGCT CCACAGAAGA AGCATTGATA GCTCCATCTC CGTTGATACA 4320 

GTAAGGCGTC CCCACACGTG CGTAGAGGAG ACGCAGATAG TCATTGATTT CAGTCGTCGT 4 380 

TCCCACCGTC GAGCGAGGAT TTTTACTAGT CCTTTTCTCG TCCATGGAAA TAGCTGGGCT 4440 

GAGACCATCA ATGGCATCTA CATCTGGTTT TTCCATATTT CCCAAGAACT GACGAGCGTA 4500 

GGCGGACAAA CTCTCTACAT AGCGACGTTG TCCCTCCGCA TAGAGAGTAT CAAAAGCCAG 4560 

ACTGGACTTC CCTGAACCTG ACAAGCCAGT CACGACAACC AACTTGTCTC GCGGAATCTC 4620 

CACATCAATA TTTTTTAAAT TATGGGCACG CGCCCCATGA ATGACAATTT TATCTTGCAT 4680 

CTTTGTTCTT TCTAGTCCAT TATTGCTTAC C ATT AT AC C A AAAAAAGTGA GATTCTATTA 4740 

CCCAAAAGGC CGATTTTGTA GTATAATAGT ACAGTGTGAA AAAATCTGAA AAATGAGAAA 4800 

GGATAAGGGA TATGAAACAA GTTTTTCTCT CTACAACAAC TGAATTTAAA GAGATCGATA 4860 

CGCTTGAACC GGCTACTTGG ATCAATCTCG TCAATCCCAC TCAAAATGAA TCACTCGAAA 4920 
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TCGCCAACAC 


CTT CG AT AIT 


GATATTCCTG 


ACCTTCGAGC 


ACCGCTCGAT 


GCGGAAGAAA 


4980 


T t iTCT CCT A^ 

1 V* i W 4 WW 4 Ft 4 


TACCATTGAA 


GACGAGTATA 


CCCTGATTAT 


CGTAGACGTG 


CCGCTCACGG 


5040 


&r^IAAACAAA 




TACTACGTAA 


CCATCCCGCT 


TGGTATTATC 


ATCACTGAGG 


5X00 


#Ww%^^nl iftl 


C ACT ACGTGT 


TTGGAACCAC 


TACCTGTCCT 


TGATCTCTTT 


ATCAACCGTC 


5160 


r.ATTCCCTAA 
w/t4 iv^vi/vi 


TTTCTATACC 


TTCATGCGTT 


CACGTTTTAT 


CTTTCAAATT 


CTTTATCGCA 


5220 






WA^m. a 1 ^Vi i A 


CAATCGACCG 


CAAGAGTGAA 


CAAATCGAAA 


5280 






\» k> A /V\ 1 


AATTGATTGA 


GCTCATGGAA 


TTGGAAAAAA 


5340 


CTATCGTCTA 






CAAATGAGCG 


CCTGATTAAG 


AAATTGACCA 


5400 


GTTCAACCAG 


LAAl A i LAnu 


& & AT AfVTTC 
AnnlALvt 1 IVl 


ACCACGAAGA 


CCTGCTTGAA 


GACACCCTGA 


5460 


TTGAAACCCA 


ACAGGCCATT- 


uAun 1 wutAu 


ATATTTATGG 


AAACGTCTTG 


CATTCTATGA 


5520 


CAGAUACCTT 


TGLV.TL iAl\. 




ACCAGAACAA 


CATCATGAAA 


ACCTTGGCCC 


5580 


TTGTCACCAT 


CGTCATviTCC 


f\ l Uwv,AALI.a 


*T*^^*M *"I "Tt" 1 

X UVJ 4 \— * A A 4 V* 


TCCCTACGGG 


ATGAACTTTA 


5640 


AGGATAATGA 


AATCCClLTA 


AALwAuAVtW 


CAAATCCCTT 


CTGGTTAATC 


GT C1TT ATCG 


5700 


CCTTTCCTAT 


VjALi I Vj I V_ 1 LO 




ATCTCATCCA 


TAAAAAATGG 


TTCTAAGAGG 


5760 


ACTTCCTATG 


mi k«i i/^* ft ft ft & m T*f** 


AIL 1 AJ-A/W* 


AT r AACTAAC 


AAAAACCAAG 


ACTTTGTCCA 


5820 


CATTGCTACC 


CAAuAAl 1 U.A 


1 A \9\M 


CAAAACAGAC 


GCTGAAATCC 


AG ACT ATTT T 


S880 




nl 1 Lt_ww*wv\ 


TCCTTGAGGA 


GCAATCTAAA 


GGTACAACTG 


CCCGTTCCCT 


5940 


ATACGGCGl-A 






CTTCACTGTC 


AAAGAGCAGT 


ACGAAAAAGA 


6000 


GCATLWAAAA 


ft Ja 71 HT^ATft 
UAAAA 1 i V* 




GATGATTATG 


GACTCAGCTC 


TTTTCATCAC 


6060 


I /lU>W 4 A * 4 




CCGCCCTCAC 


AACCTTCTTT 


GCGGCAGACC 


AACCTTTCGG 


6120 




ATTACTCTTC 

t\ A 4 J^\m 4 \* 4 * W 


TATTAGTTGG 


ACTGGTTGGT 


GGATTTGCCT 


TCTACTTGAT 


6180 


GTACTACTTT 


GTTTACCAAT 


ACTATGGACC 


AGATATGGAT 


CGCAGTCAAC 


GTCCACCTTT 




CTGGAAATCT 


GTACTAGTTA 


TCCTAGCTTC 


TATCTTCCTT 


TGGTTGCTTG 


TCTTCTTTGC 


6300 


AACAAGCTTC 


CTACCAGCTA 


GCCTTAACCC 


AGTACTGGAT 


CCATTGCCAC 


TAGCTATTAT 


6360 


TGGAGCAGCC 


CTCCTAGCCC 


TTCGCTTCTA 


TCTCAAGAAA 


CGCTTGAATA 


TCCGTAGTGC 


6420 


AAGTGCAGGA 


CCAACACGCT 


ATCAAGAATA 


AGAAAACGAT 


AAAAGCAACT 


GCAGGTGCGG 


6480 


TTGCTTTTTC 


ACTTACTTTT 


TTGAGTTATA 


TTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


6540 


CCTAGCTGCA 


GGTTGCTCAA 


AGCACACCTT 


TGAGGTTGCA 


GATAAAACTG 


ACGTGGTTTG 


6600 


AAGAGATTTT 


CGAAGAGTAT 


TAAAAGTATT 


CTTCTGAAAT 


CCCACATAGC 


TTTCTCTTAT 


6660 



i: 
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ATTTTGTG AT AAAATAGGCT CAATCTATTT CTAGCACGAT GACATATGCT TTCTACTATT 
GGTATTGTTA GTTTATCTAG TGGCATTATC GGAGAGGATT TTGTCAAACA CGAAGTGGAC 
TTGGCTATCC AACGTCTCAA GGATCTCCGA CTCAATCCCA TCTTTTT 
C2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11864 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



6720 
6780 
6827 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CTGGCTAGTT 


GCATAGAGCA 


AAGTTGCTTC 


TTCATCAACA 


AAACCCTTCA 


TTTCAAAATA 


60 


GGAAAGCAGC 


TCATCAGGAC 


TCTCCAAACG 


AATCCCTTTG 


TAATCCACCT 


CAACTGCCAC 


120 


CTCTTTCAAG 


GCTGCAAGAA 


GAAGTCTTCC 


CAGGCCCTGT 


CTCTGATGGT 


CAAACTCGAT 


180 


GACTAAAGAA 


TGTACTTTTA 


GACATTGCGG 


ATTCTCTGAC 


TGGGGACTTG 


ATAAAATATA 


240 


CCCTAAAACT 


TGATTTTCAT 


CCCTAGCTAG 


AAGAAAGGTA 


TCCGCACACT 


TACGGATACT 


300 


TTCTTCTAAA 


ATATGGGAAA 


GTTGCTGCTT 


TTCAGCTGGA 


AAACACGACG 


TCTGAAGTGC 


360 


CCCTATCTCA 


GCCAAATCAG 


ACTTGCTTCC 


CTGAATCATC 


TTAATTGGAA 


TTTCCATGGG 


420 


AACATCCTAT TGAACATTGC 


TTGTCAAGTT 


AGACAAGAGA 


CGCTCAAATG 


AGTATTCATA 


480 


GGTTTGGATG 


TCTCCTGCTC 


CCATAAAGAC 


GTAAACAGCA 


TTGTCATGCT 


CTAGGAGTGG 


540 


AGAAACATTT 


TCAACACTAA 


TCACTTGGTG 


TTTTTTGTTG 


ATTTTGTTGG 


CTAGGTCTTC 


600 


TACCTTAACG 


TCACCATGAT 


CTACTTCACG 


AGCCGAGCCA 


TAAATTTGCG 


CTAGATAAAC 


660 


ACCATCTGCT 


7CCTTTAAAC 


CATCGGCAAA 


CTCCTCCAAC 


AACCCAATGG 


TTCTTCTAAA 


720 


GGTATGCGGT 


TGAAAGACTC 


CTACAATTTC 


CTTGCTTGGG 


TATTTCTGAC 


GAGCCGCATC 


780 


CAAGGTCGCA 


ATAATTTCTG 


TTGGATGGTG 


GGCAAAGTCA 


TCGATAATCA 


CTGTATCATT 


840 


GACAATTTTC 


TCAGTCAAAC 


GACGTTTAAC 


ACCGGCAAAT 


GTTTTCAAGT 


GCTCACGCAC 


900 


CAACTTCAAA 


TCAAATCCTG 


CTGTGTAAAG 


AAGACCAATA 


ACGGCTGTCG 


CATTCATGAT 


960 


ATTGTGACGA 


CCAAAGGTTG 


GAATGTGGAA 


TTGCCCCAAG 


TTTTGTCCAC 


GGAAATGAAC 


1020 


GGTGAAGGTT 


GAACCAGTTA 


TTGAACGAAG 


AACATCACTA 


GCTACAAAGT 


CATTGCCTTC 


1080 


AGCTTCAAAA 


CCATAATAAT 


AAATTGGTGC 


ATCAGACCTA 


ATCTTACGCA 


ATTCAGCATC 


1140 


TTCACCATAG 


ACAAAAAGAC 


CCTTGGTGAT 


TTGTTTGGCA 


TAGTCCTTAA 


AGGCATTAAA 


1200 


AACATCCTCG 


AGACTTGTCA 


AATAATCTGG 


ATGGTCAAAG 


TCAATGTTGG 


TGATAATAGA 


1260 
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GTATTCTCGG 


TGGTAAGGCA 


TGAAGTGACC 


CTCATATTCG 


TCAGATTCAA 


AGACAAAATA 


1320 


TTTGGCATTG 


GCCGAACCAC 


GACCTCTCCC 


ATCTCCAATC 


AACAAGCTGG 


TATCTGT AAT 


1380 


GTGACACAAG 


ACATGAGACA 


ACATACCTGT CGTTGAAGTT 


TTTCCATGTG 


CTCCTGCTAC 


1440 


TCCCATGCTA 


ACAAAGTCAC 


GCATAAAGCT 


ACCTAGAAAC 


TCATGGTAAC 


GTTTGTAGCT 


1500 


GATACCATTT 


TGGTCCGCAT 

4 4 WWS#W^^ 4 


AGGCAATTTC 


GACGTTGTTA 


TCTGGACGAA 


AGGCATTTCC 


1560 


nuCun 1 i l 


l LUt 1 a 1 V.nVi 


CCTCTAGATT 


TTTTTCATCA 


AAAftfiAAfSAA 


•PIV^TAATTCC 

4W>1#W^1 A WW 


1620 


TfiCfTfifTPA 


MUALvVJUVJ 4 1 


GGGTAAAGTA 


GTACTTTTCA 


ACATCTCATC 


CCTGAACCTT 


1680 






TCAAGGCCAA 


GGCACTCATC 


WW 4 \*£\ 1 WWW 1 




1740 


& a aaTTaTiT 

nsw\S\ lun 1 J\ i 


GTCTTTGACA 


TGTTTTCTCC 


CCTATTCTGT 


Wn 11^1 ww 1 W 


ACATTCAACT 

t\K»f\ 1 1 wAnW 1 


1800 


w 1 IwOLAAL 


CCGACGTTCT 


TGTTCTGTTT 


GTTTACTTTT 


1 1 1 n A 1 \* 1 nw 


A*^^p 1 ^p^i^^^^^p^^*T* 

nil Awl 


1860 


1 V. 1 1 1 nk*/ww* 




TTTTTCTTTG 


GAGCAGGTGC 


ly^apaPTm 1 

i vmwito i i w i 


1 wn 1 1 w 1 1\M 


1920 




nT\*AAw r ivi 


TCCGCCAAGA 


TATAATGAGA 


V. 1 wuu 1 w n/Vl 


TTVTGGCTAT 


i q fin 


Alii uALAAA 


1 i tnL^AuUn 


TTTTCCTTTT 


GGAAAGGAGC 


TGTCGGTTGA 


TTGCCCTOTC 


2040 


Ta arrararr 




TGACGTCTCG 


CAAGCCTGAA 


ATCCTCAGT7 


AtWT AfVTT AC 
nv>u i nu 4 • nw 


2100 


LnuAli^g 111 


CTTTTTCAAG 


TCCCCACGCG 


CTTCTTCACG 


CGCCACCTCC 


own i now 1 w 1 


2160 


TTCCTTCTTT 


^p^p^p a fc 


AAAGGAGCCT 


TTTTAGGTTT 


TTCGACTTGC 


TTTTPlATPtt 
• <* 4 a wnns 4 WW 


2220 


wlill nl- 1 uu 


TTTTTCTTCA 


GCAATAGGAG 


CCCATTCTAA 


ATAATTTTTA 


A W A W \Jm\ A A 


2280 


w i i wn i 


nil nw 1 wn i w 


AGATCAGACT 


CATCATAGAG 


ATTCATCACT 


GCCATTTCAC 
wivn «ii wnw 


2340 


TCAArATRAP 


w 1 1 WA 1 V. 1 


CACACCAATG 


GAAATCGTTC 


TTGTTTCATT 


4 4 *^»W 


2400 


TTTCAACACT 


TCATTATACC 


GTATTGTCTT 


GATTTTTCAA 


CTGCTCCCTT 


CAGAAATTCC 


2460 


CAAAATTTCT 


W * 4 4 4 W 4 W 


CTAGGGTCAG 


ACTACCACGT GACTCTGTCC 


CGTCCAATAC 


2520 


TTGTGACACC 


AGATGTTTCT 


TTTGTTCTTG 


GAGTTCCTGA 


ATTTTTTCTT 


CAATGGTTCC 


2580 


CTTGGTCACC 


AAGCGATAGA 


CCTCAACCGT 


TTCTTCCTGA 


CCCATCCGAT 


CCGCACGGCC 


2640 


AATGCCTTGC 


GCTTCCACCG 


CAGGATTCCA 


CCAAAGGTCA 


ACCAAGATCA 


CTGTATCTGC 


2700 


ACCTGTCAGG 


TTCAGACCGA 


CCCCACCAGC 


CTTGAGGGAA 


ATCAGAAAGG 


CATCTCTTTC 


2760 


TCCTTGGTTA 


AAGGCCTTGG 


TCATGTCTTG 




GCTGGCGTTG 


AACCCGTAAT 


2820 


TTTAAACCAA 


GTCAGGCCCA 


AGTCTGGCAG 


TTCTTGTTCA 


ATTTTTTCCA 


AC ATT CCCTT 


2880 


GAACTGAGAG 


AAAATCAAGA 


CACCGTGTCC 


GCCGTCTGCC 


ACCTGTACCA 


GTAGGTCTCG 


2940 


GAGACTATCT 


ACTTTGCCGC 


TGGCTCCCTC 


ATAATCTTCC 


ATAAACAGGG 


CAGGAGTGTC 


3000 



rio « orvo i ao I 
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ACATATTTGA 


CGCAAGCGCA 


TCAAACCAGA 


526 
TAAAATTTCC 


ACACGACTTC 


GCTGAAATTC 


3060 


CTGTTCTGAC 


ACTTGAGCCA 


GATGGTCTCG 


CATCTGTTGT 


AACTGGGCAA 


GCTAAATAGC 


3120 


CTTTTGCTCC 


TCTTCCAGTT 


CATTTTTATA 


AACCACCTCA 


ATCAAGTCTG 


GCAATTCACT 


3180 


CAGAACTTCT 


TCTTTCTTGC 


GTCGCATCAC 


GAAAGGCTTG 


ATAAACTGAC 


CCACTCGCTC 


3240 


TGCTCGCAAT 


TTCATAAATT 


CTTTCTTGCT 


TGCCAAAAGT 


CCAGGCATGA 


CCATTTGGAA 


3300 


AATAGACCAC 


AACTCACCCA 


GATGGTTTTC 


AATCGGAGTT 


CCTCACAAGG 


CAAACACCGA 


3360 


CCGCACCACA 


AATTGTCTCA 


AGGTCTGGGC 


AATCTTGGTC 


TCGCCATTTT 


TCATGACCTG 


3420 


AGCCTCATCT 


AAGAAAAGGA 


AGTCAAAGGC 


CATCCCTTCA 


TAAAACTCAC 


TGTCCTGACG 


3480 


GAACGTGGCA 


TAGCTAGTCA 


CATAGATTTG 


ATGGCTCTCG 


GCAAGAATCT 


CCTCACGACT 


3S40 


TGCTTTCAAA 


CCATGAACAA 


CAGTCACATC 


CAACTGTGGA 


GCAAATTTCT 


GAAACTCATC 


3600 


TGCCCAGTTG 


TAAATCAAAC 


CCCACGGAGC 


GAGAATCAAA 


ACCCGACTTT 


CTTTTGTCAC 


3660 


TTGACTAGTC 


AAAAAAGCAA 


TGGTCTCAAG 


GGTTTTCCCA 


AGTCCCATAT 


CATCAGCCAA 


3720 


AATCCCACCA 


AAACCATAAT 


CATGCAGCAT 


CTGCAACCAG 


CCAATTCCCT 


TTTCCTGATA 


3780 


ATCTCCCAAG 


TCAGCCTTGA 


CCTGAGTTGC 


TTGC AAAGG A 


AAGTCCTCTG 


GATGCGTCAA 


3840 


ATCCTGGGCC 


AGATTCTGGA 


ATTCTTGTGA 


AAAAGAAACA 


CGGTCTCGCC 


CTTCAAAGAG 


3900 


ATC AG CT AAA 


CTCTAGGCCA 


AGGATTTCCG 


AGCCTGCAAG 


CTCCCATCTT 


TTAATTCAAA 


3960 


TTGCCCCACT 


TCCTGTACAT 


TTTGGCGAAT 


TTTCTTGGTT 


TCTTCATCGA 


AAAAGTAAAC 


4020 


TTG ATT AG AC 


CAATCAATAT 


AAAAATCCTG 


ATTGGCAACC 


AAGGCCTCCA 


TGGCTTGGTC 


4080 


GATTTCCTCC 


TCGACAATAT 


TTTGAAAATC 


AAACTGGATT 


TCCAAGAGAC 


CTCCCTTGGA 


4140 


GGCAATCTGC 


ACCTGAGGAC 


TCCCTAGGCT 


ATAAACCTCT 


TCTAGTTTAT 


CTGATAGGTC 


4200 


AACATGCCCG 


AGTTTTTCAA 


AG ACT GG AAT 


GATATCATGA 


AAAAAATCAT 


AGACAGACTC 


4260 


CGCTTTTAAG 


CCCTGACGCC 


AAGATTGAAA 


ATCCGCCTCA 


AAG CCCGC AG 


CCAAACAGAC 


4320 


TTGGAAAATT 


CTTTCTTCTA 


AGTCTGCGTC 


ACTTGAAAAG 


GGTAATTCTT 


CTAGCTCTTG 


4380 


TCGGCTAGAT 


ACCTGTCTAT 


TTCCATAATC 


AAALTwnn I I 








TTCTTCCCTG 


TCAAAGTAAA 


AAGAGGGCGC 


AAAAGTTTTG 


ATTTCTAGAC 


GTTCTGGAGC 


4500 


TCAAACGGTG 


CCCATCTGGA 


TAAAAACACT 


CACACAGGAG 


CCCAATTTGT 


CT CG ATC ACT 


4560 


GCTATCAAAT 


TGCAGGTATT 


TCTTTCCTTG 


TTCACCCACA 


CCTAACGCTT 


TAATTTCCTT 


4620 


GAGAAGACGC 


ATCTGCTGGT 


CTGTTAAAAA 


ATAAACCTGA 


CC'TTTATGGA 


AAAGTACTCC 


4680 


TCCCTGATAA 


AAGACATTGA 


CCCTAGGACT 


CTCACTGATT 


TCCATTTCAA 


AATAATCCGA 


4740 


GTATTCTGTT 


ACTGTAAAGG 


CAAATAGATT 


GGCATCAGCA 


TGCATATCCT 


GAAAAAGCAC 


4800 
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GGTTTGwl nu 


CTATCCACTT GATCCTCAAA TTGAAAATGG GGCAAGGCCA 


TCAGTAAATT 


4860 




TCAAAAAAGG 


TCAGACGCAA 


AAAGAGGTGC 


CGACCTTGGT 


TTTGGAAAAA 


4920 


GAGOH-lwn 


ACCAGCCCTT 


CCTCCGTTAG 


TCCGTGCAAG 


AAAGTCAAAA 


CTTCTTGGCT 


4960 


GGCATCATGA 


AAGGCTTCCC 


AAGAAAGAGA 


CTCCTCATAA 


ATCTTGCCAA 


TCATATACGA 


5040 


CTTTCTCTGC 


TCGACAATCC 


TTAAAAAAAG 


TGGAATATCC 


CGAATGACAT 


AGTATTTTTG 


5100 


GCTATTGATT 


TGGCCGATTC 


TCAGAGTCCA 


CAAGATATGA 


TTGCTTCCTG 


CTTCCACCTG 


5160 


ACCCACAGCT 


GATAACTCAT 


AGCCGCATTC 


TGATTTTGCA 


GATAAAATTC 


GATCCAAAAA 


5220 


CTTGCCACCC 


AACGTCACCT 


TGGTTTCAAC 


AGCCTCTTTT 


TCTTCATGAC 


CTTCTTCCAG 


5280 


ACTCCACAAG 


ATTPCCTGAC 


CACGCTCATC 


ATTTTTCAGA 


AAATGCTCTA GCGCTGCCAA 


5340 


ATGCACACAG 




TTTCAAAAAA 


ATCACAGGCA 


CAAAAAACCA 


AATCATCCTC 


5400 


TAAACTATAG 




CTTCTGCAAC 


GCGAGCGTAG 


AGCCGATTGT 


TCTTTTCCTT 


5460 


GATGATATCA 


ACCTTACCAG 


TTTCATAAAG 


GCCAACACCT 


TCGATACGAA 


TTTTCCCCGG 


5520 


AATCAATTTA 


VJlr lut * « • * * * 


CACCTTTACC 


TTATCTTTTT 


ft TWT ft ft* ft ft #f% 


Alii 1 V_*j\_<- 1 


5580 


ATGAAAATAG 




AGACTTTTCT 


CCTAGAAGCC 


TGCATTTTTA 


ACGTTTGGC A 


5640 


AAAGTAGCCA 




AC AG ACTTCT 


TGCAACAGAG 


ATTTGGGCAT 


ACCTATATTG 


5700 


ATGCGCGCAT 




TTCCTCTCCA 


AAATCCAAAC 


CACGGTTGAG 


CATAACCTTG 


5760 


GCTTCATTTC 


TCAACAACTC 


TTGCAATGTT 


TCATCAGTCA 


GCTCATAAGC 


TGAAAAGTCA 


5820 


AGCCAAATCA 


AGTAGGTACC 


TTGCGGTTTC 


ATGACCTTGA 


TTTTAGTCTC 


TTTTCCAAAT 


5880 


AGATCCATCA 


CATAATTGAT 


CTCGTCTTCA 


AAGACTTGCT 


TGAGTTCCTC 


TAGCCAATCT 


5940 


****** ft Mil ft *Y%^ 

TTACCGTATC 


GATAGGCAGC 


TTCTGTCGCC 


AAATAACCCA 


AGCCTGAAAT 


TT C ATGCTGA 


6000 


TTATTGGCCA 


ACAGGCCTTT 


CTGCAAAIjCL 




TAGGATTTTC 


AATCACTGCA 


6060 


TAGCAATTTT 


TTCTTCCAGC 


AATATTAAAT 


CTTTTAGTGG 


CACTGCTCAA 


GACGATAGCA 


6120 


AAATTTTTGA 


ACGCAGGATT 


CATGGTATTG 


AAAGACTGGT 


GTTTGTGACC 


AAAGAGGGTC 


6160 


AAATCTTGGT 


GAATCTCATC 


CGAAACTAAC 


AAAACACCGT 


CTTTTTGGCA 


GAGTTGGCCA 


6240 


ATCTTCTCCA 


ACACTTCTTT 


TTCCCAAACA 


CGTCCACCAG 


GATTGTGAGG 


GTTCCAAAGA 


6300 


ACATAGAGTT 


TAACCTCCTC 


TTCCACCAAA 


TCCTTTTCAA 


GTTGGTCAAA 


GTCAATCTCA 


6360 


AACAGACTAT 


CCTTTTCCAC 


TAAGGAATTA 


GTAATCAATC 


TACGATTATT 


CAACTTGACA 


6420 


CTGCGAGCAA 


AGGGTGGGTA 


GACAGGCCTG 


TTAATTAAAA 


CCGCCTCGCC 


TTCTTTTGTA 


6480 


AAGGTTTGAA 


TAGCTGTTGA 


GATGGCTGGT 


ACCACACCCT 


CGATAAAGAC 


AAGAGCCTCT 


6540 



I 

t 
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TTGTCAAAGT 


TGTAACCGTA 


TTGTGTAGCT 


528 
TCCCACTTTT 


G AACTTC CTT 


AATTAACTCT 


6600 


TCACTGGCAT 


AGGTATAACC 


ATAAACCAGT 


TGGTCTGCGT 


AAGTTTGCAC GGCTTGGCGG 


6660 


ATTTCAGGCA 


AGACCACAAA 


GTCCATATCC 


GCTATCCAAG 


CTGCTAGAAC 


TTCACTATCC 


6720 


GTTTCTGTTT 


CTTTCCATTT 


ATAGGTATGG 


TGCCCTAAAC 


GGTTGGGCAC 


GCTTGTAAAA 


6780 


TCATATTTTC 


CCATCTTTGT 


CTTATCCTTC 


TATGGCTTGG 


CGCAAATCTG 


CAATCAAATC 


6840 


TCTAGCATCC 


TCAATCCCAA 


TAGACAAACG 


CAAGAGGTCA 


TCTGTCAAAC 


CATAAGAATG 


6900 


GCGTACCTCT 


GCTGGAATAT 


CAGCATGAGT 


TTGAGTCGTT 


GGATAACTAA TAAGACTTTC 


6960 


CACTCCACCC 


AAACTTTCCG 


CAAAAGAGAA 


CACCTTGAGA 


CTGTTCAAAA 


TATGAGGAAT 


7020 


GCGTGTTTCA 


TCGGCTACTT 


TAAAGGAAAT 


CATGCCTCCA 


CGACCAGTGT 


AGAGAACTTC 


7080 


CTTAACTGCT 


GGACAATCCT 


TCAAAAAGGC 


AACCACTTCT 


TGGGCCTTAG 


CTGTTGAGCG 


7140 


CTCCATACGA 


AGAGACAAGG 


TCTTGAGACC 


ACGAACCAAC 


TGGTAGCTGT 


CAAATGGACA 


7200 


CAAGACTGCC 


CCTGTTGTAT 


TAAGATTGTA 


AAAAAGCTTC 


TCCTATAGTT 


CTAAACTATT 


7260 


GGTCACAACC 


ACTCCACCCA 


AGACATCATT 


GTCGCCTGCT 


AGATACTTGG 


TTGCTGAATG 


7320 


GACAACGATA 


TCTGCTCCAT 


CTTCAATCCG 


ACGTTGCTAG 


ATAGGGCTAT 


AGAAGGTATT 


7380 


GTCCACCACC 


ACTTTGGCAC 


CCTTAGCATC 


AGCCAATTTT 


CCTAGTTTTT 


CGATATCAAA 


7440 


TTCCAACATC 


AAGGGATTGG 


TTCGGCTTTC 


CATATAGACA 


ACATCCACAT 


CCTTTTCTAA 


7500 


CTCGGCAATC 


AACTCTTCTT 


CTGTATTGGC 


ATAGGTAAAA 


TGCAAATGAC 


CTTCCTGCTC 


7560 


CACTTGGTTA 


AACCAGCGAA 


AAGAACCACC 


CTAAAGATCA 


CCCACTGCCA 


AGACCTTACT 


7620 


TCCTACTGGA 


AAGACGCTAA 


AGGCCACTAC 


AATAGCTCAC 


ATC CCTGAGC 


T ACTCCCT AC 


7680 


GGCATAGTCT 


GCTGACTCAA 


TAGCCGCCAA 


GACTTCCTCA 


GCCTTACTAC 


GAGTTGGATT 


7740 


TTTAGTGCGC 


CTATACTCAA 


ACCCAGTAGA 


TCCACCAAAC 


TCTGGATGCT 


GATAGGTCCT 


7800 


TGAAAAATGA 


AGTGGTGTCA 


CCAAAGCACC 


TGTTGCCTCA 


TCAGACTTGA 


TCCCTGCTTC 


7860 


TCCTAAAATT 


GTGTTAATGT 


CTAATTCCTT 


GCTCATACAA 


TTCCTCCAAA 


TCTATAGTAA 


7920 


CTATwiACC 


ACTTATTTTG 


TATCCTTCGT 




TCAAGAGCTA 


GTTATAGTTT 


7980 


CAAACTATAT 


AAAAAGGGAG 


TTTTTCCTGC 


TCCCTTTAAT 


AGACTATAAA 


ATGCTGAATC 


8040 


TCAAAAGACA 


CCTTCACTCT 


ATCATTTGCT 


CCTGCACAAA 


ACGAGCATAA 


CGCTCATGAT 


8100 


TTTCCAGTAG 


TTCCTTATGA 


GTTCCTGAGC 


CAGTGATTTT 


CCCCTCCTCT 


AAGAAGAAAA 


8160 


TACAATCCAC 


ATCTTTTACC 


GTTGACAAAC 


GATGCGCT AT 


AATCACAACC 


GTCTTCTCCT 


8220 


TTAGTACAGA 


ATAGAGGCTA 


CTGATAATCG 


CATACTCAGA 


ATCCGCATCA 


ACATTAGCAG 


8280 


TGGCTTCATC 


AAATATAAGA 


ATTTCAGCAT 


CTTTTAAGTA 


GGCTCTAGCT 


ATTTGAAGTC 


8340 
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TTTCCTTCGC * 




AGAGTCGTCC 


GCGTTCACCA ACTTCAGTAT CTAGTCCCTC 


8400 


TTTCATGGAG 


r+r* a »pr**TV* Ik. T 
(,UAA IV, i V-/* 1 


CACCTAGTGA 


TACTAAGTCT AGCACTTTCA TCAATTCATC 


8460 


ATCAbii ACT 


AAuwViA 1 i 


AACCGAGACA 


AAGATTGTCA CGAATACTCC CAGATAAGAC 


8S20 


TGCATTATTT 


TGTGAAACCV 


AAfiCCATTTT 


ACTTCTCCAT TCTTTTAAGT TAAAATCATA 


8580 


T AT ACTTGAT 


TGCTCCATTA 


HAATATCTCC 


TGAAAGCGGT TTATAAAACC GCTCTAACAA 


8640 


ACGCACAATC 


GTTGATTTTC 


rTGATCCAGA 


TGGTCCAACA AAAGCAATTT TTTGCCCCTT 


8700 


CAAAATTGAA 


CAAGTAATAT 


ill 


AGGTCGATTT TCATCATAAC C AAAAT AG AC 


8760 


ATCGTTAAAA 


TTCAACCCTC 




CGATTTTCCT CCCTCAAATT TTTCITTAGG 


8820 


AACTCCAACC 


AAGTTCTCCA 




AGATCCCTTG CTCCTAGAAT AAACAGTTAC 


8880 


AAAATTACCT 


ATATTACTAA 


TAuun I 1 #Ww 


TAATTGAAAG AGGTAAATCA AAAACGAAAC 


8940 


CAAGGTTCCC 


ACAGATATAT 




GACCCGATAA CCCCCATAGG TTAGCATCAC 


9000 


AGCTATAGTC 


GCAAAGATAA 




AAACGGGGTC TCAAAAGAAG TAACCCTATC 


9060 


TGATTTCACT 


GAATTGTTTT 


Ol A^*— 1 1 I w 


AATACAATTA TCCAAAACAT CCTGT AC ACT 


9120 


TTTCTCTGCT 


TGGTTAGTCT 


1AAJ 1. i ^ 


ATGTTCTTGA ATCTTTTC AG TCAATTGCCC 


9180 


TGTTAAATTT 


CCTCCTGTAA 


afYI Af~T A 


TACTTTTCAC TGATATTGGA AAGGGGCAAG 


9240 


ATAATAAACA 


TCATACAAGG 


AAuAVi A >j 


AATAAAAGTA GAGAAAGATT CCAATCAAGA 


9300 


CTAAATAAGA 


CTACAATGGA 




ATAACTAAAC TCAGAATAAT ATTTGGGAAA 


9360 


CTCCTAATTA 


AAAACTCACG 


AiTfiACACTC 


GTGTCATTGA CAATGGCAGA AGTCAACTCC 


9420 


CCACTTTGGC 


TCTTATCAAA 


n & AfiftATTTC 


TCTACATAAA TCAACCCCTC TATCACTTTT 


9480 


TTC CTGATTT 


TTGCTATCTT 


L & 4 » * ^#"»W**w 


GATTGACTAA ACAGATAGTA ACCAATAGAA 


9540 


GAAAACAAGG 


CTTGACCAAT 


AAAAATCAAA 


AACGATTGAA ATACTTTGGA GCCTATATTT 


9600 


TCAATAGAAC 


TCCCATCTAT 


TAAATCCTTT 


AAGATAAGGG GAAGCAACAA AGCAAGTAGA 


9660 


CTAGACAGAA 


CAAGTAAGAA 


ACTCCCCATA 


ATCACCTTAG TATCTACTCT TAATAATTTT 


9720 


AATTTCATAA 


ATACTCCTTA 


TAATATTTCA 


ACGGATAAAG TCGGGAATAA CTCAATTTGA 


9780 


GGATAAAATC TAATAAATCT 


TCCTATAACA 


AAACGCATAA CATCTAGGAT TTTATATACC 


9840 


TGATATTATG 


CGTTTTTAAG 


CACAAAGACT 


TCTTACACAA ACTTATCTAC AATTAGATTT 


9900 


TATTTCACAT 


GTTTTGCCAA 


TTCTTCTTGG 


GCTTTTTTAT TGG ATTCTTC TTTTTCTTTC 


9960 


AACCATTTTT 


CTCTGGCTTT 


TGCATATTCG 


TCTGTTGTGA CAATCTTATC TTGTACTTTG 


10020 


AGGTATTTAT 


ATGATTCAAC 


CCCTTTTGTA 


CCGCTTAAAC CATAGGCAGC AGCAAATGGT 


10080 
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ACGGTTCTTC TCAATGATGG TGTTCCCCCA CGCGAAACAC TTGCAAGAAC TAAAGAACTA 10140 

TCAATCAACC AAGCTTGAAT ATCAGCATAT TTCTCATAAC CTTTGGCCGG ATCTTGCTCT 10200 

TTATTAGCTT CTTCCAACAT TTGAGTATAG ACATCCACTC CAACTGCCTT AGCCTTGTCA 10260 

TTGGCCTCAC CAGGCTCTAG TCCAAGATTT TGCAGAAATC CTCCACTATT ACTATTAAAA 10320 

ATATCGAGAT AGGTTGACGG CTCTTGATAA TCAGCTCCCC AACCGCCATC ATATAAATCA 103 80 

TAATCTTTCT GAGCAGCTGT TTGAGCAAAG TAGCCTGAAC TGTCAAACTC ATCTCATGTT 10440 

AATTGCTGAA TGTCAATCAC TACATTATCA GAACCTAAAA CAGATTCAAT TGATTGTTTG 10500 

ATAGAACTAA CTCCTTCTAT GCCTACTTTA TCTGTTACTT CCACAGTCTT ATCCAAGTGC 10560 

ATTGGGAATT GAACACCCTT TGCTTCGAGT TCTTTCTTAG CTTCCGCAAA CTTAGCCTTG 10620 

GCTTTCTCAG GATTGTACTA AGGGTCTTGA CCATCCGCAA AGTTGATACC TTGCCATTCC 10680 

TTACCATAGT TGACCATCTT AGAGGCTACA ACTTCACCAA AGTCTTTTCC CTTGATACTG 10740 

ACAAACTTTG GAGGAACCAC TAGGTTACGC AAAATCTTTG TTGCACCTTC TTTCCCTTCA 10800 

GACTGAGCCC CATAAGATGT TCTGTCAAAA GCAAAATTGA TAGCCTGACG GAAGTTTTTA 10860 

TTGAGAACTG CTTCCTGAGT CGATTTCTTT TCAATGTCAC TT G TTTTAGA ACT AT AATTG 10920 

TAAGACTTCC TATCTAGGTT AAAATTAAAG AAATATGAAG TTCAATTTTG CATACTATAG 10980 

ATGATATTCT TTTTGTATTT TTCTTTAATC CCTTCATACC TCGAGCTGTT AGCAAAAAGA 11040 

CGAGCCCTAG TATAAGCACC AGCTGTAAAA TTACGTTCCA GTCATTCTTC GTCGCTACCA 11100 

TCATAGTACG TCAATTTCAC ATCGTCTACA AAGACATTCT TAGCATCCCA GTAATTAGGG 11160 

TTTTTCTTAT ATTCAATAGC AGATTTTGAG ACAAGTCCTT TCATCAAGAA AGGTCCATTG 11220 

TACAAAATAC TAGATGGATC CGCCTTCCCA AAATCATCCC CTTTTGATTT CAGGAAATCT 11280 

GCATTAACAG GAAAAAGTAT CCTTGCAAGT GTTTTTGAAT TCCAGTAAAC TTCTGGTTTA 11340 

ACCAAAGTAT ATTGAACCGT TTGGTCATCA AGTGCCTTGA CACCGACAGT TGAAAAGTCG 11400 

CTTGTTTTAC CACTGATATA GTCATCCAAA CCACCAACAG AGTCCTGCAC TAGATACAAC 114 60 

GCTTCTGATT TTTTATCAGC TGCATATTGC AAACCTGTCA CAAAATCCTG GCCACTTACA 11520 

GGCGCATATT CTTCTCCCTC AGAAGTAAAC CACTTGGCAT CCTTACGAAG TTTGTAGCTA 11580 

TAGGTCAAAC CGTCCTGAGA AACAGTCCAA TCCTCTGCTA ATCATGCAA7 AATATTCCCA 11640 

TATTCCTCAT TTTCTAATAA CCCGTCTACC AAATTTGCAA CAATATCCGA TGTTGCTGCG 11700 

CGGTTTTCTG CTAGATAGTT CAAGCTAGAT GGATCACTTG AATAAACATA GTTGTAGGTT 11760 

TTTGACGCCG TGCTAGAATT TCCACACGCG CTCAATAAAA CTCCTGTACC CAGGACAAGA 11B20 

CCTGCCAAGG TTAGATATTT CCTCTTAGAC TTTTTCATTT CCGG 11864 
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(2> INFORMATION FOB SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



txi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

T AACTG C ACT AAACATAATA TAAGGAGAGA AAATCTCTGC AATAGAACGT ATTACAAAAG 60 

CTGCTCACTT AATTGATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 120 

TTGCTGAGGA AGTCACTTTC CCCCTATCTG ACCACGAAAT CATCCTAGGC GAAAAGATGA 180 

TGCAATTCCT TAAACATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 240 

GTGTTGGACT GGCTGCTCCC CAGTTAGATA TCTCAAAACC CATTATCCCT GTTTTGGTAC 300 

CTAATATTGT TGAAGAAGGC GAAACTCCAC AGGAACCCTA CGATTTGGAA GCCATTATGT 360 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AACATGCTGC TCTTGGCGAA GGAGAAGGTT 4 20 

GCCTGTCTGT TGACCCTAAC CTGCCTGGCT ATGTTCTTCG CCATGCCCGC GTTACTGTTC 4 80 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGGCTAC AACTCCATTG 54 0 

TTGTTCAGCA TGAAATTGAC CACATTAACC GTATCATGTT TTACGATCGC ATCAATGAAA 600 

AACACCCATT TGCAGTTAAA GATGGTTTAC TGATTCTTGA ATAAAGAAAA TCCCGTTGCA 660 

AGACGGGGTT TTGTGTTATA ATAGAGGCAT GAAAACAAAT CATATTGTCT ATGGTGTCCA 720 

CCCCCTTACC GAAGCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 780 

CCCAGGTAAG AATGTTCAGA AAGTCAAGGA ACTAGCTACA GAAAAGAAGG TCTCCATTTC 840 

TTGGACATCA AAAAAATCTC TCTCTGAGAT TACTGAAGGT GCTCTTCATC AAGGTTTTGT 900 

TCTACGAGTG TCTGAATTTG CCTATAGCGA GCTAGATTAC ATCCTTGCAA AAACACGCCA 960 

AGAACAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGGGTTC 1020 

TATCTTCCGA ACAGCCGATG CGACCAATGT TTCAGGTGTC ATCATTCCCA AGCACCGTAC 1080 

TCTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACAGGT GCTATTGAAC ACGT tCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGATAAACT TAAGGATGAA GGTTTCTGGA 1200 

CCTTTGGAAC GCATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 1260 

CCCTCATCAT TGGAAATGAA GGAAAACGTA TCTCTACCAA CATCAAAAAA CAGGTCGATG 1320 

AAATGATTAC CATTCCGATG AATGGACATG TTCAAAGCCT TAATGCCAGT GTTGCTGCGG 1380 
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CCATTCTCAT CTACGAAGTT TTCCCAAATA GACTATAAAA 
GGAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT 
TAGCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC 
CTGCAAAGAT ACCGTCGACT GCAGTTTTCA TGTGGTTATC 
GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG 
AGACACCACC GAAGCCTTGT TCTGTCACTT CACCTGTTTT 
ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA 
TTTCATTCGC AAAGGCGCGA TCTTGTAAAA CCTTTTGGGC 
GAACAATGGT AACAGTCTTA GCAAAACGAG TCAAGAAGAG 
CTCCACCACC AACTACCAAT AAATCTTGGT CACGCAACAA 
AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG 
TAGAACCAGT TGCTACGATA ACTGTACGTG TTTCATATGT 
TCTTAAAATC ACCATGGCTT CGACATTTTC AACATAACCA 
ATTTTCAAGT GGTTCAAACA TCTTTTCAGC CAATTCAGGT 
TGGGT AATTT TCGATATCAG ATGTATTATT CATCTGACCA 
CAAAGCTACT TTTAGATTGC TTCCAGCAGC ATACAAGGCC 
ACCACCGATA ATAATACTAT CGTACATATA GATTCCTTCT 
ATTCTAACTC TG 

(2) INFORMATION FOR SEO ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7760 base pairs 

(B) TYPE: nucleic acid 
tC) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



AAGTTTCCAG TCATCTGATT 1440 

ATACGCTTCT TGACCAGCGA 1500 

TTTCAAGCGA ACATCTCCAA 1560 

TGTCACAATC CATCCTGCCT 1620 

GTCCAAACCA ACATAGATAA 1680 

CACATTTTCA AATACCACTG 1740 

ATCCCAGATA AACCTGATTT 1800 

ACGAAGTTGC TCACGACGGT 1860 

GGCTTCTTCA ACAGCTGAAT 1920 

AGCACCATCA CACACAGCAC I960 

CACTCCCAAA GGACGGTGTT 2040 

TTGGTCATCA GTCATCACTT 2100 

TAAATGTGCT CAACACCAAG 2160 

CCACTAATAT TAGCGTATCC 2220 

CCTGGCAGAC CACCTTCAAT 2280 

GCAGTCATCC cTGCAGCTCC 234 0 

TTCTTGGTGT AACTATCTTT 2400 

2412 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGAGCA 
TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAATATTGTA 
AGGATGAATG GATTTTAGGA ACAGTTGCTG GTCCCAATAC CTTATTGGTT 
ATCAGCACGT TGCCAAACTC ATGGAAGATC CTTTGCTAGA TTTGATGAAA 
GTCTTGGGAG TTGCTCTCAA GACTTATTTT TGAAAAGGAG AGACAGAAAA 
AAAGTTATCA CCCGGCATGC AACAGTATGT GGATATTAAA AAGCAATATC 



GAGTTTACCT 
CATCTAAACA 
ATTTGTCGAC 
GATAAGTAAG 
TGGCGATAGA 
CAGATGCTTT 



60 
120 
180 
240 
300 
360 



WO 98/18931 



PCT/US97/19588 



533 



TTTCCTCTTT 


CGGATGGGTG 


ATTTTTATGA 


ATTATTTTAT 






420 


CCACATTCTG 


CAAATTTCCT 


TAACGAGTCG 


CAACAAGAAT 


GCCGACAATC 




480 


CGCGCGTGTT 


CCCTATCATT 


CTGCCCAACA 


GTATATCGAT 


GTC TTG ATTG 




540 


TAAGGTGGCT 


ATCGCAGAGC 


AGATGGAAGA 


TCCTAAACAA 


GCAGTTGGGG 


TTGTTAAACG 


600 


AGACGTTGTT 


CAGGTCATTA 


CGCCAGGGAC 


AGTCGTCGAT 


AGC AGT AAGC 




AAA 


CAATAATTTT 


TTGCTTTCCA 


TAGACCGCCA 


AGCCAATCAA 


TTTCGCCTAG 


LIlATATGGA 




TTTGGTGACG 


GGTGACTTTT 


ATGTGACAGG 


TCTTTTGGAT 


TTCACGCTGG 


TTTGTGGGGA 


# ou 


AATCCGTAAC 


CTCAAGGCTC 


GAGAAGTGGT 


GTTGGCTTAT 


GALTTGTCTG 


AGGAAGAAGA 


OA ft 


ACAAATCCTC 


AGCCGCCAGA 


TGAATCTGGT 


ACTCTCTTAT 


CAAAAAGAAA 


GCTTTGAAGA 


900 


CCTTCATTTA 


TTGGATTTGC 


GATTGGCAAC 


GGTGCAGCAA 


ACGGCATCTA 


GTAAGCTGCT 


960 


CCAGTATGTT 


CATCGGACTC 


AGATGAGGGA 


ATTGAACCAC 


CTCAAACCTG 


TTATCCGCTA 


1020 


CGAAATTAAG 


CATTTCTTGC 


AGATCGATTA 


TGCGACCAAG 


GCTAGTCTGG 


ATTTCGTTGA 


1080 


GAATGCTCGC 


TCAGCTAAGA 


AACAAGGCAG 


TCTTTTCTGG 


CTTTTGGATG 


AAACCAAAAC 


1 140 


GGCTATGGGG 


A TGCCTCTCT 


TGCGTTCTTG 


GATTCATCGC 


CCCTTGATTG 


AT AAGG AACG 


1200 


AATCCTCCAA 


CCTCAAGAAG 


TACTGCAGCT 


CTTTCTCGAC 


CATTTCTTTG 


AGCGTACTGA 


1260 


CTTGACAGAC 


AGTCTCAAGG 


CTCTTTATGA 


CATTGAGCGC 


TTGGCTAGTC 


GTGTTTCTTT 


1 JZU 


TGGCAAAACC 


AATCCAAAGG 


ATCTCTTCCA 


CTTGGCGACT 


ACCTTGTCTA 


GTGTGCCACG 


1 JeU 


G ATT CGTGCG 


ATTTTAGAAG 


CGATGGACCA 


ACCTACTCTA 


GCCTATCTCA 


TCGCACAACT 


1 <4A 

1 44U 


GGATGCAATC 


CCTGAGTTGG 


AGAGTTTGAT 


TACCGCAGCG 


ATTGCTCCTG 


AAGCTCCTCA 




TCTGATTACA 


GATGGGGGAA 


TTATCCCCAC 


TGGATTTGAT 


GAGACTTTAG 


ACAACTATUu 


1 SAO 
X sou 


TTCCGTTCTC 


AGAGAACGGA 


CTAGCTGGAT 


TGCTGAGATT 


GAGCCTAAGG 




1620 


CTCTGGTATC 


AGCACGCTCA 


AGATTGACTA 


CAATAAAAAG 


GATGCCTACT 


ATTTTCATGT 


1680 


GACCAATTCG 


CAACTAGGAA 


ATGTGCCAGC 


TCACTTTTTC 


CCCAAGGCGA 


CGCTGAAAAA 


1740 


CTCAGAACGC 


TTTGGAACCG 


AAGAATTAGC 


CCGTATCGAG 


GCAGATATGC 


TTGAGGCGCG 


1B00 


. TC AC AACTC A 


pCCAACCTCG 


AATACGAAAT 


ATTTATGCGC 


ATTCCTGAAG 


AGGTCGGCAA 


I860 


GTACATCCAG 


CCTTTACAAG 


CTCTAGCCCA 


AGGAATTCCG 


ACGGTTCATG 


TCTT AC AG AG 


1920 


TCTGGCGGTT 


CTCGCTGAAA 


CCCAGCATTT 


GATTCGACCT 


GAGTTTGGTG 


ACGATTCACA 


1980 


AATTGATATC 


CGGAAAGGGC 


GCCATGCTGT 


CGTTGAAAAG 


GTTATGGGGG 


CTCAGACCTA 


2040 


TATTCCAAAT 


ACGATTCAGA 


TGGCAGAAGA 


TACCAGTATT 


CAACTGGTTA 


CAGGGCCAAA 


2100 



OMcnnnn. ^\Kir\ 
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CATCACTGGG 


AAGTCTACCT 


ATATGCGTCA 


GTTAGCCATG 


ACGGCGGTTA 


TGGCCCAGCT 


2160 


CGGTTCCTAT 


GTTCCTGCTG 


AAAGCCCCCA 


TTTACCCATT 


TTTGATGCGA 


TTTTTACCCG 


2220 


TATCGGACCA 


GCACATGACT 


TGGTTTCGGG 


TCAGTCAACC 


TTTATGGTGG 


AGATGATGGA 


2280 


GGCCAATAAT 


GCCATTTCGC 


ATGCGACCAA 


GAACTCTCTC 


ATTCTCTTTG 


ATGAATTGGG 


2340 


ACGTCGAACT 


GCAACTTATG 


ACGGGATGGC 


TCTTGCTCAG 


TCCATCATCG 


AATATATCCA 


2400 


TCACCACATC 


GGAGCTAAGA 


CCCTCTTTGC 


GACCCACTAC 


CATGAGTTGA 


CTAGTCTGGA 


2460 


CTCTAGTTTA 


CAACACTTGG 


TCAATCTCCA 


CGTGGCAACT 


TTGGAGCACC 


ATGCGCAGGT 


2S20 


CACCTTCCTT 


CACAAGATTG 


AACCGGGACC 


AGCTCATAAA 


TCtACGGTAT 


CCATGTTGCC 


2S80 


AAGATTGCTG 


GCTTGCCAGC 


AGACCTTTTA 


GCAAGGGCGG 


ATAAGATTTT 


GACTCAGCTA 


2640 


GAGAATCAAG 


GAACAGAGAG 


TCCTCCTCCC 


ATGAGACAAA 


CTAGTGCTGT 


CACTGAACAG 


2*700 


ATTTCACTCT 


TTGATAGGGC 


AGAAGAGCAT 


CCTATCCTAG 


CAGAATTAGC 


TAAACTGGAT 


2760 


GTGTATAATA 


TGACACCTAT 


GCACGTTATG 


AATCTCTTAG 


TAGAGTTAAA 


ACAGAAACTA 


2820 


TAAAACCAAG 


ACTCACTAGT 


TAATCTAGCT 


GTATCAAGGA 


GACTTCTTTG 


ACAATTCTCC 


2880 


ACTTTTTTCC 


TAGAATAACA 


TCACACAAAC 


AGAATGAAAA 


GGAGCTGACG 


CATTGTCCCT 


2940 


CCCTTTTGTC 


TATTTTTTAA 


CGACAAAGTA 


TGCTCATTCA 


GAAAATAAAA 


ACCTACAAGT 


3000 


GCCAGGCCCT 


GGCTTCGCTC 


CTGATGACAG 


G CTTG ATGGT 


TGCTAGTTCA 


CTTCTCCAAC 


3060 


CGCCTTATCT 


GCAGGAAGTC 


TTACGCGCCC 


TCCTTACTGG 


GAAATATGAA 


GCTATTTATA 


3120 


GTATCCCGCC 


TTGGTTGATT 


GGTGTGGCCG 


TAGTCGGTCT 


AGTTGCTCCT 


CGACTCAATG 


3180 


TTGTCCTCGC 


ACCCTATATT 


GCCCAAGGAC 


TTTCATCCGA 


CCTTCGGGAG 


GATGCCTTCC 


3240 


GTAAAATTCA 


AACCTTTTCT 


TATGCTGATA 


TTGAACAATT 


TAATGCGGGA 


AATCTAGTCG 


3300 


TTCGAATCAC 


AAATGATATC 


AACCAGA7TC 


ACAACGTTCT 


CATGATGACC 


TTCCAAATTC 


3360 


TTTTCAGACT 


TCCCCTCTTG 


TTCATCGCTT 


CGTTTATCCT 


AGCGCTTCAA 


ACCTTACCTT 


3420 


CTCTGTGGTG 


GGTGATTGTT 


CTCATGGTAG 


TCTTGATTTT 


TGGTTTGACT 


GCTGTCATGA 


3480 


TGGGAATCAT 


GGGGCCTCGT 


TTTGCCAAGT 


TTCAAACCCT 


TCTTCAGCGC 


ATCAATGCCA 


3540 


TTGCCAAGGA 


AAATTTACGT 


GGCGTTCGTG 


TGGTCAAGTC 


CTTTGTCCAA 


GAAAAAGAGC 


3600 


AATTTGCTAA 


GTTTACAGAG 


GTCTCAGACG 


ACCTTCTTGG 


TCAAAACCTT 


TACATTGGTT 


3660 


ATGCCTTTTC 


ACTAGTCGAA 


CCCTTTATGA 


TGTTGGTTGG 


TTACGCGGCG 


GTCTTCCTCT 


3720 


CTATTTGGCT 


GGTCGCGGGA 


ATGGTTCAGT 


CGGATCCGTC 


TGTTGTTGGT 


TCCATCGCTT 


3780 


CTTTTGTTAA 


TTACCTAACC 


CAGATTATCT 


TTACCATTGT 


TATGGTTGGA 


TTTTTGGGAA 


3840 


ATTCTGTCAG 


CCGTGCCATG 


ATTTCCATGC 


GTCGTATTCG 


AGAAATTCTT 


GACGCAGAGC 


3900 
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CACCTATCAC 
ATGTGACCTT 
TTCAACCTGC 
CTCAATTGAT 
ATATTCGACA 
CCATTCTTTT 
TATTTGAAAT 
AGAAAACCTT 
AAAGGATGTC 
CGACCTCAGC 
TGAACGGGAC 
AGATCTTGGT 
CCAACAATCC 
AAAGTCAATA 
TAGAGCACGG 
TTCACAAATT 
ACGGTTCGGA 
TCTAATTTAG 
CCTTGTATGA 
TCAATCTGAT 
GGATTATCAA 
CCAATCCCAG 
GTTTCCCCCT 
CCTCTATGCG 
GGAAATTGTC 
CACTTGGGCT 
CACAGTTCAA 
CCTGATGATT 
GGTGACGCAG 



CTTCAAGGAT 
TACCTATCCA 
TCAAATGGTT 
TCCACGTCTC 
AGTGAGTGAA 
TAGTGGAACG 
GGAGCGCGCA 
TGAAAGTCCA 
GATTGCGCGT 
CTTGGATGCC 
GACAACCATT 
TCTAAATCAA 
CGTTTACCGT 
GTTTTATCTA 
CAAAAAGCCC 
ACATCACTTT 
TAAGTTTTTT 
TCTTAAGATA 
GTACCTACCG 
CTGACTGATA 
AGGCATGAAT 
TAACCGTCGT 
TGTCAATGAG 
TTATCAAACT 
AAGCGATTGG 
TTGCCAGTAG 
TTTTTTTGGC 
GTTCTGG CG A 
CTAGCCAATT 



ATCCCAGATC 
ATGGACAAGG 
GCTGTAGTTC 
TTTGATCCAC 
GGAACCCTCC 
ATTGCAGATA 
G CCAAT ATTG 
CTTCAACAAC 
GGGATTCTCA 
AAATCAGAGC 
ATTATTGCTC 
GGACGATTGA 
GAAATCTATG 
AACTATTTCT 
TTGAAAAAGT 
TTCTTCGTCA 
TCCAAGGTGG 
CGCCTTAAAA 
CAGATGAGGG 
AATAGAAGAA 
ATTTCGAATC 
GATGACCGAG 
CCTCTTGTAA 
CATTACCAAT 
AGGAAATGAA 
CTAACTGCGC 
ATTATTTTAA 
CTTTTGCCCA 
TAGTTCAAGC 



535 
AAGAGTTGGT 
AACCGATGCT 
GACCGACTCG 
AGGACGGGGC 
GTAAAACAGT 
ACTTGAGACA 
CCCAGGCTAG 
GGGGAACCAA 
GCAATCCACG 
CCTTCCTCCA 
AAAAAATTAG 
TTGGTCAAGG 
AAACACAGAA 
TATTTCAATT 
CCATTTTTTC 
AATGGCAGCT 
ATGATGGCTA 
G C AGGCC AAA 
GAACTCCCTT 
TCCACTCCAG 
TCAGCTAAAA 
TTGAACTCAG 
TGTTTGATGT 
TAAAACAAAA 
CTAATCCACA 
TAAATATAAT 
CCTCTACAAG 
AGCCCTCTTT 
TTATCAAAAT 



TGGAAGTCTT 
GAAAGATGTC 
TGCAGGAAAG 
CATTAAAATC 
TTCCATCCTT 
GGGGAACGGG 
TGAATTCATT 
TTTCTCTGGT 
TATTCTGATT 
AGAACCTTTG 
CTCGGTTCTC 
TACGCATGCA 
ATGAAACACA 
TGATGATTTG 
AAAGGTAATC 
CTTTTTTTAG 
CATTGTAATC 
AGCGAGGGCA 
TCACCATTCT 
CGAAAGCTTG 
TGACCGCCCC 
CCATCAAGTC 
TTTCATTACA 
AGCTGTGGTT 
GCGGCTTATT 
ATAAGGAGGA 
TTCTCATTTG 
CCAGTCTTTT 
GGCAATCCAG 



ACCTTTGAAA 
AG CTTT ACT A 
TCAACCTTGG 
GGTGGCAAGG 
CTCCAACCTG 
AATGCTACTC 
CATCGTATGG 
GGACAAAAAC 
TTTGATGATT 
AATAAGGACT 
CATGCAGACA 
CACTTGGTTG 
AACTATAAGA 
GCGATGATTT 
CTGTGTTAAT 
GATATAAAAC 
TTTTCCTTGT 
TGCTTTGGCA 
TCCTGCTAAA 
TAATTGAGCA 
TAAACGATCC 
ATTGACACAT 
CGAGATAAAA 
ACATCCTTTC 
CCAACTATAC 
GTAAAATGAA 
TACTTGTCAT 
CTGGACAAGC 
AACTTGTATG 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4S60 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
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GCAAACCCTA 


TCAGGAATCA 


TGGTCAATCT 


TCGCCTCCTG 


GTTTTGGTTC 


TATTTATCTC 


5700 


TACTGTAATA 


TACATGTGTC 


TCATGACGCG 


CGTGATTCCA 


GAATCGACCA 


ACCAGATCCG 


5760 


CAAACGCCTC 


TTTGCTAAGC 


TTGCTCAGTT 


GACGGTTTCT 


TTCTTTCACC 


GTCGACAAGA 


SB20 


TGGCGATATC 


CTGTCTCATT 


TTACCAGTGA 


TTTGGATAAT 


ATCCTCCAAG 


CCTTTAACGA 


5860 


AAGCTTGATT 


CAGGTCATGA 


GCAATATTGT 


TTTATACATT 


GGTCTGATTC 


TTGTCATGTT 


5940 


TTCGAGAAAT 


GTGACGCTGG 


CTCTCATCAC 


CATTGCCAGC 


ACCCCATTGG 


CTTTCCTTAT 


6000 


GCTGATTTTC 


ATCGTGAAAA 


TGGCACGCAA ATACACCAAC 


CTCCAGCAGA 


AAGAGCTAGG 


6060 


GAACCTCAAC 


GCCTATATGG 


ATGAGAGCAT 


CTCAGGCCAA 


AAAGCCGTGA 


TTGTGCAAGG 


6X20 


AATTCAAGAG 


GATATGATGG 


CACGATTTCT 


TCAACAAAAT 


GAGCGCGTGC 


GCAAGGCAAC 


6180 


CTTTAAAGGA 


AGAATGTTCT 


CACGAATTCT 


TTTCCCTCTC 


ATGAATGCGA 


TG AG CCTGAT 


6240 


TAATACAGCC 


ATCCTCATCT 


TTGCTGGTTC 


GGCTGTACTT 


TTGAATGATA 


AGTCTATTGA 


6300 


AACAAGTACA 


GCCCTAGGTT 


TGATTGTTAT 


GTTTGCACAA 


TTTTCACAGC 


AGTACTACCA 


6360 


GCCTATTATC 


CAAGTTGCAG 


CGAGTTGGGG 


AAGCCTTCAG 


TTGGCCTTTA 


CTGGAGCTCA 


6420 


ACGAATTCAG 


GAAATGTTTG 


ATCCACAGCA 


GGAAATCCGA 


CCTGAAAAGG 


CTCCAACCTT 


6480 


CACTAAGTTG 


CAAGAAAGTG 


TTCAAATCAC 


TCATATCCTT 


TTTTCATACT 


TGCCTGATAA 


6540 


ACCTATTTTG 


AAACATCTCA 


GCATTTCTCC 


CCCTAAAGGC 


CAGATGACAC 


CACTTCTTGC 


6600 


GCCGACAGGT 


TCAGGAAAAA 


CGACTATTAT 


GAACCTCATC 


AATCGCTTTT 


ATGATGTTCA 


6660 


TGCTGGTGGT 


ATTTATTTTG 


ATGGTAAAGA 


CATTCGTGGC 


TATGACTTAC 


ATAGTCTTAC 


6720 


AACCAAGGTG 


GGAATTGTAT 


TGCAAGATTC 


GGTCTTCTTT 


AGCGCAACGA 


TTAGAGACAA 


6780 


TATCCGATTT 


GGTGTGCCAG 


ATGCTAGTCA 


GG AAATGGTT 


GAGGTACCAG 


CAAAAGCAAC 


6840 


CCACATTCAC 


GACTATATCG 


AAAGTTTGCC 


TCATAAGTAC 


GATACTCTTA 


TTGATGATCA 


6900 


CCAGAGCATC 


TTTTCAACAC 


CGCAGAAGCA 


ATTGATTTCA 


ATCGCTCCAA 


CCCTGATGAC 


6960 


ACATCCAGAA 


GTTCTCATTC 


TCGATGAAGC 


AACTTCAAAC 


GTAGATACGG 


TGACAGAAAG 


7020 


CAAGATTCAG 


CATGCCATGG 


AGGTGGTTGT 


AGCAGGTAGA 


ACTAGTTTCG 


TCATTGCCCA 


7080 


CC G CTTG AAA 


ACCATTCTCA 


ATGCAGATCA 


GATTATTGTC 


CTTAAAGATG 


GAGAAGTCAT 


7140 


TGAACGTGGT 


AACCACCATG 


AACTTTTGAA 


GCTAGGTGGC 


TTTTATTCAG 


AACTCTATCA 


7200 


CAATCAATTT 


CTTTTCGAAT 


AAGAAAGAAG 


TTGTCCTATC 


TGGCCAGCTT 


TTTCTTGTCC 


7260 


ATAAAAAATG 


TTTATCACAG 


CCTTAAAAAA 


AACATATTAG 


ACG AAAGTCA 


TTTTGAGTGA 


7320 


TATGATAGGA 


CTATCCTTAG 


CATTCGAAAG 


GAGAGGCATC 


ATGGCTAGAA 


CGGTTGTAGG 


7380 


AGTTGCTGCA 


AATCTATGTC 


CCGTAGACGC 


AGAAGGCAAA 


ATCATTCATT 


CATCTGTATC 


7440 
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TTCTAGATTC GCAGAGATCA TTCCTCAACT CCGTGGTCTC CCTTTAGTCA TTCCTGTTGG 7500 

TCATGACTCA GTTGTACGTG ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 7 560 

CCAAAATGTT CATCCTCAGT TTTATGGAGA GAAAAAGACC GTCGAGAGCG ATGATTACAA 7 620 

TCTGCTCCCT GACGAATTTG AATTGGCACT CTTGAAGGAA CCGCTTCGTC AGAATAAACC 7680 

AATTATGGCA ATCTGTCCCG GTGTCCAACT TGTCAATGTT GCCTTTGGTG GAACCCTCAA 7740 

TCAAGAAATC GAAGCTCAGG 7760 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 
(CI STRAND EDNESS : double 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



GAGGTTTTAA 


TTCACTTACC 


TCTsCCGTAT 


CTTTATTTAA 


AATGAATTCT 


TTTACGGTTG 


60 


TATTTCTTGC 


AAAATCTTTT 


ACAACAATCT 


TAATGTTTAG 


TGTCTTGTCT 


ATTATTTGTT 


120 


TAATATCATT 


AAATGATCTA 


TATTCTTTTC 


CATTTATATA 


AATATGTTGT 


TCTTGAATCT 


180 


CACCATCGAA 


TCCATTATTT 


CTTTTATCAT 


TGATCTTAAA 


CACTACAGAT 


TTTCCATCAG 


240 


CATATTCGAT 


ACTAGTATTT 


CCCTTAGGAT 


CAATGTTTAC 


TTCGGGTTTA 


ACATTATCAT 


300 


ATAAAAACTG 


ATAGTGGACT 


CCAACTCCTT 


TAGCATTCAA 


ATCGCTATAG 


CCAGTTTCAA 


360 


GATAAACATT 


TCCATCCATA 


TCTGTTACCT 


TATCTGGAAA 


TCCGTTTGCT 


TTATAGTCTT 


420 


TCATTCCCCA 


GTCCATGATG 


TCACCGTCTT 


TAACATTCAG 


CTTAATATTA 


AAATCTCTAG 


480 


TGTTATCAAT 


GTGTAAATCT 


CCGTAGATTA 


AATAATTATC 


TACAACCGAT 


TCATTAACTC 


540 


TCAATTCCCA 


CTTAAAACCA 


CCCTTATCAG 


AAATCTTACC 


TCTTAAATAA 


AATTCTGGAT 


600 


TTCCTACATA 


AATTTTATTA 


GATTTAGATG 


GATTAAAGTA 


GTTCTTATCC 


ATTGAAAGGT 


660 


TTACTGGTTT 


GGTATCAATA 


AATAACATGG 


AGCCATCTTC 


TTTTATAGCT 


TCTACATTGA 


720 


ACTTATCCTC 


TCCAGTGTAT 


TCTTTATCAT 


CCTTACCAAA 


TAATACAAGT 


TTAGAAGAAT 


780 


CTGTCACAAG 


ATTTCCGTCT 


TTATCGATAG 


CTTCCCCTTT 


ATCCTTCATT 


TTAAATGTAA 


840 


ACACTTGATA 


CCTTATAATG 


TTAAAGCCGT 


CCAAAGCCGA 


CATTAATACA 


GATTGGGTAC 


900 


TTCTTCCATC 


TTCAACATTT 


CTACTATCAG 


CATAAATTGT 


TGTTTCTGAA 


AGGGCTCTTA 


960 


GATTAGGATT 


GGCCTTTTGT 


ATTTTTGCTA 


TATCTTCCTT 


GCTATAGACT 


CCATTTCCTT 


1020 
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CTAACATATC 


CCTTTTTCCA 


GGATTATAGG 


TAGTCACTTT 


TAGTGCATAG 


CCTTTTCTTA 


1080 


GAATGATATT 


ATCCTTTAAC 


AGATATTGTT 


CTTTTTCTGA 


ATCAGAATAG 


ATTTTACCAG 


1140 


ATTCCATTTT 


AGTTAAATTG 


TCTGGTTTGT 


TTTTTGAAAG 


ATCTCCTTCC 


CCTAATTCTA 


1200 


TCACATTCCC 


ATAACTTGAT 


ACATAGGGAT 


ATTCTGATTT 


AGTTTCCTTA 


ATTTTTTCAG 


1260 


GCATTCTAAT 


TTTAATTTCA 


GCTTTTTTCT 


GATCATTATC 


TTTAACAAAT 


AATCTCATAT 


1320 


CTCCTGCAAA 


AGCTAATCCA 


TCCACAATAT 


CATTAATATT 


AGCCTATAGA 


TCAAATGTCA 


1380 


TCGTTTTTGA 


GTGGAAATCA 


TACTTCGTCC 


CTTTGATTTC 


TATAGATTTA 


TAGTTATTCC 


1440 


CATAATATAC 


CTTGGCATTT 


TTAGAAACAT 


TACTTATCTT 


TCCAAGAATT 


TCAAAGTGTC 


1500 


CATCTTTAGA 


CGGACTTAGA 


ACACCATAAA 


TTTTTGATTT 


GATTTCGTCA 


AGTTTCTCAG 


1560 


TTTCATATTC 


TAGATCAGTC 


CCATCATCGT 


AGGCTATTAT 


ATTTCCTTTA 


TCATCGTATT 


1620 


TATAATCGTA 


TTCCTCCATT 


CTCTTACCAG 


TTTCACTTGT 


AAAATCATCA 


ACTTCTCTAA 


1680 


ATTTCTTTTT 


AATGACTTTC 


TTTAAGTCTT 


TATTTTCAAA 


GTCTCTAATT 


GTTCAAATAT 


1740 


TTCTATCAAT 


AGTAAAACTA 


GATTTTTCTT 


TAATAGACTC 


TTCATTTTCT 


TGATGATGAT 


1800 


GTTCTACCCC 


AGTTGTATCT 


TTTTTTAGAC 


TACCCTCTTT 


TCCATTTCCT 


AAATTTTTAA 


1860 


ATTTAGATTC 


TGCAATCTCG 


ccaagctttt 


GATATTTAGA 


TGAATCTTGA 


TCAGGATCTA 


1920 


CTAGATAATA 


GGAAATCATC 


CCCTTTTCAT 


CAGCCTGATT 


AGCAAATTTA 


ATTCTATGAA 


1980 


TCTTTGTGAA 


ATTGCTAGAA 


CCATCTAATG 


CAATGACTTC 


AATGATTTTT 


CCCCTTAAAT 


2040 


CTCCCGCACC 


TTTAATTTCA 


TAAATGCTAT 


TTCCGTCTTT 


ATCAAGTTTT 


CTATTTCTTC 


2100 


CTTGACCCTC 


ACCTGCCTAA 


GTTACTTCAA 


GATTTTTTTC 


AACCTCTCCA 


TCTTCATTAA 


2160 


CAAGAGCGGC 


GCCAGCATAC 


CAAACTTCGT 


TCGCAATCTC 


CTCAAATTTT 


TCAGGATCTT 


2220 


CTTTTTGATC 


TCTCGCAAAT 


AGCGTTTCAT 


TCTTATACTC 


ATCTTTTACC 


TTATGATAAG 


2280 


TATCCTTTGT 


AATCAACTTA 


ATTTTTTCAG 


GATTTGAAAA 


ATCAACCGAA 


ACAATCTTAG 


2340 


GGGCGGTGTT 


ATCAATTTTT 


ACAGGAATAT 


AGGAAACCTG 


CCATGGGTAA 


TCTTTAGTTA 


2400 


ATCTATATTT 


AAATTTATAG 


AAATATTGAC 


CTTCCGCAAT 


CGGTTCAAAT 


TGACCTCTTA 


2460 


TCTTAGTAGC 


AGGATCTTGA 


TTATCCTTAC 


TTTCTGGTGC 


ATTTTCTTCT 


CTACCTCTAG 


2520 


GATTATAGAT 


GAGTCCATCC 


CACTTCAAGT 


CACCCCAAAC 


TTTTACTTTA 


GATGATTTGA 


2580 


TTCCCTTTGC 


ATCATTGCTT 


TTAGAATTTA 


AAATTCCTCT 


AATAAACTCT 


TCTCTCGAAA 


2640 


TCACTTTTAA 


GTCTCTTTGA 


TTTTCTCCCT 


CTTTATTTGT 


ATTTACTATT 


GAAATCAATC 


2700 


CTTCTTCTGC 


ACTTCTTAAT 


ACA 








2723 


(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11831 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



4 



AAAAAAGTGG 


GAATGACTCA 


AATCTTCACT 


GAAGCTGGCG 


AATTGATCCC 


TGTAACAGTT 


60 


ATTGAAGCAA 


CTCCAAACGT 


TGTTCTTCAA 


CTTAAAACTG 


TTCAAACAGA 


CGGATACAAC 


120 


GCTATCCAAG 


TTGGTTTCGA 


TCACAAACGC 


GAAGTATTGA 


GCAACAAACC 


TGCTAAAGGA 


180 


CATGTAGCGA 


AAGCTAACAC 


GGCTCCTAAG 


CGCTTCATTC 


CTGAATTCAA 


AAACGTTGAA 


240 


GGCTTGGAAG 


TTGGTGCTGA 


AATTACAGTT 


GAAACATTCG 


CAGCTGGAGA 


CCTTGTTGAC 


300 


CTAACGGGTA 


CTTCTAAAGG 


TAAACGTTTC 


CAAGCTGTTA 


TCAAACGCCA 


CGGACAATCA 


360 


CGTGGACCAA 


TGGCTCACGG 


TTLTOGTTAC 


CACCGTCGTC 


CAGGTTCTAT 


GGGGCCTGTT 


420 


GCACCTAACC 


GCGTATTCAA 


AGGTAAAAAC 


CTTGCAGGAC 


GTATGGGTCG 


CGACCGCGTA 


480 


ACAATTCAAA 


ACCTTCAAGT 


TGTACAAGTT 


GTTCCAGAAA 


AGAACGTTAT 


CCTTATCAAA 


540 


GGTAACCTAC 


CACCTCCTAA 


GAAATCTCTT 


ATCACTATCA 


AATCAGCAGT 


TAAAGCTGGT 


600 


AAATAATAAA 


GAAAGGGGAA 


ATCAGTCACA 


ATGCCAAACG 


TAACATTATT 


TGACCAAACT 


660 


GGTAAAGAAG 


CTGGCCAAGT 


TGTTCTTAGC 


GATGCACTAT 


TTGGT ATCG A 


ACCAAATGAA 


720 


TCAGTTGTGT 


TTGATGTAAT 


CATCAGCCAA 


CGCCCAAGCC 


TTCGTCAAGG 


AACACACGCT 


780 


CTTAAAAACC 


GCTCTGCAGT 


ATCAGGTGGT 


GGACGCAAAC 


CATGGCGTCA 


AAAAGGAACT 


840 


GGACGTGCTC 


CTCAACGT7C 


TATCCCCTCA 


CCACAATGGC 


CTGGTCGTGC 


TCTTGTCTTC 


900 


CGACCAACTC 


CACGTTCATA 


CGGCTACAAA 


CTTCCACAAA 


AAGTTCGTCG 


CCTACCTCTT 


960 


AAATCAGTTT 


ACTCTCAAAA 


AGTTGCTGAA 


AACAAATTCG 


TAGCTGTAGA 


CGCTCTTTCA 


1020 


TTTACAGCTC 


CAAAAACTGC 


TGAATTTGCA 


AAACTTCTTG 


CAGCATTCAG 


CATCGATTCT 


1080 


AAAGTTCTTC 


TTATCCTTGA 


AGAAGGAAAT 


CAATTCCCAG 


CTCTTTCAGC 


TCGTAACCTT 


1140 


CCAAACGTGA AAGTTGCAAC TGCTACAACT 


GCAAGTGTTC 


TTGACATCGC 


AAATAGCGAC 


1200 


AAACTTCTTG 


TCACACAAGC 


AGCTATCTCT 


AAAATCGAGG 


AGGTTCTTGC 


ATAATGAATT 


1260 


TGTATGATGT 


TATCAAAAAA 


CCTGTCATCA 


CTGAAAGCTC 


AATGGCTCAA 


CTTGAAGCAG 


1320 


GAAAATATGT 


ATTTGAACTT 


GACACTCGTG 


CACACAAACT 


TTTGATCAAG 


CAAGCTGTTG 


1380 


AAGCTGCTTT 


CGAAGGTGTT 


AAAGTTGCCA 


ATGTTAACAC 


AATCAACGTA 


AAACCAAAAG 


1440 
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CTAAACGTGT 


TGGACGTTAC 


ACTGGTTTTA 


CTAACAAAAC 


TAAAAAAGCT 


ATCATCACAC 


1500 


TTACAGCTCA 


TTCTAAACCA 


ATCGAGTTGT 


TTGCTGCTGA 


AGCTGAATAA 


TCTAAGGAGG 


1560 


AAATATCGTG 


GGAATTCGTG 


TTTATAAACC 


AACAACAAAC 


GGTCGCCGTA 


ATATGACTTC 


1620 


TTTGGATTTC 


GCTGAAATCA 


CAACAAGCAC 


TCCTGAAAAA 


TCATTGCTTG 


TTGCATTGAA 


1680 


GAGCAAGGCT 


GGTCGTAACA 


ACAACGGTCG 


TATCACAGTT 


CGTCACCAAG 


GTGGTGGACA 


1740 


CAAACGTTTC 


TACCGTTTGG 


TTGACTTCAA 


ACCTAATAAA 


GACAACGTTC 


AAGCAGTTGT 


1800 


TAAAACAATC 


GAGTACGATC 


CAAACCGTTC 


TGCAAACATC 


GCTCTTGTAC 


ACTACACTGA 


1860 


CCCTGTGAAA 


GCATACATCA 


TCGCTCCAAA 


AGCTCTTGAA 


GTACGTCAAC 


GTATCGTTTC 


1920 


AGGTCCAGAA 


GCAGATATCA 


AAGTCGGAAA 


CGCTCTTCCA 


CTTGCTAACA 


TCCCAGTTGC 


1980 


TACTTTGATT 


CACAACATCG 


AGTTGAAACC 


AGGTCGTGGT 


GGTGAATTGG 


TACGTGCTGC 


2040 


TGGTGCATCT 


GCTCAAGTAT 


TGGGTTCTGA 


AGGTAAATAT 


GTTCTTGTTC 


GTCTTCAATC 


2100 


AGGTGAAGTT 


CGTATGATTC 


TTGGAACTTG 


CCCTGCTACA 


GTTGGTGTTG 


TCGGAAACGA 


2160 


ACAACATGGA 


CTTGTAAACC 


TTGGTAAAGC 


AGGACCTAGC 


CGTTGGAAAG 


GTATCCCCCC 


2220 


AACAGTTCGT 


CGTTCTGTAA 


TGAACCCTAA 


CGATCACCCA 


CACGGTCCTG 


CTGAAGGTAA 


2280 


AGCACCAGTT 


GCTCGTAAAG 


CACCATCTAC 


TCCATGGGCC 


AAACCTGCTC 


TTGGTCTTAA 


2340 


AACTCCTAAC 


AAGAAAGCGA 


AATCTCACAA 


ACTTATCGTT 


CGTCCTCCCA 


ACGAGAAATA 


2400 


ATATTAAACT 


AGTCGCTTAA 


GCAACTAGTA 


AATCCGCCAG 


CTCGGTACCG 


CTCCATAGGA 


2460 


GTGCAAGCCG 


CTGTGCTACA 


ACATTTAAAG 


GAGAAAATAT 


AAAAATGGGA 


CGCAGTCTTA 


2520 


AAAAAGGACC 


TTTCGTCGAT 


GAGCATTTGA 


TGAAAAAACT 


TGAAGCTCAA 


CCTAACGACG 


2580 


AAAAGAAAAA 


AGTTATTAAA 


ACTTGGTCAC 


GTCCTTCAAC 


GATCTTCCCA 


ACTTTCATTG 


2640 


GTTACACTAT 


TGCAGTTTAT 


GACGGACGTA 


AACACGTACC 


TGTTTACATC 


CAAGAAGACA 


2700 


TGGTAGGCCA 


CAAACTTGGT 


GAATTTGCAC 


CAACTCGTAC 


TTACAAAGGT 


CACGCTGCAG 


2760 


ACGACAAGAA 


AACACCTAGA 


AAATAAGGAG 


AACATAAATG 


GCAGAAATTA 


CTTCAGCTAA 


2820 


AGCAATGGCT 


CGTACAGTAC 


GTGTTTCACC 


TCGTAAATCA 


CGTCTTGTTC 


TTGATAACAT 


2880 


CCGTGGTAAA 


AGCGTAGCCG 


ATGCAATCGC 


AATCTTGACA 


TTCACTCCAA 


ACAAAGCTGC 


2940 


TGAAATCATC 


TTGAAAGTTT 


TGAACTCAGC 


TGTAGCTAAC 


GCTGAAAACA 


ACTTTGGTTT 


3000 


GGATAAAGCT 


AACTTCCTAG 


TATCTGAAGC 


ATTCGCAAAC 


GAAGGACCAA 


CTATGAAACG 


3060 


TTTCCGTCCA 


CGTGCGAAAG 


GTTCAGCTTC 


ACCAATCAAC 


AAACCTACAG 


CTCACATCAC 


3120 


TGTAGCTGTT 


GCAGAAAAAT 


AAGGAGGTAA 


AATCGTGGGT 


CAAAAAGTAC 


ATCCAATTGG 


3180 


TATGCGTGTC 


GGCATCATCC 


GTGATTGGGA 


TGCCAAATGG 


TATGCTGAAA 


AAGAATACGC 


3240 
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GGATTACCTT 


CATGAAGATC 


TTCCAATCCG 


TAAATTCGTT 


CAAAAAGAAC 


TTGCTGACGC 


3300 


AGCAGTTTCA 


ACTATTGAAA 


TCGAACGCGC 


AGTAAACAAA 


GTTAACGTTT 


CACTTCACAC 


3360 


TCCTAAACCA 


GGTATGGTTA 


TCGGTAAAGG 


TGCTGCTAAC 


GTTGATGCaC 


TCCCTGCAAA 


3420 


ACTTAACAAA 


TTGACTGGAA 


AACAAGTACA 


CATCAACATC 


ATCGAAATCA 


AACAACCTGA 


34BO 


TTTGGATGCT 


CACCTTGTAG 


GTGAAGGAAT 


TCCTCGTCAA 


TTGGAGCAAC 


GTGTTGCTTT 


3540 


CCGTCGTCCA 


CAAAAACAAG 


CAATCCAACG 


TGCAATGCGT 


GCTGGAGCTA 


AAGGAATCAA 


3600 


AACTCAAGTA 


TCAGGTCGTT 


TGAACGGTGC 


AGATATCGCC 


CGTGCTGAAG 


GAT ACT CTGA 


3660 


ACGAACTGTT 


CCGCTTCACA 


CACTTCGTGC 


AGATATCGAT 


TACGCTTGGG 


AAGAAGCAGA 


3720 


TACTACATAC 


GGTAAACTTG 


GTGTTAAAGT 


ATGGATCTAC 


CGTGGTGAAG 


TTCTTCCAGC 


3780 




AP/FAAAGGAG 


GTAAATAACC 


AATGTTAGTA 


CCTAAACGTG 


TTAAACACCG 


3840 


TCCTCACTTr 

A ^ W A i i V» 


CGTCGAAAAA 


TGCGCGGTGA 


AGCAAAAGGT 


GGAAAAGAAG 


TAGCATTCCC 


3900 


A Vj/w\ A nwuu A 


rTTrAAGCTA 


CAACTAGCCA 


CTGGATCACT 


AACCGCCAAA 


TCGAAGCTGC 


3960 




ATfiAPTCGTT 


ACATGAAACG 


TGGTGGTAAA 


GTTTGGATTA 


AAATCTTCCC 


4020 


ACACAAATCA 


TACACTGCTA 


AAGCTATCGG 


TGTGCCTATG 


GGATCTGGTA 


AAGGCGCACC 


4080 


TG AACCTTCC 


GTAGCACCAG 


TTAAACGTGG 


TAAACTGATG 


TTCGAAATCG 


CTGCTCTATC 


4140 


TCAACAGATT 


GPJVCGTGAAG 


CGCTTCGACT 


TGCTAGCCAC 


AAATTGCCAG 


TTAAATGTAA 


4200 


ATTCGTAAAA 


CGTGAAGCAG 


AATAAGGAGA 


AGGCATGAAA 


CTTAATCAAG 


TAAAAGAATT 


4260 


TGTTAAAGAA 


CTTCGTGGTC 


TTTCTCAAGA 


AGAACTCGCG 


AAGCGCCAAA 


ACGAATTGAA 


4320 


AAAAGAATTG 


TTTGAACTTC 


GTTTCCAAGC 


TGCTACTGCT 


CAATTGGAAC 


AAACAGCTCG 


4380 


CTTGAAAGAA 


GTTAAAAAAC 


AAATCGCTCG 


CATCAAAACA 


GTTCAATCTG 


AAGCGAAATA 


4440 


ATAGACTAGG 


GAAGGAGAAA 


TTTCAATGGA 


ACGCAATAAT 


CGTAAAGTTC 


TTCTTGCACG 


4500 


TGTTGTATCT 


GACAAAATGG 


ACAAGACAAT 


CACAGTTGTA 


GTTGAAACAA 


AACGTAACCA 


4560 


CCCACTCTAT 


GGTAAACCTA 


TTAACTACTC 


TAAAAAATAC 


AAAGCTCATG 


ATGAAAACAA 


4620 


TGTTGCCAAA 


GAAGGCGATA 


TCGTACGTAT 


CATGGAAACT 


CGCCCCCTTT 


CAGCTACAAA 


4680 


ACCTTTCCGT 


CTTGTAGAAG 


TTCTTCAAGA 


AGCGGTCATC 


ATCTAATCAA 


ACCTGAAAGG 


4740 


AGAAAACTGA 


AATGATTCAA 


ACACAAACTC 


GTTTGAAAGT 


CGCAGACAAC 


AGCGGTGCTC 


4800 


GCGAAATCTT 


GACTATCAAA 


GTTCTTGGTG 


GTTCAGGACG 


T AAATTTG C A 


AACATCCCTC 


4860 


ATGTTATCGT 


GGCATCTGTA 


AAACAAGCTA 


CTCCTGCTCG 


TGCGGTTAAA 


AAAGGTGACG 


4920 


TTGTTAAAGC 


AGTTATCGTT 


CCTACTAAAT 


CAGGTGCTCG 


TCGTGCTGAT 


GGTTCATACA 


4980 
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TCAAATTTGA 


CGAAAACGCA 


CCAGTTATCA TCCGTCAACA 


CAAAACTCCT 


CGCGGAACAC 


5040 


GTATCTTTGG 


CCCAGTTGCA 


CGTGAATTGC GTGAAGCTGG 


CTTCATGAAG 


ATCGTGTCAC 


5100 


TTCCTCCAGA 


AGTACTTTAA 


TTTTTAGGAA 


CAAACTAGTC 


CCCTAGCTTC 


AAGCT AGGGT 


S160 


CCCCTTATGG 


GCGTAAGAAA 


AATCAAGGAG 


AAACCTAATG 


TTTGTAAAAA 


AAGGCGACAA 


S220 


ACTTCGCCTA 


ATCGCTGGTA 


AAGATAAGGG 


AACAGAAGCT 


GTTCTCCTTA 


CTGCCCTTCC 


S280 


AAAACTAAAC 


AAAGTTATCG 


TTGAACCTCT 


TAACATTCTT 


AAGAAACACC 


AACGTCCAAC 


5340 


TAACGAGCTT 


CCTCAAGGTG 


GTATCATCGA 


GAAAGAAGCA 


GCTATCCACG 


TATCAAACGT 


S400 


TCAAGTTTTG 


GACAAAAATG 


GTGTAGCTGG 


TCCTGTTGCA 


TACAAATTTG 


TAGACGCTAA 


S460 


AAAACTTCGC 


TACAACAAAA 


AATCAGGCGA 


AGTGCTTGAT 


TAATCACGAA 


GCAAAGGAGA 


S520 


AGTATAATCG 


CAAATCGTTT 


AAAAGAAAAA 


TATCTTAATG 


AAGTAGTTCC 


TGCTTTGACA 


S580 


CAACAATTCA 


ACTACTCATC 


ACTGATCCCT 


GTGCCTAAAG 


TAGATAAGAT 


TGTTTTGAAC 


S640 


ATGGCTCTTG 


CTGAAGCTGT 


ATCAAACGCT 


AAAAGCCTTG 


AAAAAGCTGC 


TGAAGAATTG 


5700 


GCACTTATCT 


CAGGTCAAAA 


ACCACTTATC 


ACTAAAGCTA 


AAAAATCAAT 


CCCCGGCTTC 


5760 


CCTCTTCGTG 


AAGGTGTTGC 


GATCGGTGCA 


AAACTTACCC 


TTCCTGGTCA 


ACCTATGTAC 


5820 


GAATTCTTGG 


ATAAATTGGT 


ATCAGTTTCA 


CTTCCACGTC 


TACCTGACTT 


CCACGGTGTC 


5860 


CCAACAAAAT 


CATTTGATGG 


ACGCGGGAAC 


TACACACTTG 


CTGTGAAACA 


ACAATTAATC 


5940 


TTCCCAGAAA 


TCAACTTCCA 


TGACGTTGAC 


AAAACTCGTG 


GTCTTGACAT 


CCTTATCGTA 


6000 


ACAACTGCTA 


ACACTGACGA 


AGACTCACGT 


GCATTGCTTA 


CAGGCCTTGG 


AATGCCTTTT 


6060 


GCAAAATAAT 


ATACGAGGTA 


AATCTAATCG 


CTAAAAAATC 


AATGGTACCT 


AGAGAGGCTA 


6120 


AACGCCAAAA 


AATTGTTGAC 


CGTTATCCTG 


AAAAACCTCC 


TGCATTAAAG 


GCGGCAGGGG 


6180 


ACTACGAAGG 


TTTATCTAAA 


TTACCTCGCA 


ACGCCTCACC 


CACTOGTTTA 


CATAATCCTT 


6240 


GTACCCTTAC 


GGGGCGCCCA 


CATTCAGTTT 


ACCGCAAATT 


TGGTCTGAGT 


CCTATCGCTT 


6300 


TTCCCGAACT 


TGCGCATAAA 


CGTCAAATTC 


CTCCTCTAAC 


AAAAGCATCT 


TGGTAATTTA 


6360 


AGATATCAAG 


AGCGTCAAAA 


CTCCAAGTAA 


AAATAGGAAA 


CTTGACGAAG 


AAACTAAAGT 


6420 


TTCTAGGAAA 


GTTTATCTTT 


TTC AC AC AG A 


GTTTACCCCG 


GGTTCAATTG 


GGCTTGCCAA 


6480 


TTTGAACACG 


AGCTACAGCT 


TTGGCAAAAA 


AGACCAATTT 


GCTTTGGAGC 


ATTCCTTCTG 


6540 


CATTAAATTG 


TCTATTTTTG 


CTCCTGCTGT 


TACGCTCTTT 


GTATCATGTA 


TTAACTAGCA 


6600 


AGTGCAACTT 


GCAAACTACT 


AGTAAGAGGA 


GAAAAACAAA 


ATGGTTATGA 


CTCACCCAAT 


6660 


CGCAGACTTC 


CTAACTCGTA 


TTCGTAATGC 


T AACC AAGCT 


AAACACGAAG 


TACTTGAAGT 


6720 


ACCTGCATCA 


AACATCAAAA 


AAGGGATTGC 


TGAAATCCTT 


AAACGCCAAG 


GTTTTGTAAA 


6780 
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AAACGTTCAA 


ATCATTGAAG 


ATGACAAACA 


AGGCGTCATC 


CGTCTATTTC 


TTAAATACGG 


6840 


ACCAAATGCT 


GAGAAAGTTA 


TCACTAACTT 


GAAACGTGTT 


TCTAAACCAG 


GACTTCGTGT 


£ Q Art 


CTACAAAAAA 


CGTGAAGACC 


TTCCAAAAGT 


TCTTAACGGA 


CTTGGAATTG CCATCCTTTC 


6960 


AACTTCTCAA 


GCTTTGCTTA 


CTGATAAAGA 


AGCACGCCAA 


AAGAATGTTG 


GTGGTGAGGT 


7020 


TATCGCTTAC 


GTTTGGTAAA 


ATCAAGATAC 


AAACCTCCTA 


AAGAACAAAG 


CAAAATTAGG 


7080 


AAGTTGGAGA 


AGTTTGTTTA 


CAAACAAGCC 


AACTTATCTA 


TTTTGCACAG 


TTCTTAGAGC 


7140 


GTGTTCAGTT 


CAGCTCTTGA 


ACTAAATAAG 


TATCTGAACC 


rcCTGAAAAC 


TGGCCGTTCT 


7200 


CGCCTGACAA 


TTTAACAGCA 


GAAAATAAAC 


ATGTCACGTA 


TTCGTAATAA 
i 4 ww * • 


AGTTATCGTG 


7260 


TTGCCTGCTG 


GTCTTGAACT 


CGCTAACAAT 


GACAACGTTG 


TJVACTftTAAA 


AGGATCTAAA 


7320 


GGAGAACTTA 


CTCGTGAGTT 


CTCAAAAGAT 


ATTGAAATCC 


Olul wwa*W\ww 


TACTGAAATA 


7380 


ACTCTTCACC 


GTCCAAACGA 


TTCAAAACAA 


ATGAAAACTA 




TACTCtSTCSPf* 

A 1WU 4. XJIw^i 


7440 


CTTTTGAACA 


ACATGCTTGT 


TCGTGTATCA 


GAAGGATTCA 




4 w/WA 1 W\_W 1 


7S00 


GGGGTTGGTT 


ACCGTGCACA 


GCTTCAAGGA 


TCTAAACTTG 


111 1 WWV. 1 v l 


4 WW 4 rWW* 4 W 4 


7560 


CATCCAGACG 


AAGTTCAAGC 


TCCAGAAGGA 


ATTACTTTTG 




CCCAACAACA 


7620 


AT CCTTGTT A 


GCGGAATTTC 


AAAAGAAGTA 


GTTGGTCAAA 




^ w i w a r\WV_ 


7680 


CTTCGTTCAC 


CAGAACCATA 


TAAAGGTAAA 


GGTATCCGTT 


t\\* W 4 a WW 4 WA 


» in* rCTT^flf^ 
n 4 * WW 4 4 www 


7740 


CCTAAAGAAG 


GTAAAACAGG 


TAAATAATCT 


TGACTGGTTC 


ATCATCAACC 


ACCAACCTAT 


7800 


TTTCCAACTT 


TGTGCATAGC 


ACACGATTTA 


AAACTAAAGA 


GGTGAAAACT 


GTGATTTCAA 


7860 


AACCAGATAA 


AAACAAACTC 


CGCCAAAAAC 


GCCACCGTCG 


CCTTCCCCGA 


AAACTCTCTG 


7920 


GAACTCCTCA 


TCGCCCACGT 


TTGAACGTAT 


TCCGTTCTAA 


TACAGGCATC 


TACGCTCAAG 


7980 


TGATTGATCA 


CGTAGCGGGT 


GTAACGCTCG 


CAAGTGCTTC 


AACTCTTGAT 


AAAGAAGTTT 


8040 


CAAAAGGAAC 


TAAAACTGAA 


CAAGCCGTTG 


CTGTCGGTAA 


ACTCGTTCCA 


G AACGTGCA A 


8100 


ACGCTAAAGG 


TATTTCAGAA 


CTGGTGTTCG 


ACCGCGGTGG 


ATATCTATAT 


CACGCACGTG 


8160 


TGAAAGCTTT 


GGCTGATGCA 


GCTCGTGAAA 


ACGGATTGAA 


ATTCTAATAG 


GAGGACACTA 


8220 


GAAAATGGCA 


TTTAAAGACA 


ATGCAGTTGA 


ATTACAAGAA 


CGCGTAGTTG 


CTGTCAACCC 


8280 


TGTTACAAAA 


GTTGTTAAAG 


GTGGACGTCG 


TCTTCGTTTC 


GCACCTCTTG 


TTCTTCTTGG 


8340 


TGACCACAAT 


GGTCGCGTAG 


GATTTGCTAC 


TGGTAAAGCT 


CAAGAACTTC 


C AG AAGCAAT 


8400 


CCGTAAAGCA 


GTAGATGATG 


CTAAGAAAAA 


CTTGATCGAA 


GTTCCTATGG 


TTGGAACAAC 


8460 


AATCCCACAC 


GAAGTTCTTT 


CAGAATTCGG 


TGGAGCTAAA 


GTATTCTTGA 


AACCTGCTGT 


8520 
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AGAACGTTCT 


GGAGTTGCCG 


CTGGTCGTGC AGTTCGTCCC 


GTTGTGGAAT 


TGGCAGGTGT 


8SBO 


CGCAGATATT 


ACATCTAAAT 


CACTTCGTTC 


TAACACTCCA 


ATC/CACATTG 


TTCGTGCAAC 


8640 


TGTTCAAGGT 


TTGAAACAAT 


TGAAACGCGC 


TGAAGAAATT 


CCTGCCCTTC 


GTGGTATTTC 


8700 


AGTTTCTGAT 


TTCGCATAAG 


AAAGGGGATA 


AAATGGCTCA 


AATTAAAATT 


ACTTTGACTA 


8760 


AGTCTCCAAT 


CGGACCCATT 


CCATCACAAC 


GTAAAACTGT 


TGTAGCACTT 


CGACTTGGCA 


8820 


AATTGAACAC 


CTCTGTTATT 


AAACAAGATA 


ACGCTGCTAT 


CCGTCGTATG 


ATCACAGCAG 


8880 


TATCTCACTT 


AGTAACAGTT 


GAAGAAGTAA 


ACT AATG Aa G 


TTTTAGGGGA 


TCTCCACTGT 


8940 


ACCATCCCCT 


AAAACTACAT 


ATAGTCATCT 


ATGATCACAT 


CGTATAGGCG 


AGTTGATGGG 


9000 


GGACACAACC 


TTTTCTCCCT 


TATCGGCGCT 


AGCATTTTAC 


AAAAGAGGAG 


AAAATAAAAA 


9060 


TGAAACTTCA 


TGAATTGAAA 


CCTGCACAAG 


GTTCTCGTAA 


AGTACGTAAC 


CGCGTTGGTC 


9120 


GTGGTACTTC 


ATCAGGTAAC 


GGTAAAACAT 


CTGGTCGTGG 


TCAAAAAGGT 


CAAAAAGCTC 


9180 


GTAGCGCTGG 


CGGAGTTCGC 


CTTGGTTTTG 


AAGGTGCACA 


AACTCCATTG 


TTCCGTCGTC 


9240 


TTCCAAAACG 


TGGATTCACT 


AACATCAACG 


CTAAAGAATA 


CGCAATTGTG 


AACCTTGACC 


9300 


AATTGAACCT 


CTTTGAAGAT 


GGTGCTGAAG 


TAACTCCAGT 


TGTTCTTATC 


GAAGCAGCAA 


9360 


TTCTTAAAGC 


TGAAAAGTCA 


GGTATTAAAA 


TTCTTCCTAA 


CGGTGAGTTC 


ACTAAGAAAT 


9420 


TCACTGTGAA 


ACCACCTAAA 


TTCTCTAAAT 


CAGCTGAACA 


AGCT ATCACT 


GCTAAAGGTG 


9480 


GTTCAGTAGA 


AGTCATCTAA 


GAGAGGTGAC 


CTATGTTTTT 


TAAATTATTA 


AGAGAAGCTC 


9S40 


TTAAAGTCAA 


GCAGCTTCGA 


TCAAAAATTT 


TATTTACAAT 


TTTTATCGTT 


TTGGTCTTTC 


9600 


GTATCGGAAC 


TACCATTACA 


GTTCCTGCTG 


TGAATCCCAA 


TAGCTTGAAT 


GCTTTAAGTC 


9660 


GATTATCCTT 


CTTAAACATG 


TTGAGCTTGG 


TGTCGGGGAA 


TCCCCTAAAA 


AACTTTTCGA 


9720 


TTTTTGCCCT 


AGCACTTAGT 


CCCTATATCA 


CCGCTTCTAT 


TGTTGTCCAA 


CTCTTGCAAA 


9780 


TCGATATTTT 


ACCCAAGTTT 


GTAGAGTGGG 


GTAAACAAGG 


GGAAGTAGGT 


CGAACAAAAT 


9840 


TGAATCAAGC 


TACTCGTTAT 


ATTGCTCTAG 


TTCTCGCTTT 


TGTGCAATCT 


ATCGGGATTA 


9900 


CAGCTGCTTT 


TAATACCTTG 


GCTGGAGCTC 


AATTGATTAA 


AACTGCTTTA 


ACTCCACAAG 


9960 


TTTTTCTGAC 


GATTGGTATC 


ATCTTAACAG 


CTGGTACTAT 


GATTGTCACT 


TGGTTGGCTG 


10020 


AGCAAATTAC 


AGATAAGGGA 


TACGGAAACG 


GTGTTTCCAT 


CATTATCTTT 


GCCGGGATTG 


10080 


TTT CCTCAAT 


TCCAGAGATG 


ATTCAGGGCA 


TCTATGTGGA 


CTACTTICTG 


AACGTCCCAA 


10140 


GTAGCCGTAT 


CACTTCATCT 


ATCATTTTCG 


TAATCATTTT 


G ATT ATT ACT 


CTATTGTTGA 


102 00 


TTATTTACTT 


TACAACTTAT 


GTTCAACAAG 


CAGAATACAA 


AATTCCAATC 


CAATATACTA 


10260 


AGGTTGCACA 


AGGTGCTCCA 


TCTAGCTCTT 


ACCTTCCGTT 


AAAAGTAAAC 


CCTGCTGGAG 


10320 
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TTATCCCTGT TATCTTTGCC AGTTCGATTA CTGCAGCcTG CCGCTATTCT TCAGTTTTTG 10380 

ACTGCCACAC CTCATGATTG GGCTTGGGTA AGGGTACCAC AAGACATGTT GGCAACTACT 10440 

TCTCCAACTG GTATTGCCAT GTATGCTTTG TTGATTATTC TCTTTACATT CTTCTATACG 10500 

TTTGTACAGA TTAATCCTGA AAAAGCAGCA GAGAJcCCTAC AAAAGAGTGG TGCCTATATC 10560 

CATGGAGTTC GTCCTGGTAA AGGTACAGAA GAATATATGT CTAAACTTCT TCGTCGTCTT 10620 

GCAACTGTTG CTTCCCTCTT CCTTGGTGTG ATTTCCATTT TACCGATTGC ACCTAAAGAT 10680 

GTATTTCGTC TTTCTGATCT TGTTGCCTTT GCTGGAACAA GTCTCTTGAT CATTATCTCT 10740 

ACAGGTATCG AAGGAATCAA GCAATTGGAA CGTTACCTAT TGAAACGTAA GTATCTTGGT 10800 

TTCATCGACA GAACAGAATA AAAGTATTTA CTGAATCACT AAATACTGAC GGAGTGGAGG 10660 

TTTAAACTCT GACATTTGTA AGACTTGGAT CTCCCCTCTT CTATTTTGTT TTTAAATCGG 10920 

GGTCAAAAGA CTTTTTGCTT CTATTTAAAA ATAAAATAAG GACATCAAAT CATGAATCTT 10980 

TTGATTATGG G CTT ACCTGG TCCAGGTAAG GGAACTCAAG CAGCAAAAAT CCTAGAACAA 11040 

TTCCATGTTG CACATATCTC AACAGGTGAT ATGTTCCGCC CTGCAATCGC AAATCAAACT 11100 

GAAATGGCTG TTCTTGCTAA GTCATATATT GACAAGGGTG AATTGGTTCC TCACGAACTT 11160 

ACAAATGGAA TCCTAAAACA ACGCCTTTCA CAACATCATA TTAAAGAAAC AGGATTCTTA 11220 

TTGGATGGTT ACCCACGTAC AATTGAACAA GCTCATGCCT TCCACAAAAC ATTGGCTGAA 11280 

CTTGGCATTG AACTAGAACG TGTTATCAAT ATTGAAGTGA ACCCTGACAG CCTTTTGGAA 11340 

CGTTTGACTG GGCGTATCAT CCACCGCGTA ACTGCACAAA CTTTCCACAA GGTCTTTAAC 114O0 

CCACCAGTTG ACTATAAAGA AGAAGATTAC TACCAACGTG AACATGATAA GCCTGAGACA 11460 

CTAAAACGTC GTTTCCATCT TAATATTGCT CAACGAGAAC CAATCATTCC 7CAC7ACCCT 11520 

CCCAAAGGTT TGGTTCATGA CATCGAAGGT AATCAAGATA TCAATCATCT CTTCTCAGAT 11580 

ATTGAAAAAG TATTGACAAA TTTGAAATAA AGCGTTTTTC ACACTTGCAA AAATCCGCTA 11640 

CAAATGTTAT ACTGAGATAG TCTGACTTAT AATTGTTGTC T CTGTGTCT A GAGGCATCGA 11700 

ATCGAAATTT ATGGAGGTGC TTTTGCGTGG CAAAACACGA TGTGATTGAA GTTGAAGGCA 11760 

AAGTACTTGA TACAATGCCG AATGCAATGT TTACGGTTGA ACTTGAAAAT GGACATCAGA 11820 

TTTTAGCAGG G 11831 
(2) I NFORMATI ON FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CCCGGCATTT GAAACCTATT CGTGAACGAT TTATGATGCC AATGCCTTTG ATTTTAGTCG 60 

GCTCTTTATT TCTTATTCTA ATCAGTTCGC CTCAAGAGGC TTTTACAAAT TGGCTGAATA 120 

GTCTTGCATT GCTAAGTATC TTCACAACTA TGAATCAGTC AACAGTAGCG ATTATCTCCT 180 

TGGTCGCTTG TTTCGGTATT GCCTACAGCT TGTCGGAAGG ATATGCTACA GATGGTCCGT 240 

CGGCAGGGAT CATAGCCTTA TCCAGTTTTG TATTGATGGC ACCTCGTTTT TCGAGTATGG 300 

TTTATGATAA AAATGGGGAG CACGTCAAGC AGTTATTTCC CCGCGCAATA CCATTTTCTA 360 

GCCTGAATGC ATCTTCTTTG TTTATGGCGA TTACTATTGG ATTGGTTACA GCAGACATTT 420 

ATCGTATGTT TATCCAGCGC GGAATTACGA TAAAAATCCC AAGTGGTGTC CCAGATGTAC 480 

TAAGTAAATC ATTTTCAGCT CTTTTATCTC GTTTTACTAC TTTTCTTTTG TGCGCTTTGG 540 

TCTTAAAACG TCTTGAACCG GCAGGAGTTG CACCACGTCT CAACCGACTC CTAGGTGCAA 600 

TTGTTGGAAC ACCCCTTAAG TTAATTGCAG GAACGCTTCC AGGTATGATT CTATGTGTTA 660 

TTGTAAACTC ATTCTTTTGG TTCTGTGGAG TTAATGGGGG ACAAGTTTTA AATGCTTTTG 720 

TAGACCCAGT TTGCTTACAA TTTACTACAG AAAACCAACA AGCTCTCCCT GCAGGACAAA 7 80 

CACTCCAACA CATTATTACA TTACCCTTTA AAGATTTATT TCTATTTATT GCTGGCGCTC 84 0 

GAG CC ACT AT TGGTCTTGCG ATTTGTCTCT TCCTATTTAG TAAGACTCCT GCGAATAAAA 900 

CATTAGGTAA GCTAGCTATT ATACCCTCTA TTTTTAATAT CAATACAGCT ATTCTATTTA 960 

CCTTTCCAAC AGTTTTAAAT CCGATTATGC TGATTCCCTT TATTGCTACT CCTACAATCA 1023 

ATCC CTTG AT TACCTATGTA TCAATCCCTG TAGGATTAGT ACCCTATACA ACAGGTGTAA 1080 

TCCTTCCGTG GACAATGCCA CCCATTATAG GAGGCTTCCT TGCAACAGGG GCTAGTTGGC 1140 

GAGGAGCTCT ATTACAACTT G TTTTGATTT TGCTTTCTGT AGCAATTTAT TATCCATTCT 1200 

TCAAAATTCC AGATAAACGC AATCTTGAAA AAGAAAAAGC TACTGTTGGA GCGAAATAAG 1260 

ATGGTTATCA GAGTATTTGA TC AAC AG AAA AATACTTATT CTAGCTTTGC CTTAGAGGAA 1320 

TTAAGTTACT ATATGAATCG GGTCTTTAAG ACTAACATAG AGCTTGTCGA GGAGAAGGAA 1380 

G CGGAT ATTT TTGTAGGATT AGTCAATAAA GAGGACAGAA AACACCATGT TCTTATCTCA 1440 

TTAGACAAGG CTAAGGGGAG AATTGAGTCT AATACAATTG TAGGTTTACT TATTGGAATT 1500 

TACCGAATGT TTCATGAATT TGGG G TTGTG TATACTAGAC CAGGGCGCAG ACATGACTTT 1560 

GTTCCAGAGT TACGATTTGA AGATTTTTTA G ATAAAC AGC TATCTATAGA TGAAACAGCC 1620 
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ACTTACTATC ATAGGGGAGT ATGTATAGAG GGAGCGGATT CATTTGAAAA TATACTAGAT 1680 

TTCATTGATT GCCTACCTAA GATTGGGATG AACAGTTTTT TCATCCACTT TGAAAATCCT 1740 

TACTCTTTTT TGAAACCTTG GTATCAACAT GAATTTAATC CATATCTAAA TAAAGAACAA 1800 

TTTTCAAATG AATTAGTACA AGAATTGAGT GATAGGTTGG ATAAAGAATT GCAAAAAAGA 1860 

GGTCTTATTC ATCATCGTGT TCGTCATGGA TGGACAGGTG AAGTTTTAGG TTACTCTTCA 1920 

AAATTTGGCT GGGAATCACG TCTTAGTATT TCAGACGAGA AGAAACCCTA TGTCGCTCAA 1980 

ATAAACGCGA AACGAGAATT GTTTAATACG GCTCCGATTT TAACCAGCCT GGATTTTTCA 2040 

AATCCAGATG TAGCTGATAA GATGGTAGAA ATTATCAAGG ATTATGCCAA GAAAAGACCT 2100 

GATGTTAACT ACTTACATGT ATGGTTGTCG GATGCTCGTA ATAATATTTG TGAATGCGAA 2160 

AACTGTAGAC AAGAATTCGT TTCGGATCAG TATATTCGTA TTCTCAATCA ATTGGATAGG 2220 

GCTTTAACGA GTGAGGGATT AGATACAAAG ATTTGTTTTC TGCTTTATCA TGA G TT G TTA 2280 

TCCCCACCTC AGAAAGAAAA ATTAGATAAT CCTGAACCCT TTACCATCAT GTTTGCACCG 2340 

ATTACAAGAA CATTTGAAAT GAGTTATGCA GATGTAGATT TTGACAATTC CATACCTACC 2400 

CCTAAACCTT ATATCCCTAA TAAAATTATA CTTCCGAATT CTCTTGAGCA AAATTTATCT 24 60 

TATCTTTTTG AGTGGCAAAA AGC ATTT AAA GCACATAGTT TCCTATATGA CTATCCTTTA 2 520 

GGGCCTGCTC ATTATGGCCA TTTAGCCTAT ATGAAAATTA GTCAAACTAT TT AC AG AGAT 2580 

GTATCTTATC TTTCCAACCT ACATTTGAAC GGCTACATTT CCTGTCAAGA ATTACGTGCC 2640 

GGATTCCCTC AT AATTTTC C TAATTATGTC ATGGGGGAAA TGCTCT G GAA GAAGACAAGA 2700 

AGTTATGAAG AATTCATTGA AGAATACTTT TCTGCTTTGT ATGCGGAAAA TTGGCAGTCT 2760 

GTTGTTGAAT ATTTAGAAAA ATTATCCATT TATTCCTCTT GTGATTATTT TAATGCAATT 2820 

GGCAGCCGTC AAAGTGATGT TTTAGCGAAT CATTATTATA TACCTTACAA TCTAGCTGAT 2880 

AATTTTTTAC CAATTATTGA GGAAAATATT TCTAAGTTAT TAAATAGTCA AAAGGATGAA 2940 

TGGAAACAGC TCAGTTATCA TCGTGAATAT GTTGTTAAGA TGGCGAAGGC TTTATATCTT 3000 

CAACCAACTG CAAAAACAAC GCAAGCTCAA CATGAATCCA GAAATCTCTT CAATTATATC 3060 

CGTGGGCACG AATTGCTATT TCAATCTAAT TTGGATCTTT ATCGTGTAAT TGAAGTAGCA 3120 

AAAAATTACG CTG G TTTCCA CTTATAAATC ATAAGTATAG AAAATGAACT AAGGTATTCA 3180 

GAGAAGATTG ATCCTAAATA TTATGAAATT TAAGCATTTT TAAGATATTT AGGGTCAACT 3240 

TTCTATTTAT ATCGTAGCGA AGTCATTTTA ATAATGATCT GTAAAAGATG GATCAAGATT 3300 

GAGGAGGAAG AAAGATGAAA TCAAAAGAAG AAATAAATAT GCTTGGTTTT ACAATTGTCG 3 360 
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CTTACGCAGG 


AGATGCAAGG 


TCAGATTTGA 


TGCATCCTTT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


ATTTTGAACA 


GGCAAGAGAA 


TTGGTTGAGT 


CTGCAAACGA 


CTCAATAGTG 


TCTGCCCATC 


3460 


GAGAACAGAC 


TAATTTATTA 


GCGGAGGAGG 


CATATGGAGA 


TAATTTTGAA 


GTGAGCTTTA 


3S40 


TTATGATTCA 


TGGTCAACAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGTAAAGT 


3600 


TTTTTATTGA 


TGAATATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACRTATTGGT 


TTGCAATGAG 


3660 


G ATT ACT CAT 


GGAAAATTTA 


CAGGTTAAAG 


CCTTACCGAA 


GGAGTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GACGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3780 


GGGATCTTTA 


TTTGCAACAA 


AATAGTCCGT 


TCTTACCAGA 


TCCAGCTAGT 


GATTTTTATT 


3840 


ATCGTTACGA 


AGAGGATATA 


GCTTTGGCGG 


CAGAACATGG 


TTTGCAGGCT 


TTGCGTTTAT 


3900 


CTATTTCTTG 


GGTTCGTATA 


TTTCCTCATA 


TAGATGGGGA 


TGCTAATCTA 


TTAGCTGTTC 


3960 


ATTATTACCA 


TAGACTTTTT 


CAGTCTTGCT 


TAAAACATAA 


TGTGATTCCC 


TTTGTTTCTT 


4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAATGT 


TAGAAACAGG 


GCATTGGTTG 


AACAGAGAGA 


4080 


ATATTGATCG 


TTTCATACGA 


TATGCTCGCT 


TTTCTTTCCA 


AGAATTTACA 


GAAGTCAAGC 


4140 


ATTGGTTTAC 


AATCAATGAA 


CTGATGTCTC 


TTGCTGCAGG 


TCAATATATA 


GGAGGTCAGT 


4200 


TTCCTCCAAA 


TCATCATTTT 


CAATTATCTG 


AAGC AATTC A 


AGCGAATCAT 


AATATGTTGT 


4260 


TGGCGCATGC 


TCTTGCAGTC 


CTCGAATTTC 


ATCAATTAGG 


GATTGAGGGA 


AAGGT AGGTT 


4320 


GTATTCATGC 


TTTAAAGCCA 


GGCTATCCTA 


TTGATGGGCA 


AAAAGAAAAT 


ATTTTGGCAG 


4380 


CTAAACGGTA 


TGATGTTTAT 


AATAATAAAT 


TTCTATTAGA 


TGGAACTTTT 


TTGGGCTACT 


4440 


ACAGTGAGGA 


CACGCTTTTT 


CACTTGAATC 


AAATATTGGA 


AGCTAATAAT 


TCTAGCTTTA 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA 


AGAGACCTCC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


CCATCAATTA 


TTATCCrrCA 


GAATTTATTC 


GTGAATACAA 


AGGTGAAAAT 


ACACAAGAAT 


4620 


TTAATTCAAC 


AGGAATAAAA 


GGACAGTCTT 


CTTTTAAATT 


AAATGCTCTA 


GGTGAATTTG 


4680 


TAAAAAAACC 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TATTTATCCT 


CAAGGCTTAT 


4740 


TTGATATGTT 


GCTTCGTATC 


AAAGAAGAAT 


ATCCTCAACA 


TCCGGTCATT 


TATTTAACTG 


4800 


AAAATGGTAC 


AGCCCTTAAA 


GAAGTTAAGC 


CAGAGGGCGA 


GAATGATATT 


ATTGATGACA 


4860 


GTAAGAGAAT 


CCGTTATATT 


GACCAACATT 


TACACAAAGT 


TTTAGAGGCT 


CGAGATAGAG 


4920 


GAGTCAATAT 


TCAAGGCTAT 


TTTATATGGT 


CTTTGCAAGA 


TCAATTTTCT 


TGGGCGAATG 


4980 


CCTACAATAA 


GCGATATGGT 


CTTTTCTTTG 


TTGATTATGA 


AACACAGAAG 


AGATATATTA 


5040 


AGAAAAGTGC 


TCTTTGGGTA 


AAAGGGCTAA 


AACGGAATTA 


AGGTTAGCGA 


TTTGACTGAT 


5100 


GTTTAATATG 


TTTTAAATAT 


GAGGTTGAAT 


TTTTTATAGG 


AGGAGTTTTA 


TGGATAAGCT 


5160 



o»ierwwn. jwn 0010001 ao i ^ 



WO 98/18931 



PCTAJS97/19588 



S49 



4 





All uiwMAuL 


AA^AAW*l»AA 


Al l luAAAAA 


f\ I 1 4^1 AL 4 #\ 


ATA.AfTA.TAT 


5220 


OA iuUU Inl i 


A AAflATC^ AT 


I 1 1 AL 


1 A 1VCU I I 1 A 


f\ 1 k r\ i w 1 A 4 4 


L^VIVJW 4 ■» * * * 


5280 


t» A I Aj A I I A I I 


Ail AIuAI l(» 


CTAAAAATTT 


(•wOAu 4 AUAw 


4 1 nbWvAu 4 


L/iOv 4 A k V 4 


5340 




AAAGTGTATA 


Tvjuaaccat 


GGGAGTTTTG 


GGTATTAT/TC 




3VUU 


TGTTGGAAAG 


TCATTAGTTG 


GAAATGTTAA 


CACAAAAATC 


V. Li\>ALuuAn 


AVjO 1 AAs 4 AAA 


340U 


TGATATTTCT 


GCAATGTTGG 


CAtrCCATATG 


T AGTT AT CTG 


GTATTAAU i\j 


TAACGCTTGT 




AGTTGATGAG 


AAGACGGGAT 


CTACAAGTTT 


GTCGACAAAC 


TATTTAGGAT 


CTCAAGGATT 


5580 


GATAACTTCG 


TTTGTCAGTG 


CCTTTATTAC 


TGTAAATGTT 


TACCGATTCT 


GTATTAAGCu 




AGACATTACT 


ATTCATTTAC 


CTAAGGAAGT 


TCCTGGGGCT 


■l m m ifif* ft 14 ft ^» 

ATATCACAAG 


CTTTTAGAGA 


5700 


TATTTTCCCT 


TTTTCTTTTG 


TTTTACTTAT 


TAGTGGTTTG 


TTAGATATTG 


TATCTCGGTT 


5760 


TAGTTTAGAT 


GTTCCTTTTG 


CCCAACTATT 


TCAACAACTA 


TTGACTCCTA 


TTTTTAAGGG 


5820 


GGCAGAATCA 


TATCCTGCTA 


TGATGTTGAT 


TTGGTTTATG 


TGTGCTTTCC 


TTTGGTTTGT 


5880 


TGGAATTCAT 


GGACCATCTA 


TTGTCTTACC 


TGCTCTTACA 


GCTTTGCAAC 


TGAGCAATAT 


5940 


GGAAGAGAAT 


GCTCAACTTC 


TTGCAAATGC 


GCAGTTCCCT 


TATCATTCTT 


TAACACCTAA 


€000 


TTTCCCGAAT 


TATATCGCTC 


CTATTGGAGC 


AACGGGGGCT 


ACCTTTCTTC 


ft* Ik ft twrvr ft #v* 

TACCATTTAT 


6060 


TTTGATTTTC 


TTTATGCGGT 


CTAAACAATT 


AAAATCGGTA 


GGTAAAGCTA 


CAATTACTCC 


6120 


TGTTTTATTT 


GCGGTAAATG 


AACCTCTTCT 


ATTTGGTATG 


CCTGTTATTT 


TGAATCtCTA 


6 180 


TCTTTTTGTC 


cc'i r'nncA 


TGACTCCACC 


ft M • ft ft ft 

AGTGAATGTA 


1 ITL1 AGGAA 


AGGTCTTTAT 


OJrt 
Di4U 


TGATTTCTTT 


GGAATGAATG 


GATTTTATAT 


**** m |i iiiiii flk ^»T< 

CCAGTTACCT 


TGGACCTTTC 


CTGGTCCCTT 


OJvU 


GGGATTGTTA 


ATTGGAACGA 


ATTTTCAACT 


TATCTCCTTT 


GTATTTTTAT 


CTTTGATTTT 


a i An 

D JOU 


AGTTGTCGAC 


ATATTGATTT 


ATTTGCCATT 


trr\»TAt»Av-»UVrf 


TATwATAGAC 


AGTTACTGGT 


HA? ft 


GAAACAAGAT 


ATTCCAAGCT 


CAAATGATAT 


TATTTTAGAC 


GAGGATACAA 


GTGAAATAAT 


6460 


TC CTGCTG AG 


ATAGATGAAA 


TAAAAAGTAA 


GGAGTTGAAA 


GTACTGGTTC 


TTTGTGCAGG 


6540 


GTCTGGAACA 


ACTGCGCAAT 


TAGCCAATCC 


AATTAACCAG 


GGGGCTAACT 


TAACACAGGT 


6600 


TAGAGTGATT 


GCGAATTCAG 


GAGCCTACGG 


AGCTCATTAT 


GATATTATGG 


GTGTTTATGA 


6660 


TTTAATTATT 


CTGGCCCCAC 


AAGTTCGGAG 


TTATTATAGA 


GAGATGAAGG 


TGGATGCAGA 


6720 


AAGATTAGGT 


ATTCACATAG 


TTGCTACCAG 


AGGAATGGAA 


TATATTCATT 


TAACAAAGAG 


6780 


TCCAAGTAAA 


GCCTTACAAT 


TTGTATTGGA 


GC ATT ACC AA 


GCTGTGTAGT 


AAGTTTTTCC 


6840 


ATCTTTTATT 


TGACTAAAGA 


TTTTCTTTAC 


AGATAGGCTT 


GGATTTAAAA 


ACCTTCCCCC 


6900 
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TTTTTTAATA 


TAAGAATCCC 


TCTTTCACAA 


TTCTAAAAAG 


AGGGATTTTC 


TATTTTATCT 


6960 


CTTAGACCAA 


GTTCTCTTCA 


TAAAGAGAAG 


GAGGATTGGG 


TAAATCTCCA 


AGCGCCCTGC 


7020 


AATCATTGCA 


AAGGATAGGA 


GAATTTTTGA 


GATGGGACTA 


AAGATTGAGA 


AACTAGAAGT 


7080 


GGTTCCTAGA 


ATAGGCCCGA 


TATTATTGAA 


ACACCTAAAG 


ACAGCGCTGG 


TCACGACCAG 


7140 


AAAATCATTG 


CTATCTAGGC 


TGACAATAAA 


GATAACCGCT 


AGCAAAATCA 


TAGCATAGAT 


7200 


GACAAAGTAC 


TTGAGAATCT 


TATGCTCGGT 


ATCTTTGTCA 


ATCACCGTTT 


TATTAACATG 


7260 


GAGGGTCAAA 


ACACGGTGGG 


GCCATAGGAT 


TGACAAAATT 


TGGTTTTTGG 


CAATTTTTGA 


7320 


AAGGATGAGG 


CCTCGAATAA 


TCTTGAGTCC 


ACCTGCAGTT 


GATCCAGCAG 


AGCCACCGAT 


7380 


TGCCATGAGG 


AAAAGGAGGA 


TAAACTGCGA 


GAAGAGGGGC 


CACTTGGTAA 


TATCTCCATA 


7440 


TCCAAAACCA 


GTTCTTGTAA 


TGATGTTGGA 


AACCTGGAAG 


AAGCTCATTT 


CAAAGCTCTT 


7500 


TGAAAACCCT 


GGGTAGAGGT 


AGAGGGTGTT 


GAGGCTAATC 


AAGCCTGTAG 


AAACCAGTAC 


7560 


AATGACCAAG 


TAACCCCTAA 


GCTCTTCATC 


TCCAAAGAAG 


GCCTTGATGC 


GACCGAGCAT 


7620 


CAGGTAGTAG 


TAGAGGTTGA 


AATTTACTCC 


AAAAACCAGA 


ACTCCGATAC 


TGACCAGATA 


7680 


GGTAATCAGT 


CAGCTCCCAT 


AGTGGGCAAT 


TCCCTCGTTA 


TAGACGGTAA 


AGCCTCCAGT 


7740 


TCCCGCTGTC 


CCCATAGCAA 


TAACAAAACT 


ATCGTAGAGA 


GGCATACCCG 


CTAGATAATA 


7800 


GATGATGACA 


AAGAGGGAGA 


AGAGAGCTAG 


ATAAAGGAGA 


TAGAGAATCT 


GCCCAGTGTT 


7860 


TTTTAGTTTG 


GATACAACCT 


TGCCAAAAAC 


ACGACCTCCA 


ACCTCACCCT 


TCATCACCTC 


7920 


TAGGTGGCTA 


TTTTTGGCAT 


TCTCCATAAT 


AGCAAGTGCA 


AAAACAAGCA 


CTCCCATCCC 


7980 


TCCAATCAAG 


TGGGTAAAAC 


TTCGCCAGAA 


GAGGAGGGAA 


CGCCTGAGAA 


CCGAAACGTC 


8040 


CTTCAAAATA 


CTTCCTCCAG 


TAGTTGTAAA 


TCC AG AACT A 


ATTTCAAAAA 


AGGCATCAAT 


8100 


AAGGCTGGCG 


ATTTGCCCAG 


AAAAGACAAA 


GCGCAGACCA 


CCAAACAAAG 


ACCAAAGGAT 


8160 


CCAACAGAGC 


GCAACGATCA 


AGACTCCCTC 


CTTGCCATAA 


ATCCGTTGAT 


TTTTTCGCTT 


8220 


CTGTAAACTC 


CCTGAACCCC 


CTAACAATAC 


GAGAATCCCT 


ATGGTCGAAA 


AGAGGGCTGT 


8280 


AAAGACTTGC 


CTCGATTCAC 


GGTAATAGAC 


AGCAATCGCA 


ACAGGAACCA 


AAAGAAGAAC 


8340 


AGCTTCAATC 


AAAAGTAATT 


TTGAAAGGAG 


GTAACGAATC 


ATACTTTTAT 


TCATTTCTTA 


8400 


CCTCGCCATC 


AAGTCATAAA 


TCTTGGTGAT 


GTTTGGCAAC 


AACCTTCTTA 


CTAGCAGCTT 


8460 


GTCTCCAACT 


TCCAACATAT 


CCTCCCCAGT 


TGCGAAAATA 


CTCTTGCCCT 


TTCGAATAAT 


8520 


CGCTGCAATA 


AGAACCCCTT 


TTTTCAATTT 


CAGTTGAGAA 


AGAGCTTTGG 


CAGTCATTTT 


8580 


ATTGGCTTCC 


TTGATATGGA 


ATTGC AGGGT 


TTCGATTTGC 


CCATTGGCTA 


GATGGTGCAT 


8640 


AGCTTGAAGG 


TCTCAATACT 


GGGCATTAAC 


TCGACCACGA 


ATAAAGTGCA 


TAATCCTATC 


8700 



WO 98/18931 



PCTAJS97/19588 



551 



T ACACCGATG 


CTTTTAGGTG 


TCATCATACT 


TGAAAAATCA 


GGCGCATTGA 


TAATCTCGAG 


8760 




CGATTCACCT 


TAGTAATATT 


TTTCTGTACA 


CCTACCCTCT 


CAAGGAACAT 


8820 


AGATGTAATC 


AGATTTTC CT 


CATCGACTCC 


TGTTAGAGTC 


G CAACGGC AT 


CATAGTGTTG 


8860 




TCCAGCAGGA 


TATCTTTTGC 


GGTTCCATCT 


CCTTCAACGA 


TGTAGAGATT 


8940 


iuVlunA i 1 IV. 


TCGCTAAACA 


AGCTGGCGAT 


TTCAGGATTG 


ATTTCAATGA 


CTTTTGTATC 


9000 


\»t\ 1 ALunU 1 A 


TCTTTCAGAA 


T AC CAAGT AG 


ATAATAGGCA 


ATTCTACCTG 


CCCCAACGAT 


9060 


/— » f» ft i/T/TTT 
GAGAAGGtTl A. 




GTCATTTAAA 


ATAATTATGG 


AAGAGTATCA 


TATCGACACC 


9120 


oil nLLAu 1 v» 


AfAJLAGATTC 


A A W A * * •* * 


CTGTACAGTC 


ATGTCACCGC 


TTGGAATGAT 


9180 


AA i l 1GATUA 


TyrrrrTPTA. 

lULV. 1 Vb 1 W * f% 


TCGCACAGAC 


AATGACATTA 


CCAAATTTTT 


TACGAAAATC 


9240 


AGAAATGGGG 


Alii 


GACCCCTGGT 


GGACTTGACG 


ACAAATTCCA 


TGAGGCTAAC 


9300 


GCGTCCAtCA 


AAAlav. (•» 1 A 


PGACAGACAG 


GGCGTTGGGG 


AAGTCAATGA 


TATTCGCGAT 


9360 


ACCCCGGCCA 




f*Af^fi ATTAAC 


GATAAGAGAA 


AAACCGAGAA 


TATTCTTTTC 


9420 


CTTGAAATAA 




Ail Unuw * i 


CCGCACCCGA 


ACCATAGTTT 


CTTTAGCTCC 


9480 


CATTTTCTTG 




CTGCAATCAT 


GTTGACTTCA 


TCGTGCTCAG 


TCAGGGCGAT 


9540 


AAA G AT AT L. A 




CGCTGGCTTG 


CTCAAGAATG 


GCAAAATCGC 


CCCCCTTACC 


9600 


*k *>rt>M 

AAGGATAI.CA 




AGCGACTGAC 


AATATGATTG 


AGAACAGCTT 


CGTCTTGCTC 


9660 


AATCAIjCAAA 




TTTCTGCAAC 


CAAGGACCGA 


CAGAGGGCAA 


AACCAACTTT 


9720 


TCCCCCTCLvi 


Ar , &A<V!ATAA 
Ab> AAwwA 1 SW\ 


TTTTCATAAT 


AAAACCTACT 


TTTTCATGAT 


GTAACTATCA 


9780 


TACCCTTTTT 


rAAnAAAAAA 


TGCACCTACT 


AGCTAATAAC 


AAGAGTTTTT 


AGTGAAAATT 


9840 




TAAAACTATA 


CCCTAACCAA 


TTGAAATAGC 


TATTAGCGAC 


TTTCTCTGAA 


9900 




ATAAAGGATA 


TACAAGGAGA 


TAAAATGAAT 


AATAATTTAC 


TGGTATTACA 


9960 


ATCACACTTT 


GGTCTGGTTG 


ATGUluCUUT 




ATTGGAGTGG 


CTTT AG AAG A 


10020 


gtctccaacc 


TTAAAAATAC 


ATCACTTGAC 


GCACGATATC 


ACGCCTTATA 


ATATTTTTGA 


10080 


GGGCAGCTAT 


CGTCTCTTTC 


AGACGGTGGA 


TTACTGGCCT 


GAGGGAACGA 


CGTTTGTATC 


10140 


GGTTGTCGAT 


CCAGCTGTCG 


GTTCGAAACG 


TAAGAGTGTA 


GTTGCCAAGA 


CTGCAAAAAA 


10200 


TCAATACATT 


GTCACGCCAG 


ATAATGGGAC 


GCTTTCCTTT 


ATCAAGAAAC 


ACGTTGGCAT 


10260 


TGTAGCCATT 


CGTGAGATTT 


CTGAGGTGGC 


CAATAGGCGT 


CAAAACACAG 


AGCATTCTTA 


10320 


TACCTTCCAC 


GGTCCTGATG 


TCTATGCCTA 


TACTGGTCCT 


AAACTGGCCA 


GTGGTCACAT 


10380 


TACTTTTGAG 


GAAGTAGGGC 


CAGAGCTCAG 


TCTGGAACAC 


ATTGTAGAGC 


TTCCAGTCGT 


10440 
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AGCGACCATC ATAGAAGATC ATCTGGTGAA GGGAGCCATT GATATTCTCG ATGTGCGTTT 
CGGTTCGCTT TGGACCTCTA TCACACGGGA AGAATTTTAC AAGCTGGAAC CAGAATTTGG 
TGATC G TTTT GAAGTGACCA TCTATCATGC TCATATGCTG GTCTATCAAA ATCAGGTTGT 
CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGCCAACCs ATcTTTACrc TCAGCaTCTt 
CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
(O) TOPOLOGY: linear 



10500 
10560 
10620 
10680 
10726 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGG GATTGGGGCT TATCCTGAAG 60 

TTGGCATCCA AGCCCATCGA TATCTCTGTG ACGTGACAGA CGAGGACGGT ATCCAACCCA 120 

TCGTCAAGCA AATCGAACAA GAGGTTCGTC TCATTGACAT CCTCGTTAAT AACGCTGGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCCCTGA TTTCCCTAAG CTCATCCATA 240 

TTGACTTAAA CGCACCATTT ATCCTTTCAA AGGCAGTT AT TCCTTCTATG ATAAAGAAAG 300 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGAGCCA ACTGCGACGT CAAACAGTTA 360 

GCGCTTATGC TGCTGCTAAA GGGGGCTTGA AAATGTTCAC CCGCAACATT GCCTCTGAAT 420 

ACCGTGCAGC CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 480 

CAGCACCTCT TCGTGAATTG CAAGAAGATG CTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

TTGCAAAAAC AC CTG CTGCA CGTTGGGGAA ATACTCAAGA TTTGATGCGC CCTCCTGTCT 600 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGCCCACAT CCTATATGTA G ATGGCGGT A 660 

TCTTAGCCTA CATCGGAAAA CAACCTGAGT AAAAATAGAA AGAAGATCTT ATGAAAATCG 720 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACCAT AGTCTAAAAG 780 

AAGCGACAGA TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGGA GAAGAAGGAG 840 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGCCTCC CATCCTTTTA AATACAAAGG 900 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CGGGTGTAGG GCCTATGCTT CCTTTAAACA 960 

CCTTCCCTGG TGTTGTCTGT GGTCT AGC AG TGGACCCAAC TGACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGGTAACGCC TTGTCTATCC CTTATGCCAA AGGATTTGGC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTGCTGA AGAAATGGGC GGTGGCTACC 1140 
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SS3 



CAAGAGAACG 


«^/'w|> ft ft *tf 

TGTAATlCCT 


CAACAACGCA 


ACGCTCGTAT 






1200 


IK ^"^»^ft k 

TC ACC C AC AA 


TGAn i CATC 


ACCATCCTTA 


AAATAATCGA 


r*r* a. if* & f*r^f* 

l i t. 


r»*r f a a Afl Atf* A 
C 1 C4vv*u#ftd 


1260 


CCATCTCTGG 


^ ft ft ft <^ ft / HIM 1^** 

CAAATAcrrrc 


CAAGAATACT 


TCTTTGAAAA 


V. 1 VK_Ovvk*/\ I 


r* A1Y5 AACTTG 


1320 


CTGCTTATTT 


GAAAGAAGTA 


TTAGCCAAGT 


AAAGCTATTC 


TAAAtuAGAA 




1380 


GGATGACGAA 


AATATTACTG 


TTTGCCGAAC 


CATTAATTCG 


AAillCACCA 


*r*r» & f* & irrr a 


x *t v u 


CCAGTATCGG 


CGATCATCTT 


GCCAGTTCGA CTTATTTTGG 


f*^^ ft«T>^ft.^ It ft 

CGGATCAGAA 


ATTAAUA* s.v» 


13 Uw 


CTTGTAATTT 


CCAAGCCCTG 




ffl A A ARTPTT 


TACCGCACTC 


CCTGCCAACG 




AGATTGGAGA 


TCGTTTTCTC 


ACATTCTTGA 


A IkC AfSC AC£* A 
AAUlviv /\\_ 


AAT CGATACC 


AGTTCAATCT 


1620 


CTCCGCTTGG 


CGATCGAATC 






CGGCTTTGGT 


iix"m^^>n/' ft. ft ft 

TGTCGTCAAA 


1680 


GTGAAGTTTT 


CTACGATCGT 




fiTA*Tf*AC£ , CA 


GATTOGCCCA 


AACATGCTAG 


1740 


ATATGGATTC 


TCTCTTTCAG 




ATTTTCATTT 


TAGTGGAATC 


ft _ ,j u ».m ft AMtbA 

ACCGTAGCTA 


1800 


TCGGTCAAGA 


GGTCCCTGCG 


ATCCTTCTC C 


TAfTP TTftfl A 

X AC tLJ 1 VjklA 


AGAACCCAAG 


CGCCGAGGAA 


1860 


TTGTCGTTTC 


AATCGATCTC 


AATfTCACJUV 


CAAACATCAT 


TTCAGTCCTA 


GAAGCCAAGT 


1920 


ATGAATTTTC 


TAAGTTTGCA 


rr TTTTACTG 

V— w 1 1 1 1 1 w 


ACT ATT G CTT 


CGGTATTGAT 


CCTCTCATGA 


1980 


TTGATGACCA 


AAATCTAGAG 


ATCTTTCCAA 


GACACACTGC 


TAGCCTAGAA 


GAGGTGGAAA 


2040 


ATCCCATGCC 


ACTTTTAAAA 


GAAGCCTATG 


GTTTCAAGGC 


CATTTTCCAT 


ACCCTCCGCT 


2100 


CTAGTGATGA 


GCAAGACAAA 


AATGTCTATC 


AAGCCTATGC 


TCTAGAAGAA 


ft «MW||A ft ft ^ft 

CTATTTGAAG 


2160 


AGTCTGTCCA 


ACTAAAAACT 


GCAGTCTATC 


AACGAATTGC 


TAGCGGGGAT 


CCCTTTATAT 


2220 


CTCGTGCCCT 


TTACCAACTA 


CTCCATCATT 


CCTCCCTAAA 


Jfc R ft ft fM*n 

AACTACCATT 


GACTTTGCAG 


22 BO 


TTGCGAGCGC 


flft fk #Mfl/MA ft ft ft 

AACTCTCAAA 


TGCACTCTTC 


CAGGAGACCA 


TCTCTCCACT 


TCV. IVAAL 1 A 


^ J4U 


^*^V* ft fV»Mft^* ft * ■ 

GTATTGAAAA 


TTTACTGCCA 


AATCCACAAG 


ATATCATTCC 




•P ft MV»fti"/'> ft 


^4UU 


AATCAGATAC 


GATTATTGAA 


CTAAAAAAAC 


AAAAAATTGT 


CCCTGTTATT 


CGAGGAAATA 


2460 


CAAAGGAAGA 


AGGACTACAA 


GCCTCCATTG 


CTTGTATCAA 


GCGCGGTATC 


AAAGCTATTG 


2520 


AAATCGCCTA 


TACCAATCAG 


TATGCAGGAC 


AAATCATCAA 


GGAACTTCTA 


CACTTCTATC 


2580 


AGGACGATCA 


GAGTGTTTGT 


ATCCCTGCAG 


CTACTGTGCT 


TGATGCCCTA 


ACTGCTAGAG 


2640 


ATGCCATTCT 


AGCTGGAGCA 


AATTACGTTG 


TTTCTCCATC 


TTTCCATGCT 


GAAACTGCGA 


2700 


AAATGTGCAA 


TCTCTACAGC 


ACACCGTACA 


TTCCAGGCTG 


TATTACCCTC 


ACAGAGATCA 


2760 


CGACTGCACT 


TGAAGCCGGT 


ACTGAAATCA 


TCAAACTCTT 


CCCAGGTAGT 


ACTCTCAGTC 


2820 


CAGCATATAT 


CTCTGCAGTC 


AAGGCACCGA 


TCCCACAAGT 


TTCCGTAATG 


CTAACCGGAC 


2880 
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GAGTCGCCCT 


AAACAACATC 


CCTCAATGGT 


TCGCTGCTGG 


TGCAGATGCC 


GTTGGAATTG 


2940 


GTCGCGAACT 


CAATAAACTC 


CCTTCCCAAG 


GCAACTTTGA 


CCGCATCAGC 


GAGATTGCCC 


3000 


AACAGTATAT 


TACACTCAGA 


TAAAATCATA 


ACTACCCGTC 


TAACCGGTGG 


TTTATCTCAG 


3060 


ACCTATAAGC 


CCAAATCATC 


AGCCAGCGCC 


TAAAGACGCT 


GGCTTTCACG 


TTGTTCAAGC 


3120 


CTTATTCCTC 


TTGACTCGTC 


ACTTGCCTCT 


TTAAGAGACT 


TTCGTATTAC 


TTACCACTAT 


3180 


CCCTAAAGGG 


ATCCTCATAT 


TCTTTTACAC 


TCAATTTATC 


TAGTGCTATA 


GTAGATTCAA 


3240 


ACTGGAATAG 


TACACCTCTG 


CTTCTAAAAC 


ATTGTTAAAA 


ATCGATTTGA 


CTGTCCTGAT 


3300 


CGATTTTGTC 


CTGTTCTTAT 


TTCATTTTAC 


TATATATCAT 


ACTTTACTCG 


TTCTCAAATT 


3360 


TTCATACTCA 


TGAAGAAATC 


ATCCACTCGA 


TAATTTCTTT 


AATCTTGACT 


ATATTTCTTA 


3420 


ATTGTGGCTT 


CATTAACCCC 


TACTGGACTT 


ACATAATAAC 


CTTCCTCCCA 


GAAATGCCGA 


3480 


TTCCCAAACT 


TGTACTTGAG 


ATTGGCGTGT 


TTGTCAAACA 


TCATGAGTGC 


ACTTTTGCCT 


3S40 


TTTAAATACC 


CCATAAAACT 


TGAAACACTT 


AGCCTCGACG 


GAATACTGAC 


TAACATGTGT 


3600 


ACATGGTCTG 


GCATTAAGTG 


ACCCTCGATC 


ATTTCAACAC 


CTTTATAACT 


ACACAACCGA 


3660 


TGAAATATTT 


CGTCTAAACT 


ACTTCTATAT 


TGATTATAGA 


TGACTTTTCC 


TCTATACTTA 


3720 


GGGGTGAACA 


CAATATGATA 


CAACACCTCC 


ACTTTGTGTA 


TGATAAACTA 


TCAGTCTTTT 


3780 


CTGCCATATT 


TTTTCTCCTT 


TCGCTTTACA 


ATTGGATTGA 


ACACCTTTAT 


TCTATCCCGT 


3840 


TTGGAGTTTT 


TTTGGTATAA 


CCTTCGACGC 


GCACCCCTAT 


AGCGGCTGGT 


TCTTTTGTCT 


3900 


CGCACCTCAC 


GGAGCGAGAC 


GGACTAATAT 


ACTGGAGTCA 


AATAGGATAC 


GAACAAATTG 


3960 


ATTAGGAAAA 


TCAAATGAAT 


TTATAGAAAT 


CTTTTAGCAG 


TTATAACGTT 


CTATTCTAGT 


4020 


TTCAAAACGC 


TATACTCACA 


TAATAATCAA 


CTAAAAAAGG 


ATAAGTATCA 


ACTTATCCTT 


4080 


TTTTAAAAGA 


AAAATCCGAA 


GATATTTGCC 


CTTCTTCGGA 






414C 


TTCATCTAAT 


TCATCTAGAT 


GATGAACAAA 


TTAGTTGTTC 


TTTCCTCTAC 


GGAATAGATA 


4200 


AAATGCCCCA 


ACTAGCAAGA 


ACCCTAGACT 


TGCCAAGATT 


GACTGACCTT 


CTCCTGTCTG 


4260 


AGGCAGATTC 


TTTTGATCCG 


AATGGTTCTT 


TTCCTCTTCA 


GATTTTTCCT 


TTTC TTTTGA 


4320 


ATTCTGTACT 


TGTGGCTGAG 


CTGCTTCCTC 


TAGCTTTTTA 


AAGACTTCCT 


GATCTGGAGC 


4380 


TGATTCCTGG 


GTTTCAGGAT 


TATAGTAGGC 


AATCTTATAT 


TCATCCCCTT 


CTTTTCGAAT 


4440 


GGTATAGACT 


CCACGTTTCA 


AAACTTGGAA 


TTGGTTGGAA 


ATAGTAGAGA 


CAGAATCATC 


4500 


ATATTTCACA 


ATGCCCCAAA 


CTCCTTGTTT 


AGCATCATAA 


ACAGACTGAA 


GGGTTTCGTT 


4560 


ATTTTCGATG 


AGGCTACTTT 


CTAACTCTTT 


TATCATTTGA 


TTGAAGGTGG 


CACGATCCAC 


4620 


GTTAGGAATG 


AGCATATAGC 


CATAAGAATC 


TCTATTTTGC 


TTATGAGCCT 


CACTAATCGT 


4680 
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AAGAAATTCA TTTTCAACTT CCTTGTCTGA CTCTCCTTCA TTGATATCCT TCCAGGCTCC 4740 

CTTTTGCAAA GCCTTACTCA TACTGATTCA ACTCTTCTTA AACAAAAAGT AACCAATATT 4 800 

CTTTTTCCAA TCGAACGATT CTAAAAAGAC ACTTTGGGTT TCAGGATAAT CCTTTTCTTG 4860 

TTCTCTAAGG GAGGCTTCTT TATCATTGAC ATAGACTTTA TATGGATTAC CTGATTCCAC 4920 

TTTTCTCTGG TCAATTCTAG TTGCAGCAGT ATCTGTTGAA GTGTTTTGGA TATTGCTTCC 4 980 

TAAAAAGGCG ATCTTATCCT TTAGCATAAA CCACCTCTTA TGACCAGTCA ATGTTTGATT 5040 

CCAGTTGGTG AAATCCATGG TTGCTGTCGC ATTGGCATCA TCTA C TTT G C TCGTTCCAAC 5X00 

GAAAGCAGAC GGTAAAACTT TACCTGTATC GCTATCCGCT CTCTTAGCAT CCGTCTCTGT 5160 

TGTACCAGGC ATCTTATATG GATTAACTCT TGCCCAGTAG CCATCCCTAT AGTGACTCAA 5220 

ATCGCCATTG TAAAGATAGA ACATCCCATC ACTCGTATAC CAACCAC GTT TATTTTCCTT S280 

GTTCATGTCT TCGTAATTCA ACGTACCACT GGAAAAGAGT GACAACCCAA ATCCAAACCC 5340 

TTTCTCTGCA TTGTACATGC CT G TTTTATC CAT C TT G TTA AAGGCACATA GGTAACTTGG 5400 

TCTTGGAACA CTTGCGACTC CTGCATCACT TAACAAGGAT TGCATCAAAC TCATATCCTT 5460 

ATAAGTCTTC AAATTCTTAA AGACATCATA ATAACTATCC GATTGAACAA TGCTCTTCAC 5520 

AACACTCTGC AAACATTGTT TGGTTTCTCC TTCACACATA TCCCCTATTC GCTCAATCCC 5580 

TCTTAGTACT TCTACTGCGG CCACGTGCCC CTCGCTATTT GCACGACTGA TCGACCGTCC 5640 

ACGACTCATA TCCATCAACT CTCCATTCAC CAGCAAAGGA CCAAACGATT TATCAATCCA 5700 

CTGGTACATG GTTTGCATTT TATCTTTATC GATTGCATTC TTCCTC TTTT GAATCACTGG 5760 

CAACAGTTGA GACAGGCCAT CAATCAAAAC ATTCCCATAA GCACCCGTAT AGGCAACATT 5820 

GGTGTCGTCG ATATAGGATC CATCTTGATA AAAACCTTCA CCTTCCTCTA CCAACTTGAA 5880 

CACTTGCTCA ATCGAGCGAA TGGTAGAAGA AATTT C TTGA TCATCCTTAC GCAGTAAACC 5940 

AG CT ATT ACT TTTACCCTTC CCATATCAAC TAACTTTCCA CCTAGAGCCT TGAATGGGTT 6000 

ATCAGTCGTC TTTCGGAAAT GTTCGGGATC TGGTACAAAT TTTTCAATCA CATCTGTATA 6060 

TTTTTTAATT TCCTCATCAG AGAAGTATTC TTTCATCAGA GACAAGGTAT TGTTGATGGC 6120 

ACGAGCTGTA CCGATTTCAT AATCCCACCA GTTCCCAACA ATGCTCTT T T CACTATTGTA 6180 

GACATGTTTA TGCATCCATT CCATGGAATC CCTGACTGTT CGAACGACAG TTTCATCTTG 6240 

ATAATAACCA GAAGAAGGAT TGGTCACTTG CTTGGCCATC TCCTCCAATT TCCGATAAGT 6300 

GGCAGTCAGA TTTGCAGACG TTTTATAATT TGAAAATTTT TCCCACAAAT ACCTGCGGTC 6360 

CGCCTGACTT GAAATACTGG ATAGGCTATC ACCTACCTTT CCTTCCAATT CCTGGTTTAA 6420 
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TTTGGCCATC TGTTCATTTT TACAATCATA GTATTGATTC CCACCGATGA TGCCATTCCA 6480 

GTCATCCAAA CGGTCTGTGT ATGCATCCTT AACAGACGCC AGAATCTTCA AAGGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT GTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT ACCTTATAAG TATAGTCCCC 6660 

AAGAGAAAAA ACATCTTTTT TTCCAATTCG TAAATCAATC TTTTCCTCAA GCTGTTTATC 6720 

TGTTTGAGAA TCCTCAGAAA GCTGGTCTGC TACCTCTACC AGCTCAATAT CCTTAAAGGA 6780 

AACACTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG ATTTTATCAA CATCTAAAGT 6840 

CGGGCTATAG TCTGCTTCAA TGGTCTGCCA GTC C T T T G TT CCTGACGTCG TTGCAGAATT 6900 

CCACAATCGC TT G TCCTTAC CA C TTTCCTC AATGATACGA ACTTTGGCAA TCCCGATTTT 6960 

ATTATCTGTT TTAATCTTGA AACGCAGTTT ATACTTTTTC TTAGCTTCAA TAGGAACCAT 7020 

ACGGTGAAGC CCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7080 

CTCAATGACT CGAGTTGAGG CATCTGCACT ATTCTTCTCC TCTACCCAAG CTGACCACCC 7140 

CCTGACCTTT GCTTCCTCTC CCG 7163 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 9244 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANOEDNESS : double 
<D> TOPOLOGY: linear 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



CGTTATAACA 


TACATGTAAG 


CGGTACCCAA 


AATGCTGCCA 


AGTCAAAATT 


TTTAAGGACG 


60 


AAAATACATG 


TCTTCACATC 


CAATTCACCT 


CTTCTCAGAA 


ATTGGGAAAC 


TGAAAAAAGT 


120 


TATGTTGCAC 


CGTCCAGGCA 


AGGAGTTAGA 


AAACTTGTTC 


CCGGACTATC 


TTGAAAGGCT 


180 


TCTTTTTGAT 


CATATTCCTT 


TCTTGGAAGA 


TGCTCAAAAA 


GAACATGATG 


CATTTGCCCA 


240 


AGCTCTTCGC 


GATGAAGGAA 


TTGACGTTCT 


CTACCTAGAA 


CAACTCGCTG 


CTGAATCATT 


300 


GACCTCTCCA 


GAAATCCGCG 


ATCAATTTAT 


CGAGGAATAC 


TTAGACGAAG 


CCAACATCCG 


360 


TGATCGTCAA 


ACCAAGGTTG 


CTATTCCTGA 


ATTGCTTCAC 


GGCATCAAGG 


ACAACCAAGA 


420 


ATTGGTTCAA 


AAAACAATGC 


CTGGGATTCA 


AAAAGTTGAA 


TTGCCAGAAA 


TTCCTGACGA 


480 


AGCTAAAGAT 


CTAACTGACT 


TAGTTGAATC 


AG AGTATC C A 


TTTGCAATTG 


ACCCGATGCC 


540 


AAACCTCTAT 


TTCACTCGCG 


ACCCATTTGC 


AACAATTGGA 


AACGCCGTAT 


CCCTTAACCA 


600 


CATCTTTGCA 


GACACTCGTA 


ACCGTCAAAC 


ACTCTACCCT 


AAGTATATCT 


TCAAATACCA 


660 
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CCCAATCTAT 




TGGAli 1 Vjo 1 




CAAflAAfiATA 


CGCGTATCGA 


720 


m r*r**T*r*r* ft f+ ft ^ 

AGGTGGAGAC 


GAGTTAGTTC 


TTTCTAAAGA 




V> 4 nuu 1 *\ * W 4 


CTCAACGTAC 


780 


AGACGC AG CT 


TCTATvGAAA 


AACTTTTGGT 


r~ a. a riTfTTf* 




TTGGCTTCAA 


840 


CAAAGTTTTG 


GCCTTTGAAT 


TTGCTAACAA 








900 


CACTATGGTA 


GACTATGACA 


AGTTCACTAT 






fcrf"y*T**P^'* Af V?^* 


960 


TTACTCACTT 


ACTTACGAAA 


ACGAAAAACT 


TAAAATCGTT 


/- * ft/* ft a ft/"* 


rTflafTTlfir 
O 1 WW. 1 lAU^i 


IVtv 


TGAACTTCTT 


GCTCAAAACC 


TTGGTGTAGA 


AAAAGTTCAT 






1 ft Art 
1UDU 


CAATATCGTA 


GCAGCTGCGC 


GTGAACAATG 


ft M ft ^^^^ 


TCTAAUAvTTT 




1 1 AO 


ACCTGGTGTG 


GTAGTTGTTT 


ATGACCGCAA 


TACCGTGACC 


ft ft fl»ft ft ^ ft*t»*^W 

AATAAGA l l I 




1«UU 


CCGCCTTCGC 


TTGATTAAGA 


TTCGCGGAAG 


TGAATTGGTT 


CGGGGCCGTG 


GTGGACCTCG 




TTGTATGTCT 


ATGCCATTTG 


AACCTGAAGA 


AGTGTAATCG 


CT GTTCG AT A 


TTCGTCAATA 


1 lOA 
lJ4U 


GAAAATGTAA 


AAAATAGAAA 


GAGGAAATAA 


TAAAATGACA 


AATTCACTAT 


TCCAAGGACG 


13ou 


CAGCTTCTTA 


GCACAAAAAG 


ACTTTACCCG 


TGCAGAGTTA 


GAATACCTTA 


TTGGTCTTTC 


1440 


AGCTCACTTG 


AAAGATTTGA 


AAAAACGCAA 


TATTCAACAC 


^ ft J **t* ft 

CACTACCTTG 


CTGGCAAGAA 


1 cnn 


TATCGCTCTC 


CTATTTGAAA 


AAA C ATCT AC 


T CGT ACTCGT 


GCAGCCTTTA 


CAACTGCGGC 


130U 


TATCCACCTT 


GGTGCTCACC 


CAGAATACCT 


CGGAGCAAAT 


GATATTCAGT 


fw^*^*^mft ft ft ft & 
TGGGTAAAAA 




ACAATCTACT 


GAAGATACTG 


CTAAACTATT 


GGGACGTATG 


TTTGACGGGA 


TTGAATTCCG 


1 AAA 
1DOU 


CGGATTCACC 


CAACGTATGC 


TTGAAGAATT 


ft y"^ Ik ftJ*4*ft^ 

GGCAGAATTC 


TCAGGCGTTC 






CGGTCTAACT 


GACGAATGGC 


ACCCAACTCA 


ft ft fW^*f«^^^f% 

AATGCTCGCT 


UAwTALTluA 




1 sna 


AAACTTCOGT 


CGCTTCGAAG 


GC TTG ACATT 


GCTATACTGT 


(A* 1 V* A 1 




1860 


TGCCAACAGC 


TTGCTCGTAA 


CAGGTGCTAT 






1W1 IWi w 


1920 


AAAAGAACTC 


TTCCCAGAAA 


AACAAATCGT 


TGAATTGGCA 


GAAGCATTTG 


CTAAAGAAAG 


1980 


TGGCCCACAT 


GTTCTCATCA 


CTGAAGATGC 


TGATGAAGCA 


CTTAAACATG 


CACACGTTCT 


2040 


TTACACAGAC 


GTTTGGGTAT 


CAATGGGTGA 


AGAAGACAAA 


TTCGCAGAAC 


GTCTAGCTCT 


2100 


TCTTAAACCT 


TACCAAGTCA 


ATATGGACTT 


AGTTAAAAAA 


GCAGGCAATG 


AAAACTTGAT 


2160 


CTTCCTACAC 


TGCTTGCCAG 


CATTCCACGA 


TACTCACACT 


GTTTATGGTA 


AAGACGTTGC 


2220 


TGAAAAATTT 


GGTGTAGAAG 


AAATGGAAGT 


AACAGACGAA 


GTCTTCCGCA 


GCAAGTACGC 


2280 


TCGCCACTTC 


GATCAAGCAG 


AAAACCGTAT 


GCACACTATC 


AAAGCTGTTA 


TGGCTGCTAC 


2340 


ACTTGGTAAC 


CTTTATATTC 


CTAAAGTATA 


ATTTTAGATA 


ATAAACCGTC 


TACCAACAGC 


2400 
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TATGAGGGCT 


GCGACTAATA 


GCTTTAGTCC 


GGTCCTCTTT 


TATGTAATGG 


TAATCTATTA 


2460 


TTTCTTATAA 


AATATCTGAA 


AAATCATTAA 


ATTGAAATCT 


AAACGCATTC 


TATTGAGTGT 


2520 


GATAAAGGAG 


AATTTATGGC 


AAATCGTAAA 


ATTGTAGTAG 


CTTTGGGAGG 


AAATCCGATT 


2580 


CTTTCTTCTG 


ACCCATCAGC 


AAAGGCTCAA 


CAAGAAGCTT 


TAGTTGAAAC 


AGCTAAGCAT 


2640 


CTTGTAAAAT 


TGATTAAAAA 


TGGAGATGAT 


CTGATTATCA 


CTCACGGTAA 


TGGACCTCAA 


2700 


GTTGGGAATC 


TCTTCCTCCA 


ACATTTGGCA 


TCAGACTCTC 


AAAAGAACCC 


TGCCTTCCCA 


2760 


CTCGACTCAC 


TTGTCCCTAT 


GACAGAAGGT 


AGCATCGCTT 


TCTGCTTGAA 


AAATGCTTTG 


2820 


CAAAATGCTC 


TCTTGGATGA 


ACGCATCGAA 


AAAAATGTTG 


CCTCTGTTGT 


AACGCAAGTT 


2B80 


GTCGTAGATA 


AAAATGATCC 


AGCTTTTGTT 


AACTTGAGTA 


AACCAATCGG 


TCCTTTCTAT 


2940 


TCAGAAGAAG 


AAGCAAAAGC 


AGAAGCCGAA 


AAAAGCGGAG 


CGACTTTCAA 


GGAAGATGCT 


3000 


GGCCGTGGCT 


GGCGTAAGGT 


CGTTGCCTCA 


CCAAAACCTG 


TTGACATCAA 


AGAAATTGAA 


3060 


ACCATCCGTA 


CTCTTTTAAA 


TAATGGTCAA 


GTCGTCGTAG 


CTCCAGCTGC 


TGGCGGTATT 


3X20 


CCCGTCGTCA 


AAGAAAACAA 


TGCACATTTG 


ACTGGTGTCG 


AAGCCCTTAT 


TGATAAAGAC 


3180 


TTCGCTTCCC 


AACGTTTGCC 


AGAATTCGTT 


GATGCAGACC 


TCTTCATCCT 


TTTGACAGGT 


3240 


GTAGATTATG 


TATTTGTTAA 


CTACAACAAG 


CCAAACCAGG 


AAAAATTGGA 


ACATCTCAAT 


3300 


GTTGCCCAGC 


TGGAAGAATA 


TATCAAACAA 


GATCAGTTTG 


CACCAGGTAG 


CATGCTTCCA 


3360 


AAACTAGAAG 


CACCTATCGC 


TTTTCTCAAT 


CGTCCTCCAG 


AAGGAAAAGC 


AGTTATTACT 


3420 


TCCCTTGAAA 


ATCTAGGCGC 


CTTGATTGAA 


TCTGAAAGCG 


GAACAATTAT 


TGAAAAAGGA 


3480 


TAAGTTGTTT 


TACTAATAAG 


ATGTATTCTA 


TTTCTAGTAT 


CTTTATATCA 


AATTAGAAAT 


3540 


TATTCTTGAA 


AACATGTACA 


ATATTTCAAA 


AGATACTAGT 


TTTAGACTTT 


AATATGGTAA 


3600 


AACAAATATA 


AATAGAAAGC 


GTTTTCTTCA 


ATGTTTATTT 


AACAAACTAG 


TTCGTTTTTT 


3660 


ACACTTTCTT 


AGACATCAGG 


AGGAAAAACA 


AATGAGTGAA 


AAAGCTAAAA 


AAGGGTTTAA 


3720 


GATGCCTTCA 


TCTTACACCG 


TATTATTGAT 


AATCATTGCT 


ATTATGGCAG 


TGCTAACTTC 


3780 


GTTTATCCCT 


GCGGGGCCCT 


TTATAGAAGG 


TATTTACGAC 


ACTCACCCTC 


AAAATCCACA 


3840 


AGGGATTTGG 


GATCTCCTCA 


TGGCACCGAT 


TCGGGCTATG 


CTAGGTACTC 


ATCCAGAGGA 


39Q0 


AGGTTCGCTC 


ATTAAAGAAA 


CGAGCGCAGC 


GATTGATGTA 


GCCTTCTTCA 


TCCTTATGGT 


3960 


TGCTCGTTTC 


CTTGGCATTG 


TCAACAAAAC 


TGGTGCTCTT 


GACGTAGGGA 


TTGCCTCTAT 


4020 


CGTGAAGAAG 


TATAAGGGCC 


GCGAAAAAAT 


GTTAATTTTG 


GTACTGATGC 


CTTTGTTTGC 


4080 


CCTCGGTGGT 


ACAACTTATG 


GTATCGCTGA 


AGAAACAATG 


GCCTTCTATC 


CACTCCTTGT 


4140 


GCCAGTTATG 


ATGGCCGTTG 


GTTTTGATAG 


CCTGACTGGT 


GTTGCAATTA 


TTTTGCTCGG 


4200 
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TTCTCAAATC 
GACTGCCCGA 
GACTGCTCTT 
TAAGTCACTG 
TTCATCTGTA 
GACATTCATC 
TGATGACTTT 
TACTTCTGCA 
TATCCTGATT 
TGGTGCTGCT 
TATCATGAAC 
CGGTCTATCT 
CTTGATCCCA 
AGAATTTGTA 
CTTGAACTTG 
CAACATTCGT 
CATCGCCCTT 
TTCCATGAAA 
CTTGATTTCC 
AATCCAAGAT 
TCTTGACCCT 
CATTCTCTGT 
ATTTGAAGCA 
CCCTTCGCTC 
AAAGCGCGTA 
CTACAATACC 
CACCTATGCT 
AGAGCTTGAA 
CCTCTATGAA 



GGCTCTTTGG 
GTTGGTACAG 
AGTACTTGGT 
GTTTATACTA 
GAATCTACAC 
TTGATGCTAT 
AATACTTGGT 
CTAGCTACTT 
GGTGTTATTT 
GACTTGCTCA 
GACGGTATGA 
TCACAAGTCT 
TCTTCATCTG 
AATGTCCCTC 
ATTGCACCAA 
ACTTGGTGGA 
CTTCTCCTTG 
AT AG ATAT AA 
TATCCTTCAG 
GTCCTAGAAA 
AAAGGTTATT 
CATTTGCATG 
ACTATCAAAG 
GCAGCTCTCT 
CGCTTTATCT 
ATCGAAGAAC 
GAAAAACGGC 
GTAGGAGGCG 
CAGGTTTGTA 



CATCTACTCT 
GGGACGGTAT 
TTGTTTACCG 
CTCGCAAAGA 
TTACCAGCAA 
TGAGCTTCAT 
TGACTGGTCT 
GGTACTTCCC 
ATGGTCTTAA 
GTGTTGCCTT 
TTACCGATAC 
TTATCGTTGT 
GTCTTCCCAG 
CTAGCTTGAT 
CATCTGGTAT 
AATTCATGGG 
GAACCTTCCT 
CAAATCAAGT 
TACTCAATGA 
AAACTTTAGA 
ACGGATATGC 
TTGTTCCATC 
ACGGCTGGGT 
ATGCACTAAA 
TTGGTACCGA 
AGGCCAGTAT 
TTCTACAGGT 
CCTTTAACGT 
ACGGTCTCAA 



SS9 
GAATCCATTT 
CGTACTTCGT 
TTATGCGGAT 
AGATTTCAAA 
ACAAAAATCA 
TCCATCCACA 
TCCAGTTATT 
AGAAGCCGCA 
AGAAGATAAG 
GATCGTAGCG 
AATCCTCAAC 
AACTTATATC 
CGCAACTATG 
TATCACTCCT 
TGTCATGGGA 
CAAACTCGTA 
TCCATTCCTA 
TAAAGATCAA 
AGGAGAAAAT 
GATTTCTCGA 
AGAAATCGCT 
AGCTGATGAA 
ATTCGGACGT 
AAGCTTGCTG 
TGAGGAAACC 
GGGCTTTGCA 
CAAACTTCAT 
TGTACCACAC 
AGAAGCTGGT 



GCGACAGGTA 
CTGATCTTCT 
AAGATTCAAA 
CACTTTAACG 
GTTCTCTTCT 
GACCTTGGCG 
GGTAATATTG 
ATGCTCTTTG 
ATTATCTCTT 
ATTGCTCGTG 
TGGGGTAAAC 
TTCTATCTAC 
GCTATCATGG 
TACCAATCTG 
GCTCTTCCAC 
GTCGCTATTA 
TAAAATAGTG 
TTTCTTATAT 
GGAACACCTT 
GACATAGGTT 
CAGGGAGCAG 
GCAGATTGGC 
GGTGTCCAAG 
GACCAAGGTA 
CTCTGGCGCT 
CCTGACTCAT 
CCCCCTGGAT 
AAGGCCAACT 
TATGATTACC 



TTGCTTCAGC 
GGGTTACCTT 
AAGATCCGAC 
TAGAAGAATC 
TATTTGTGTT 
TTACCATTTT 
TCGGTTCATC 
CCTTTATGGG 
CCTTCATGAA 
GTATTCAAGT 
AAGGCTTCAG 
CTATGTCATT 
CTCCACTTGG 
CTTCAGGTGT 
TTGGACGTAT 
TTGTACTCAC 
AGTGAGGTGA 
CATTAAAAAC 
TTGGACAAGC 
TCACTACCTA 
AGCTTCTGGC 
ACACACCGCC 
ATGATAAAGG 
TTCAGTTCAA 
GCATGGCACG 
CTTTTCCTCT 
CGCATCAACT 
ACCAAGCTCT 
AAACCACTGA 



4260 

4320 

4380 

4440 

4500 

4S60 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

S040 

5100 

5160 

5220 

5260 

S340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 
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ACAAACCGTA 


ACGGTTCTCG 


GAGTGCCAAA 


CCATGCTAAG 


GATGCTAGTC 


AAGGTATCAA 


6000 


TCCTCTCATC 


CCACTAGCTA 


CCATTCTTGC 


TCCTCTCCAA 


GAACACCCTG 


CTCTCAGTTT 


6060 


TCTTGCAACA 


CAAGCAGGTC 


AAGACCGCAC 


AGGAAGACAA 


ATCTTTGGTG 


ATATAGCAGA 


6120 


TGAACCTTCT 


GGTCACCTAT 


CCTTTAATGT 


CGCAGGTCTC 


ATGATCAATC 


ATGAACGTTC 


6180 


TGAAATCCGT 


ATTGACATTC 


GGACTCCTGT 


CTTAGCTGAC 


AAGGAAGAAC 


TAGTAGAGTT 


6240 


CCTTACAAGA 


TGTGCACAAA 


ACTACCAACT 


CCGCTACGAA 


GACTTTGACT 


ATCTAGCGCC 


6300 


TCTATACGTC 


GCAGAAGACA 


GTAAACTCGT 


TAGCACACTG 


ATGCAAATCT 


ACCAAGAAAA 


6360 


CACTGGCGAT 


AACAGTCCTG 


CTATTTCATC 


CGGTGCTGCC 


ACTTTTGCTC 


GCACCATGCC 


6420 


AAATTCTGTA 


GCCTTCCGCG 


CCTTATTCCC 


AGGAGCGAAG 


CAGACAGAAC 


ATCAGCCAAA 


6480 


TCAATGTCCC 


GTTCTAGAAG 


ATTTGTACCG 


TCCTATGGAT 


ATTTATGCCG 


AAGCCGTCTA 


6540 


TCGACTTGCA 


ACTTAATCAG 


GCAACTGTTT 


CTACCAAAAA 


AAATCGACCG 


ATTAATGAAC 


6600 


TGCACCCCAA 


AAGTTACACA 


GAATAAATCT 


AACTTTTGGG 


GTGTTTTATT 


ATGAAATTGA 


6660 


GTTATGAAGA 


TAAACTTCAG 


ATCTATGAAC 


TAAGAAAGCA 


AGGACAAAGC 


TTCAAACAGC 


6720 


TTTCAAAAAC 


ATTTGGTGTG 


CATGTTTCTG 


GTCTAAAGTC 


ATCTGAATCT 


TTGAGATGAG 


6780 


CTTTATAAAT 


CGCTTTTTTC 


AGTTTTTGCA 


CTCCTCTTTC 


GATAAACTCA 


AACTTTTTAG 


6840 


CCGTGGTATT 


GCCTGATTTT 


ATACTATATT 


CAAACTAGAA 


TACTACACCT 


CTCCTTCTAA 


6900 


AACATTTTTA 


GAAATCGATT 


TGACTGTCCT 


GATCGATTTG 


TCCTGTTCTT 


ATTTCATTTT 


6960 


ACTATATTTG 


AGCCACTTCG 


TCTTTAACGG 


CTTTATTCAT 


AAGCTCTTGT 


AATTTTTCTT 


7020 


TACT AT C AAT 


TACTTCTGAT 


TTTCCGTTGT 


AATTTATTGT 


AATAGGTTTT 


AACTTACCTA 


7080 


ATTTCTCGAC 


ACGCTCATTA 


ATTTGATCTT 


TTTTCAACGC 


TGCTTATGTT 


TTTCCTAAGA 


7140 


TTTTTTCAAA 


AATATATTTA 


TCAGATAGCG 


GTTTGTCTTC 


TTCTTCAGCT 


TGGTTTTTGT 


7200 


ATTAATTTGA 


AACATAAGGA 


ACAAATCCTT 


CATAGTAACC 


TAATGCTCCC 


ATAAGTTCAA 


7260 


AAGCTTGTTT 


TCTAATTCAA 


ACCATTGCAA 


CTCAGATTTC 


AGCTTTTCAG 


ATAAATCCTG 


7320 


CTCATCCAAA 


TAATGACTTG 


AAATTAGTGC 


TGAACTCGTT 


TCTCTATCCT 


CTACAGGCTG 


7380 


AGCACCCATA 


CCAGCAAAAA 


ATAAACTCGT 


TCCTAGCAAG 


ACCGAACAAG 


CTCCTATTGC 


7440 


ATATGGCCTC 


AAAGAAAAAC 


GCTGCTTTCT 


CTCAAATTGA 


AATTCTTTCA 


TCCCATCTCC 


7500 


CATCATTCAT 


TATTACTGTA 


TATTTTGTAT 


ATCAGAAATA 


GTTTGTATTC 


ACAAATCTTT 


7560 


CTAGTT ATT C 


CCTTATCATT 


CCTAATTAAG 


GGAGATAACA 


TACAATAATT 


TTTAGTTAAA 


7620 


TGTATATCGA 


TGTTTTTTGT 


TTTTCTTAAT 


AAACGCAATA 


CAAAAAG AG C 


CTGTTACCAA 


7680 


GCTCTTTCTA 


CTCAATGAAA 


ATCAAAGAGC 


AAATTAGGAA 


ACTAGCCACA 


GGTTGCTCAA 


7740 
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AACACCCTTT 


TGAGCTTGCA 


CATAGAACTG 


ACGAAgTCAG 


CTCAAAACAC 


TGTTTTCAGG 


7800 


TTGCAGATAG 


AACTGACGAA 


GTCACTAACA 


TCTATACGGC 


AAGGCGACGC 


TGACGTGGTT 


7860 


TCAAGAGATT 


TTCGAAGAGT 


ATTAGTCTAT 


TATTTCTTCT 


CAGCGCGAAG 


GGCTGACAAC 


7920 


ATTTGTGTTC 


GGATATCATC 


CACACCATTT 


GGACTATTTG 


GTAAAAAGAT 


ACTTTGATTT 


7980 


CCTTTAGAGG 


CAAACGTATT 


CAAGGTATCC 


AAATACTGCT 


TGGTCAAGAG 


GATAGACATG 


8040 


ATTTGTTCTT 


CTGTCATGCC 


AACATTCGCT 


TCCTTGAGTT 


CGGTGATACA 


CTCTGCCAAT 


8100 


CCATCCACAA 


TCGCCTTACG 


TTGTTGGGCA 


ATCCCCACAC 


CATGAAGGCG 


GTCTTTTTCT 


8160 


GCTTCTGCTT 


CAGCTGCAGT 


GACAATTTTA 


ATCTTCTCAG 


CTTCCGCCAA 


TTCTTGTGCT 


8220 


GCGACCCGCT 


TACGTTGCCC 


CGCATTGATT 


TCATTCATGG 


ATTGCTTAAC 


TTCTGCATCT 


8280 


GGTTCCACCT 


TGGTAATCAA 


GGTTTTCACG 


ATAATGTAGC 


CGTAAGTGCT 


CATTTCTTCT 


8340 


GCTACTTGGT 


GTTGAACTTC 


AAGGGCAATC 


TCATCTTTTT 


TCTCAAACAA 


TTCATCCAAG 


8400 


GTTAATTTTG 


GAACAGAAGA 


GCGAAGAGCA 


TCTTCGATAT 


AAGATTTAAT 


CTGAGATTCT 


8460 


GGACGTATGA 


GTTTATAGTA 


ACCATCTGTC 


ACGCTCTCCT 


CCTTGACACC 


GTACTCAGTC 


8520 


GCTACATTCA 


TCATAACGAA 


CACATTGTCC 


TTCCTCTTAG 


TCTCAACCAC 


AATATCACTT 


8580 


TGCAACAAGC 


GCAACTCAAT 


CCCTCCTGCA 


ATCGACTCAA 


TCCCAAAAGG 


CAAGCCAATA 


6640 


TGAATACCGC 


TATTAGCAAC 


CTTTTGGTAT 


TTCCCAAAGC 


GTTCAATAAT 


CGCCACCGAC 


8700 


TGCTGACGAA 


CCACATAAAC 


TCTACTCAGT 


GTCACTATCA 


CCAATAGGAG 


CACACAAACA 


6760 


ATCAGAAAAA 


TCATGAAAAA 


TATTGCCATA 


ATCGAACCTC 


CACAAGTATT 


TTTCTAGTAT 


8820 


TATAGCACAT 


TTAAAGAAGG 


CTGTGCCGTT 


TTTACTGCGA 


TTTTTCCTGA 


AATGTCAATA 


6880 


ATT AG AGCTG 


AATTGTCCTA 


TTGTCGTCCA 


ATCTCTTCCT 


AAAATAACTC 


TTTATAAAAC 


8940 


GCAATCGTTT 


CTTCTAAGGT 


TCGCATAAAT 


GGATTTCCTG 


GTGCGCACGC 


ATCAATCAAG 


9000 


GCATTCTTAG 


AAAGGTATTC 


AAAGTCGAAA 


TCTTTTTCTT 


CAATACCAAG 


TTCAGTCAGT 


9060 


TTCTTAGGAA 


TACCTACTGT 


CTCAGAAACC 


TTCTCAATCT 


CAGCAATCGC 


ATAATCGGCA 


9120 


CATTCTTGAT 


CTGATTTACC 


TTCTACATGA 


AGTCCCAAGG 


CTTTGGCAAC 


ATTGCGGAAA 


9180 


CCTTCTGCTA 


CACGTTTAGC 


ATTTTCACGT 


TCTATAACTG 


CTAGCAACAT 


GGCACAGCAC 


9240 


ACGG 












9244 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(0) TOPOLOGY : linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69; 
CATCTCAACT TTATCATCAT AACTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 
TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTCGACT GACGTTTTTT TGTATGCTTA 
TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTCGTGCC ACTGCTTTTC CAAAGAAGGC 
TGCTAAGACA AGAATTCTCA AAACATAAGG TGCAATTTGA AGATAAACCG CTGGCACTCC 
TTGTAGGAAC GGCAATTGAG AACCGATAAC AGCCAAACTT TGTCAAAGTC CAAACAAGAG 
ACTAGAAACC ATAGCACCGA TTGGATTCCA TTTCCCAAAG ATCATCCCAG CAAGGGCGAT 
AAATCCAGCT CCAACAATAG TTCTCACTGA CAAGTTAACT GAGATTGATT GCCCATAAAT 
CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTCTA 
GACGTTGATT CCCAAGCTAT CCCCTGCTTG AGGATGTTCA CCCACAGAGC CGACACGAAC 
ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAACCCAA TCCCCAGATA 
ACCAACTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC ACTGGGATAT TTGCCAAGAC 
TGGGAAATCA AAGCGTCCAA AAGTTTGACT TAGCTTCTCG GTTTGTCCTT TGTTATAAAG 
AACTTTAACT AACAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCC TACCGCTCAC 
AACATGGTCT GCACGGAAAT GAACCGTCCC TGCTGCGTGG ATCATAGACA AAACACTACC 
AACCAATCCT GCTACAAGCA AGGATAGCCA TGCAGTTGCT GCTCCAAATT GTTCTGCAAA 
TTCAACGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC ATAATTCCTT CAAGGCCAAC 
GTTTACCACA CCACCACGTT CAGAGAAAAC ACCACCGATA CTTCTAAAGA TGAGAGGTGC 
TGAGTAAATC AGCATAGAAG ACACCAACAC GGGGAGCAAG CTTATAATAG ACATCTTTAC 
TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCCTT CGATAAGCTA ATGAACACTG 
ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CACATCCTAC CTGCGCCGCA 
TTCATACCAG GAGCCCCAAC TTCGAGAACG CCAAATACGA AGGCTGCAAA GAGTATACCA 
ATTCGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCGT TAAATCCGAT ACCTAATGAC 
GAACCTTCAA CATAGACGTT CTGGAAGCTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 
GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC GCTTGGCAGA AATACCAGCA 
TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 
TTGAGCATCA ACCAAATAAC TGCAACGCCA ATGATGGCAA AGAAAATACC AATATTCATC 
CGTGAGTTAC CAGTCAACTC AGCCAACCAA GGTGTCTGAT ACGTTGCATT AGCCCCAACA 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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CGAATGCTCG 


AATCTGTACT TTGCATCAAG 


TCTTTAGGGA 


AAGCATGGAT 


AAAGGCATTC 


1680 


i MWn A fy\*r\ 


ACACAATGTA CTTCATCATG 


ATGGTTACAA 


TAACCTCTCA 


CGTCCCTAGA 


1740 




GAATACCTGG 


AATCGCTCCC 


ACAATCCCAC 


CAGCAATCAA 


GGCAATCACG 


X800 




GAATCATCAA 


GCGACGGGGC 


ATATCTCGAT 


GCGACAGGGC 


AAACCAACCA 


1860 




AACCTCCCAA 


AGCCTGACCA 


GGAAGTCCGA 


CCTTAAACAA 


ACCAGCTCGA 


1920 


w rwuLAAWMt 


CAAAACCAAC 


ACCAATCAAG 


ACCAGAGGAC 


CCATAGCACG 


GAAGATTTCT 


1980 


CLAATCCCAL 


GCAGACTGCC 


AAACGCTCTA 


TAGAACAATT 


CTTCGTACCC 


CCAAATAGCA 


2040 


TCATAACCGA 


AGATCCACAT 


GACAATGGCT 


WUWnU 4 rWVl 


TTCCTACGAA 


TACACAAATC 


2100 


AAGGGAACCQi 


AAATTTGTTG 


TAATTTTTTA 


WAWV A w/%W A V» 


WW • 4 1 v 


CCAAGTTTCC 

A A WW 


2160 


ACCAGCCATC 


AAGACACCAA GTTCTTGTTT 


A I IwO 1 iul ) 


TrrCRTfiATA 
1 w luu 4 Vj/% A i% 


CAATACCTTG 

1 *%W W 4 1 V_y 


2220 


AATCTTACCA 


TCGTGGATAA 


CGGCAATACG 


V* IV- i v*/\v*/\v»v» 


TTTAAAATPT 

AAA SV%A/\ A W A 


CATCCAATTC 

Wft A WW#V% A A W 


2280 


AAAGCTGACA 


ACAAGGACAG 


CCTTGCCATT 




T/ 1 * & A*Tf A. Af!f" 


%J A 1 Awl VA»A 4 




ATACTCAATG 


GCACCGACAT 


CCAACCCACG 


nu a I uViL 1 


W 1 t\f\\*\*t\ A #W\ 


CfSACATf^ACC 

UUAwA A LAUU 


2400 


ATCTCCATCA 


ATTTCACGAG 


CAATAATTGC 


< t > k lul 1 


A 1 iV.Ll\>^iU 




2460 


TGCAGGAACT 


AATTCACTCG 


CAGCGCGAAC 




I WwM 1 V.AUV. 1 


TTTTACrATA 


252 0 


AGAAGTAATA 


TTTCAATAAT 


TCAAAATTCC 


ATTTTTACTA 


A \* * V»W 4 * W » A 


TATAGT AGCT 


2580 




ATATTTTCAG 


ATATCATCAT 


TTCCAAAATC 


AAGCCATCAC 


GGTGACGGTC 


2640 


1 11.1 UuAnU U 


TGCCCAACAC 


TTAGTTCTGT 


AATCTGACGT 


GGGTGCAAGC 


CTACAATTCA 


2700 


ATCTCGTTTT 


AGCTCAATGC 


TACCACATTC 


AACCTTACGA 


AGACCTGTAA 


TGGCTTGAAT 


2760 




TGACCATTTC 


CATCAATCCC 


CCC AATACCA 


AC AATCTCTC 


CAGCACGAAC 


2820 




AGATTTTTAA 


CAGCTGGAAC 


ACCACGGTTT 


TCATTGACCA 


CCAAATCTTT 


2880 


GATAGACAAA 


ACCACTTCTT 


TTGGTTTAGA 


GGCTTGCTTC 


TCTCTTTTAA 


AGGAAACAGA 


2940 


ACGTCCTACC 


ATCATTTCCG 


CCAAATCACC 


ATTGGTAGCC 


CCTCCAATTT 


CAACGGTTTC 


3000 


AATTGATTTC 


CCACGACGGA 


TAACTGTAAC 


ACGGTCAGAA 


ACTGCTCGAA 


TTTCATCCAA 


3060 


TTTGTGGGTA 


ATCAAGATAA 


TTGATTTTCC 


TTCTTTGACA 


AGATTTTTCA 


TAATAGCCAT 


3120 


CAACTCATCA 


ATTTCTGATG 


GAGTCAAAAC 


AGCCGTTGGT 


TCGTCAAAGA 


TAAGGATATC 


3180 


AGCCCCCCGA 


TAAAGTGTTT 


TTAAAATTTC 


TACACGTTGT 


TGGGCTCCAA 


CTGAGATATC 


3240 


TGCTACCTTG 


GCACAAGGGT 


CAACAGCTAA 


GCCATAACGT 


TCAGAAACAG 


CCTTGATTTC 


3300 


TTTCCTAGCT 


CCACCGATAT 


CTAGCACACC 


ATTTTTAGTC 


AATTCACTAC 


CTAAAATGAT 


3360 



pwcnnrm- -wn q«ip<wia5> i > 
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CTTTTCAGCC 


ACTCTGAAGG 


CTTCAACCAA 


564 
CAT AAAU l vjw 


TGGTGAACCA TCCCGATTCC 


3420 


CAAGCTAGCT 


GCTTTACATG 


GCGAGTCGAG 


ATTGACAACT 


TGACCGTTGA CCGCGATTTC 


3480 


ACCACTACTT 


GGTTCAAGAA 


GCCCTCCTAA 


CATGTTCATT 


ACCGTGGACT TACCAGCCCC 


3540 


ATTTTCTCCT 


AAAAGTGCAT 


GAATTTCACC 


TTTTCGTAGG 




3600 


GGCAACAAAT 


CCACCAAACA 


CCTTGCTAAT 


ATCACGCATC 


TCiATCACAT TTTCGTGTGC 


3660 


CATGTCCTCT 


TCCTTTCAGA 


GTCTTATTTT 


ATTTCAATAA 


& iPTT^fTAG TTTGTCTAGT 


3720 


AGCAAGCTTT 


ACTTAGACAA 


AATGACTTTG 


TCTCAACTCT 




3780 


GCTTCCTAAG 


AAATGACTTC 


CATCCATTAT 


TTTTCAGGAA 


rrTTTALt^T TCCATCAAGG 


3840 


ATTTTAGCTT 


TTGCATCTTC 


GACAGCTTTT 


TTACCTTCTT 


uAAnuu 1 i i w A * nv» a www 


3900 


AAGTCAACCC 


CTTTATCCTT 


CAATGAGTAA 


ACGATCACTT 


UALLwv. Lnuu w*w\l»w*ww* 


3960 


CTTTCTGCCT 


TGTTAGAAAT 


ATCTTTTACA 


GTTGTACCAA 




4020 


ACAACAAACT 


TTGATTCTTT 


GCCATCTTTA 


GAAGTCTATT 


TACL i I t ITjV. TlV.1 i\iOiU\ 


4080 


CGATCAACAC 


CGATAACCCA 


AACTTTTTCA 


TTTTCAGGAC 


COL ! 1 1 CX* I 1 wAw/»w#*l ill 


4 140 


GCCTCTGCAA 


AGACACCTGC 


ACCTGTACCA 


CCACCTACTT 


GGTAAACAAT AH.iw-/itLu 


4200 


GCTGCGTATT 


GTGCGGCTGC 


AATTGTTTTA 


CCTTTAGCCG 


CATCACCAAA TGAAttftutu 


4260 


TAGTCAACTT 


GGACTTTGAT 


ACATGGGTCT 


ACTGACGCAA 


CACCAGCwTT L»AJ< l i-v. i 


4320 


TCAAAACGAG 


AGATAACTTC 


ACATTCGATA 


CCACCTACAA 


AACCAACTTG TTTTGTCTTA 


4380 


GTTGTTTTTG 


CTGCAGCCAC 


ACCTGCAAGg 


TAACCTGACT 


C ATT ATCAW. oaaau i tAww 


4440 


CTCGCAACAT 


TCTTTTGCTC 


TTTAATCACA 


TCATCAATCA 




4S00 


TGTTCTTTTG 


CTGCATCTTT 


AACTGCATTA 


TTAAGGGCAA 


AACCAAv.ALLi \j>\/\w*\i 


4560 


TTGTAACTTC 


CAGCCCCTTG 


TTCCAAGTTG 


TTAGC GT ACT 


-.--iwnptp* ^rTTTGATTGG 


4620 


AAGTAAGTGA 


AACCGTTATC 


TTTTGAAAGA 


TTGTGTTCTT 


TirrrrAACC CTGCAAACCT 


4680 


TCCCAAGCTG 


ATTGGTTGAA 


TGATTTGTCA 


TCAACACCAC 


rarrraTfACT GACGATTGCT 


4740 


GCTTTTGTCT 


TCACATCAGA 


AGATGAAGCT 


GCGTTACGAG 


AAGAGCGGTT ACCACATW.A 


4800 


GCAAGTCCAA 


CTGCTGCCAC 


TGCAACTAGG 


CCAAGACCTA 


GCCATTGTTT CTTGTTCATT 


4860 


ACTGAACCTC 


CTAAATAAGA 


TGTCCAACGA 


TGTTGCAAGT 


ATGGATTGGT TGGCCACAAG 


4920 


GACCCTGCCA 


CTCAGAGAGC 


GACTCAGACT 


AGTTTAAGTC 


TGTAAAAGAG TATGGAAGTA 


4980 


ATTCCCCGAC 


CGTCATCTCG 


ACCGTCGATT 


TATCTTTTGC 


GACTAAGGTC ACTTTTAGAT 


5040 


CTTGTTCAAA 


AAATTCAGCC 


ATCACTTGGC 


GACAAGCACC 


: ACATGGCGAG ATCGGTTTTT 


5100 


CAGTTTGACC 


ATAGACAATC 


, ^AXTCTGAAA 


k ATTCTCTTTG 


; GCCTTCAGAT ATAGCCTTAA 


5160 
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AAATAGCTGT 


TCTCTCACCG 


^ft ft. mm^^^^fc 
CAATTGGTvA 


AAtiGATAOVT 


AGCATTTTCA 






CCGTGTAAAC 


ACTTCCGTCT 


TTAGCTACTA 


& & a r"w*rTt"f 
AAaV. iuL I 


uAl MfuAnnu 




3aOU 


GGACATAGGC 


ATGTTTGCTG 


GTTTCAATTG 






GVCGCCATCT 




GCCAATTCTC 


CTTTTAAAAT 


& ft ^^^^ ft 

AGCTACCCCA 


GCTGACGTTC 




UfUkU. IVw 1 




TCCACAAAGC 


CAAGAGCATC 


m^*^*fe#**ft ft^^ftfk 

TGCATAAGAA 


CGAGCTCCAC 


M M M MM M M M*M) 

CGGCGGCCTT 


UALAV.V.LATA 




TCAGATCCAA 


CTGTTTCACG 


m & mmjm ft ft fVft^tfVfc ft 

CATTAATGTA 


ft ^ ft IH^Ift^J »<■> ft 

ACATCTGCTA 


TCGTAGCAWw 






AAGCCAGTAG 


ATGTTTTGAC 


AAAGTCAGCC 


CCAGCTTTTT 


GGGCCAATTG 


ft ft ft ft ft M ft 

GCAAACAACA 


SSBO 


ACTTTTTCTT 


GGTCTGTCAG 


AAGGCAAGCT 


TCAATAATCA 


CTTTCACTAA 


CTT ATCACC A 


5640 


CTTGCTTCCA 


CTACTGCGCG 


AATATCTGAC 


TCAACCAAGG 


/Ml ft ft ft JWft* ft 

CTAAATTACC 


TGATTTGAGA 


5700 


GCTCCAACAT 


TGATCACCAT 


ATCAATCTCA 


TCTGCACCAT 


TTTGCATAGC 


TTLi iTTGTC 


5760 


TCAAATGCTT 


TCACGGCTGA 


AGTTGTTGCT 


CCCAAAGGGA 


AACCTACTAC 


TGTGCAAACC 


5820 


TTAACATCTG 


TGCCTTCAAG 


TCCTTTTTTA 


GCATGTTCAA 


CCCAGGTCGG 


ATTAACGCAA 


S880 


ACACTCGCAA 


AGTCATACTC 


TCTAGCCTCA 


GACAACAAAC 


TATCAATTTG 


tttvitcttt 


5940 


CCATCTTGTT 


TTAAAAGCGT 


ATGATCTATA 


TATTTATTTA 


ATTTCATTTC 


J — r fTirr 1T_J"M1 j j l _~l , j _ 

GGTTTTCCCT 


6000 


CCATTTAGGA 


GATGATTTCT 


ACAATTTCAC 


CCATTTTTTT 


CACTTCATCA 


CTTATTTTAA 


6C60 


cacatttttg 


GAAATCTCTA 


ACTAGTTGAG 


GTGGAATTTT 


TTCATTTGTG 


TATACTTTTG 


6120 


CAACAATTTC 


ACCCTTTTCA 


ACGGAGTCTC 


CAATCTTCTT 


TTCAAAAACA 


ATTCCTGTTT 


6180 


CATAGTCCAA 


GGCATCAGAC 


TTAACTGCAC 


GACCAGCACC 


CAGCCTCATG 


MM ft H* ft K ft M«M 

GCATAAAGAC 


6240 


CAAAGTCCAT 


AGCTGGAAGA 


GCTGAAATGA 


CACCCGTTTC 


CTGAGCAGGG 


ft MMkfKMM • MM 

ATTTCCACCA 


c inn 
5 JUU 


CATGAGCTAC 


ft IfHIHfft ft f+ ft ft 

ATTTACAGGA 


M/*» « M> ft M ft MMMJ 

CGATAGAGGT 


CTTCCAAGTC 


TCCACCTTGG 


MM— MM » MM » 




TTTCCTCAAA 


CTTAGCCAGT 


MMMMfkM Jh MM* Mj 

GCTTGACCAT 


MjMMjM » ft M « M)M 

TCTCAAGATG 


TTGGTGAACT 






TTTTGTTAAC 


ATTTGCCAAA 


CCAAGCATAA 


TTTGAGCCAA 


TTCACAAATA 


AAGTGGGTAA 


6480 


TATCCTCACG 


TCCTTGACCT 


TGCAAAATCT 


CCAATGCTTC 


AAGCATTTCC 


AGACGATTTC 


6540 


CAATCGCTCC 


TCCCAAAGGC 


TGGCTCATAT 


CCGTAATCAC 


TGCTACTGTC 


TTCCGTCCAA 


6600 


CAACCTTACC 


AAGATCTACC 


ATAGTTTGAG 


CCAACTCACG 


CGCCTCATCA 


ACCGTCTTCA 


6660 


TGAAGGCACC 


CTCACCGACA 


GTCACGTCTA 


GCAAAATAGC 


ATCCCCCCCT 


GCCGCAATTT 


6720 


TCTTGCTCAT 


CACCGAACTC 


GCAATCAAAC 


GAATCGTGTC 


GACACTTCCG 


GTCACATCAC 


6780 


GAAGGGCATA 


GAGAACCTTA 


TCTGCTTTGA 


CCACCTCGTC 


TGATTGCCCA 


ATGACAGATA 


6840 


CTCCAATATC 


CTGAACCTCA 


CGAATAAAAT 


CCTCTTGACT 


ACGTTCTACT 


TGATAGCCCT 


6900 



OMcrwin. ,u/n on i qoh t -s 
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TAATGGACTC 


CAATTTATCA 


ATTGTTCCGC 


CTGTATGGCC 


AAGACCACGA 


C CACTCATTT 


6960 


TTCCTACAGG 


CACACCGAAG 


CTAGCAACAA 


GAGGAGCTAA 


AATCAAGCTT 


ACCTTATCGC 


7020 


CGACACCACC 


AGTAGAATGC 


TTGTCAACTT 


TCACACCATC 


AATGGCTGAC 


AGGTCAAACT 


7080 


CTTGCCCAGT 


CTTAACCATA 


TTCATCGTTA 


AATCACAGAT 


TTCTCGAGTC 


GTCATTCCTT 


7140 


TAAAATAAAC 


AGCCATAGCA 


AAGGCAGACA 


TCTGATAATC 


AGGAACAGTT 


CCTGATACAT 


7200 


AGCCTTCTAT 


CACCCATTCA 


ATTTCACTTG 


AAGTCACTTC 


TTGACCCTCT 


CGTTTTTTTT 


7260 


CCATTAAATC 


AACTGCTCTC 


ATTCTTTCAC 


ACTTCTAAGG 


ATATAGTATC 


cerruTcrrr 


7320 


TTTAAGGATT 


TCACAATTGC 


CAAACACATC 


TTCCATCTTA 


GACTTGGCAC 


TTGGACCTCC 


7380 


TTGTTTTTTC 


TGGATGACGA 


TGGTCAAATC 


TCCACCAATT 


TCCAAGAAAT 


CTTTACTTTT 


7440 


CTCGATGATT 


TCATCAACGA 


CTTGCTTGCC 


CGCACGGATA 


GCAGGATTGG 


AAATGACATG 


7500 


GTCAAATCGC 


CCTTGAACTC 


TTGCATAAAT 


ATTAGATTGA 


AATATCGTCG 


CTTTTGCATT 


7560 


ATTTTTTTCA 


GCATTTCTCT 


GAGCTAAATC 


CAGGGCACGA 


GTCTTAATAT 


CAACCATGCT 


7620 


CGCCTGAACT 


CCGTAAACCT 


TGACCAAGCA 


CAAACCTAAT 


GGACCATAAC 


CACACCCTAC 


7680 


ATCTAGGACT 


GTCTCTCCTT 


GGTTGACATC 


CAGACACTTG 


AGCAAGAGTT 


GACTTCCAAA 


7740 


GTCAACCATT 


TTCTTGCTAA 


AAACACCCGC 


ATCTGTCAAA 


AAAGTCATTT 


TTTCTCCCAA 


7800 


CAAGTCCACT 


CTCAACTCAT 


GAATGTCGTG 


AGCAGCCTCA 


GGATTTTCTG 


CATAGTACAT 


7860 


TTTACTCATG 


ACACTATTTT 


ACCATAATTT 


GACTCAAATT 


GTAAATCCTT 


TACAAATTGA 


7920 


TAATAAAACG 


AAAAAGACCC 


AAGAAAGCAA 


GTCACCAAGC 


CATTTTCTTC 


AATCTCTTTC 


7980 


AACACTTATA 


AATAATAAAC 


CATTTAGAAC 


TATAAATATC 


ACAGTCCAGA 


TAAAAACAAA 


8040 


AAGTTTATCA 


TCTATAATCA 


GGCAGATTAT 


TATTTCTATT 


GCTTAACCTT 


AAAATACTTT 


8100 


ATTATCAACA 


AAATTCCTAA 


CAAAATGTTT 


AGATAAAACC 


CCAACTGATA 


CGTTTATGTC 


8160 


ACGATTTCCA 


AACTTGTCCA 


AAGTCGTATC 


AAATCTTCTA 


GTGACATGTG 


GAAGAAATAA 


8220 


CCCTCTCTCG 


CAATCCGTAG 


GACTAAAAAG 


CAATAACTAC 


CCGCAGCAAT 


CCATTTCGTC 


8280 


CATCGTTTTT 


TAGTAAGAAA 


GCAATTAAGA 


ACGAACAAAT 


AAAGACAGCT 


CTTACAATAG 


8340 


CATGTTCCAT 


CAAAAAAGTA 


AAACCGTAAT 


AGGTTTCCAC 


AAAGCATCTA 


CCATTATCTG 


8400 


CATTGGTTCC 


TTTTATAAAA 


GGTAAAGCAA 


AACTTAAAAT 


AAAACAGAGT 


TCCAATATCT 


8460 


AACGTTTTAA 


GATTTTCATA 


CTACACCTCC 


TATAACTTCT 


GAACTAAAAA 


GCCCCCTTTA 


8520 


TAAGCTTATA 


AATCAGTAGA 


ATCTATCTCC 


TATTTCATCA 


ATAAATTCAT 


CACTTATACT 


8580 


ATATACCATT 


GACTTACCAC 


ATTCAAGAAA 


CCGCTTTATT 


TTTTTAGCTT 


TTTATGGTAT 


8640 


GATAGACAAA 


ATATCTAGGG 


GAAAACAAAT 


GACCAACGAA 


TTTTTACATT 


TTGAAAAAAT 


8700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAACACAACA CCTCCTTTGA CAGAAGAAGA 87 60 

ATTGGAATCT ATCAACAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 8820 

TCTCCCCTTG GCTCATTTCA TTCAGATTTA CAAGCGAACT AAGGAAGATT TAGCCTTTTC 8880 

AAAAGGAATT TTCCTCCA 8898 
(2) INFORMATION FOR SEQ ID NO: 70: 

<U SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TATCTTAACG aGGATTGGGT TTATCGTCAG TCTTATTGCC CTAATTGTGG GAACAATCCC 60 

TTAAATCATT TTGAAAATAA TCGGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 120 

GAGTTTGAAC TAAACAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTGCTTAT 180 

GCAACGATGA TGAAGCGTGT GCAGGCAGAT AATAATCCTA ATTTCTTTTT TTTAACTTAC 24 0 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AGCAATTTCT TACACCCAAA 300 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGCTTG GATTGGTTGT 360 

AACATTGATT TATCACAACT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA AGATGGACAA 420 

GTTAGAGATC CAGAAAAAGT TACAAAAGAA TTTAAGCAAG CTTTATTTTT AAGGAAGACC 480 

TCTCTGTCAT CAAGAGGTTG GACAATACAA ATTCTAAATT GTATAGATAA GATAGAGGGT 540 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 600 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 660 

ATAATAGAAT TTAAAGGTAG AGGAAAGTAT CGGAAATTAT CAAAACCAAA CAACTTGT^C 720 

CATCAGAAGA GGTCTATGAT TTCTTAAAAG TCATCTGCCC TGATTATGAA ACTGAAAGCC 780 

GTTACGATAA CCTAAGTTTA ATCCTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGGT 840 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 900 

GGTGGCAAGT AGGAGATAAA TTGCCAGAGT GGCTACATAG CTCCTATCAT AGATTATTGT 960 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA CTATACAGTA GAAATTACAG 1020 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTGAAAATCC AGATGAAGCC GAACGACTTG 1080 

TAAGAGAAAA GTATAAGAGT TCTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT 


ATATGAATAG 


GTAGATGTTT 


TTATTTTGTC 


AACAAAAAAG 


AGGCTCGCAC 


1200 


CTCTTTTTCT 


TATTTCTTTT 


TATGATTTAA 


TACCGCATTG 


AGGACAATAG 


CGAGTAGGCT 


1260 


GGCTACCACG 


ATTCCCTTTG 


AGAAGAACAT 


TTGGAACGCT 


GTCGGCATGC 


TGACAAAGAG 


1320 


ATTACTGTTG 


TTGAGACCGA 


CACCTGCAGC 


GATTGAAACA 


CCTGCGATAA 


GGAAGTTGTG 


1380 


TTCATTGTTA 


GCAAAGTCAA 


CACGGGCGAG 


GATTTGCATC 


CCTTGAATTG 


ATACAAAACC 


1440 


AAACATTACC 


AGCATGGCAC 


CACCGAGGAC 


GGAGCTTGGA 


ATGATTTGGG 


CAAGGGCGCC 


1500 


AAACTTAGGA 


AGCAGTCCAA 


GGAGAACCAG 


GAAACCAGCT 


GCGTAGTAGA 


TTGGCAGGCG 


1560 


TTTTTTGATG 


CCTCACAATT 


TAACCAAACC 


AACGTTTTGT 


GAAAATCCGG 


TGTAAGGGAA 


1620 


GGTGTTAAAG 


ATTCCTCCGA 


GAAGTACGGC 


CAAACCTTCT 


GCGCGGTATC 


CGTTGCGAAC 


1680 


GCGCCTGCTG 


TCGATTGGAT 


CCTTTGTGAT 


ATCACACAAG 


GCCAGATAAA 


CACCAGTTGA 


1740 


CTCAACCATA 


GACACCGTTG 


CGATGATACA 


CATCATGACA 


ATAGATGAGA 


TTTCAAAGGT 


1800 


TCGCATCCCA 


AAGTAGAGTG 


GAGTTGGGAC 


ATGGACAAGT 


GGAGCTACCG 


CAACAGGAGA 


1860 


GAAGTCCACC 


AAGCCCATAG 


TAGCACCAAT 


GGCAGTTCCA 


ACAACCAGAC 


CAATCAAAAT 


1920 


AGAGATAGAC 


TTCATAAATC 


CTTTGCTAAA 


GATCTTCATC 


AAGAGGATAA 


TC AG AAC ACT 


1980 


AATAGCTCCA 


AGCAAGAGAC 


TTTCACCAGT 


TGGCTCTGGA 


ACGTTATTTC 


CCATATTTCC 


2040 


AATAGCGACA 


GGCATCAAGC 


TTAAACCAAT 


CGTGGT AAT A 


ACAGATCCTC 


TTACGATAGA 


2100 


TGGGAAGAGA 


TTGGCTACTT 


TTGACAAGAT 


GCCTGAAACA 


AGAACCACCT 


AAATCCCAGA 


2160 


TGCGATAACG 


GCACCAAACA TAGCGCCACT 


ACCATGGCTT 


TCCCCAATCA 


TAATCAAGGG 


2220 


AGCGACCGAC 


TGGAATGCAA 


CTCCAAGAAC 


GACTGGCAGT 


CCAATCCCAA 


AGTATTTGTT 


2280 


GAGTTGGAGT 


TGGAGGAAGG 


TTGCCACCCC 


ACACATGAAG 


ATATCTCTAC 


AAATCAGCTA 


2340 


GCTCAACTGC 


TCAGCTGAAT 


ACCCAAGGCC 


TCTCGCAATC 


ATGATGGGAA 


CCAGCATAGA 


2400 


TCCTGAGTAC 


ATGGCTAGTA 


AGTGCTGCAA 


CCCAAGAACG 


GCTGCTTGCG 


AGTGTTTTTC 


2460 


TTGAGTTTGC 


ATTAGAGATC 


TGCCTCCTTA 


AATACGACTT 


GACCATTTTC 


AAAACAATCC 


2520 


AAACCACCAA 


GTGATAGGAC 


AGGGTAGCCT 


GCTTTTTCAA 


GCAAATCACG 


ACCATCTTGG 


2580 


AAGGATTTCT 


CAATCACGAT 


ACCGATAGCT 


TGCACTGTGG 


CACCGCCCTG 


TTCGATGATT 


2640 


TGAATCAAGC 


CTTTAGCACC 


TTGGCCATTA 


GCAAGGAAAT 


CGTCGATAAT 


CAAAACCTTG 


2700 


TCCTCTGGTG 


ACACGAATTT TTCAGCGATA 


GAAACGGTGC 


TGGTCACCTG 


CTTGGT AAAC 


2760 


CAGTACACTT 


GAGCAGTTAA 


GATGCCTTCG 


TTCATGGTCA 


TCTTCTTAGC 


TTTTTTGGCG 


2820 


AAAATCATGG 


GAACGTTTAA 


GGCTTCAGCT 


GTAAAAACGG 


CTGGGGCAAT 


ACCCGACGCT 


2880 


TCAATGGTTA 


CG AC CTTGGT 


AATGCCAGTA 


GTACCAAATT 


TTTCCGCAAA 


AACCTTACCA 


2940 
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ATCTCTCCCA 


TCAAGCTAAA 


(ji WAAL I iw 


TfVlCTTAAAA 


AGGAATCTAC 


CTTGAGGATG 


3000 


TTATCACCCA 


AGATATGCCC 


ATU LiTGAWj 




CTAATAATTT 


CATAAGACCT 


3060 


CCTAAAGTCT 


AAAAGTTAAT 






TCTATAGTGA 


TCOCTTTTGC 


3120 


TAATACTATA 


TATTTGATAA 


AACTATTACG 


AuLl«AAljLV**% 


VJ 1 w * • & ft»#m 


ATATTTCCCG 


3180 


TTCTAGTGGT 


ATCATAGACA 


ATAATCTTGT 


TATTGTCTAT 




TTGAGAGTAA 


3240 


AATAGTTCCG 


GGAACTATTT 


TAGCCTAAGC 


ftm ft ft ft * K 

L 1 AUAAATVjA 


nAuAW. 1 AV»Vj 


1 %»#%#wwwft 


3300 


TTACGCATCA 


AATTCCCTGG 


ATTCCTGAAA 


TTATTCAtAG 


r* ft f» ft. ATTTPl 


LV> 1 LULU & Lw 


3360 


CCACTAATTA 


AGGGAAATAT 


TAAAAAAAGA 


CCTAL1 lAAT 






3420 


TAGACATGGC 


AAAAACGGCC 


ATATCTCACT 


GCTGACTTAC 


TTATTGTTAG 




3480 


ACCTTGTAGA 


AACGTCGTGC 


CAATTCACGA 


CATAAACAAG 


TAAAACUATA 


TTCAATTTTA 




AATAGGCTTG 


ACCCAATGTT 


TTTATTTTAC 


ACTAAATAAC 


TTTAGAAATC 


ft & f"T» ft T^TT^IV? 

AACTATTTTvi 


i Ann 


TTAGTGTTTT 


GGTTTAAAAA 


ACGAACAAAA 


AGAAGAGAGG 


GTGAACAAAA 


ACTCCATTGT 


J DOU 


AAGCTAACAG 


TTATACTAAA 


TGAAAATCAA 


AGAGCAAACT 


AGCAAGCTAT 


f /*ftf ft ^ 
CCAWAAWWIV 


J / * u 


AAAACACTGT 


TTTGAGGTTG 


TGGATAGAAT 


TGACAGAGCC 


ft /»^» ft ft ^ ft *P 


ALL. 1 /\V_ L*\J 1 A 


3780 


AGGCGACCTT 


GACGTGCCTT 


GAACACATTT 


TCGAAGAGTA 


ft ft ft ^ ft ^P^P 

TTAGAAGAl i 


I I ILLAlLAi 


3840 


AAAAGGCATA 


CTATCAAGCT 


TTTAGACACC 


TGACAATATG 


CCTTTTTCTA 


ftf Hl»l»|l ft ft » /*" ft 

ALTilAAAuA 


ion/) 


CTTTTCCCAA 


TTTTTATTAT 


TCTACTCGCT 


AAATCTTAAA 


AAAiAULLAl 


L 1 1\*LAA 


3960 


AACTCCAAAT 


TTATGAGGAT 


AGATATAGGG 


ft *w*r* ft r*wT*^* k ^ ft 

ATCACTGACA 


r+f ft » ft 

LuAAAL 1 1 tv. 


TTTTGfTTCAA 


4020 


GGGACGATAA 


ATAGGATAGT 


TTGCCTTCAT 


CACTCTTTAA 


TAGAGTTTTG 


ft ft & riTrriT 

nAALA 1 LL A * 


4080 


TATGCCAAAG 


GAGACATTGA 


CTCCACGACC 


AAAGCGATAw 


1 V> Aw i 1 w/vo 


CT AGTTG ATC 


4140 


CTTTCTTCCC 


TCCTCTAACA 


TTAGTTGACA 


LTLiitAnW* 






4200 


GGACGTTGGT 


ATTCAATCCT 


AAAACCCAGT 


AAACCACACT 


AGAAGGACCC 


GGACTGTTCG 


4260 


ATATTCGATA 


CAAGCAACTC 


GGGAATGACC 


GCATTGTAGT 


CCATATAGAA 


AATCCTTACA 


4320 


AGTCAATTTC 


CAAGACAATC 


GGTGTATCCT 


CTTGGCGAGC 


ACCTGAGTCA 


ATCATATCAG 


4380 


ATTTAGTGAC 


CTTCTCAGCG 


ATACGGTTAC 


TTGTGAGCCA 


GTAGTCGATT 


CTCCAGCCTG 


4440 


TATTGTTGAT 


TTTAGAAGTT 


TTGCTGCGTT 


GTGCCCACCA 


AGTGTAGCGT 


TCAGGAACAT 


4500 


CGCCATGAAC 


ATGGCGGAAG 


GTGTCTGTAA 


ATCCAGTTGC 


CAAAAGGTTG 


GTAAATCCAG 


4560 


CACGTTCCTC 


GTCACTAAAT 


CCAGGTGAAC 


GGCGGTTGCT 


AGCAGGATTT 


CCAACGTCGA 


4620 


TTTCATTGTG 


GGCTACGTTG 


TAGTCACCGG 


TCGCAAGGAC 


TGGTTTTTCT 


TTGTCTAGTT 


4680 
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CAGCCAAATA CTCAGCATAT TTGGCATCCC AGACTTGGCG TTCTTCCAAG CGTTTGAGAC 
CGTCACCAGC GTTTGGAGTG TAAACTTGGG TTACGAAAAA TCCATCAAAT TCTAGAGTGA 
TGATACGACC TTCCAAGTCC ATGCTAGAAG GGGCACCGAT TTCTGGGAAG CTGATAGTAG 
CTCTAAGTTC TTTCTTATAA AGGAACATGG TTCCAGCATA GCCTTTACCG GCAGGCTCTT 
GGGAACAGCG CCACGTGTTT TCCTAGCCTC GGAAGAGTTC TTCTAAAATT TCCACGTGTT 
TCTTTCTAGG TCCTTTGGCA GAAAGCTTCG TTTCTTCCAT AGCAATGATA TCAGCATTTT 
CAGCGACCAA CCTTTCTAGG ACTTCTTGGG ACAATTTGGC ACGAGCTGAG TCACTAGTTA 
GGGCAGCGTT TAGGGAATCA ATATTCCATG AGATAACTTT CATAAACTTA CCTTTTTCAT 
TCAGATTATA GATTTTATTA TACCAAAAAA ACATCTATTT CCCCAACGTA TGGTTTGAAA 
AATTACTCTC TTTCGTTTAT AATTAAGAAT GATTTTATGA AAGGGAGTGA AAATACATGA 
AATTCTACTC TTATGACTAT CTACTCAGCC AAATCGGTCA GCAAAATGGT ATCATGGTTG 
GCTTTGGGAT TGTTCTATTA GCTGTGACAG TTTTTTTTGC TTTCAACGCA TACCATAATA 
AAAAGGCAAG CGAATTTCGT GAGTTGGTCA TGATTTCAGA TCTGGCCTTA TTTACCTCTG 
CTTTTGGTCA GCATCACGAC TTATCAAAAC AATCAAGTTT CTAACAATAA ATTTCAAACT 
TCACTTCATT TCATCGAGGT TGTTTCCAAA CATTTCTGAC TAGACAAGTC AGAAGTCTAT 
GTTAATACTT CCACAAACAC AGATGGCGCA CTTATCAAGC TGGGAGATCG CTATTATCGT 
GCCCTAAATG GAAGTCAGCC AGACAACTAC CTCTTACAGA AAGTCCAATT GTATAAGACA 
GACGCAATTC AACTGGTCCA TCTGAACAAA TGACACTTAA TTATATCGAA ATTTTAATCA 
AACTGGTCTT GACTCTCAAA TAGCTCAACA ACAATGTTCA CTTTGTGAAA CCTTTCATTG 
ATGGTAAGCC AACTCTCCTT ATCAAAAATG GGAATATTGA CCCAGAAGCC TGTCGTTCAG 
TTGGTTTGTC TCCATCGGAT CTATCCCTCA AACTTCGTAG CCAAGGCAT7 TTCCAGATGA 
AGCAAGTCAA ACGACCTCTC CAAGAGCAAA ATGGGCAACT CATCGTTCTG CAAATGGGAG 
ATGAAAATCC TAAGTATCCA GTTGTGACTC ACGGTGTGAT TCAACTACAT GTCTTGCAAT 
CGATTGGTCG TAGCGAAGAC TGGTTGCTTG ATAACCTCAG TAAACAAGGG CATGACAATG 
TAGCCAATAT CTTTATTGCT CAATATGACA AGGGTCCTGT TACAGTCCTA ACTTATGAAT 
AAGAAAAACC TCGGGTCTTG TACTCTTCGA AAATCTCTTC AAACCGCCTC AACGTCGCCT 
TGCCCTATCT AGGTTACTGA CTTCGTCAGT TCTATCTACA ACCTCAAAGC AGTGCTTTGA 
GCAGCCTGCG GCTACTTTCC TAGTTTGCTC TTTGATTTTC ATTGAGTATT GGCCTCAGGT 
TTCCATTTGC AATCAGAAAC GGATTTTATC TCCATTATTC AAAAACTTTG GTGGTTTTTC 
AAGTTAGAAA AACGCCGTTA TCTAGTCGGA ATTGTGGCCC TGATCTTGGT TTCCGTCCTC 



4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 
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6000 

6060 

6120 

6180 
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6300 
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AATCTCATTC CTCCTATGGT TATCGGGCGG GTCATTGATG CCATCACATC GGGGCAATTA 6S40 

ACCCAGCACC ACCTCCTTCT TAGCCTATTT TACTTGCTAC TTGCAGCCTT TCGTATCTAC 6600 

TATTTGCGCT ATGTGTGGCG TATCTATATC CTTGGGACCT CTTATTGCTT GGGACAGATC 6660 

ATGCGCTCTC GCTT G TTTAA GCATTTCACA AAAATGTCGT CAGCCTTTTA TCAAACCTAT 6720 

CGGACGCGTG ATCTGATGGC ACACGCAACC AATGATATCA ATGCCTTGAC TCGTTTACCA 6780 

GGTGGCGCTG TCATCTCTGC GGTGGATGCC TCTATCACGG CTCTGGTGAC TTT G TTGACC 6840 

ATGCTCTTTA GCATCTCATG GCAGATGACT CTT G TTGCCA TTCTCCCCCT ACCTTTCATG 6900 

CCCTATACGA CTAGTCGCCT AGGGAGAAAC ACTCATAAGG CCTTTCGCGA ATCCCAACCT 6960 

GCTTTTTCTG AACTCAATAA CAAGGTACAG GAGTCCGTAT CAGGTATCAA AGTGACCAAG 7020 

TCTTTCGGTT ATCAGGCAGA CGAGTTGAAG TCTTTTCAGG CACTCAATGA ATTAACCTTC 7080 

CAAAACAACC TGCAAACCAT GAAATATGAT AGTCTCTTTG ACCCTATGGT TCTCTTGTTT 7140 

CTTGGTTCCT CCTATGTTTT AACGCTTTTG GTTGGCTCCT TGATGCTTCA GGAAGGGCAG 7200 

ATTACAGTTG GGAATCTAGT CACCTTTATC AGCTATTTGG ATATCCTGGT CTGCCCTCTT 7260 

CTGGCCATCG GTTTCCTCTT TAATACTACT CAGCGAGGGA AGGTTTCTTA CCAGCGGATT 7 320 

GAAAATCTTT TGT CTCAGG A ATCTCCTGTA CAAGACCCTC ACTTTCCTCT CGATGGTATT 7 380 

CAAAATGGGC CTTTGCACTA TGCCATTGAC AGCTTTCCTT TTGAAAATCA GGAAACACTG 7440 

ACCGATATTC ACTTTACTTT GGCAAAAGGG CAAACACTGG GCTTGGTTGG GCAGACAGGC 7500 

TCTGCCAAAA CCTCCTTAAT CAAGCTCCTC TTGCCTCAAT ACGATGTGCA TAAGGGTCCC 7560 

ATTTATCTAA ACGGTCACGA TATTCGGGAC TATCCTCTGA CAGACCTTCG CAGTCTCATG 7620 

CGCTATGTTC CTCAGGACCA GTTTCTTTTT GCGACTTCAA TCCTACACAA TATCCCCTTT 7680 

GGCAATCCTA ACTTGCCCCT TTCAGCCGTC GAGGAAGCTA CTAAGCTAGC CCGGGTTTAC 7740 

CAAGATATTG TAGACATGCC TCAAGGATTT GATACGCTGA TTGGTGAAAA AGGAGTCACT 7800 

CTTTCTGGTG GTCAAAAGCA ACCGTTGGCT ATGAGTCGCG CTATGATTTT ACACCCTGAT 7860 

ATCTTGATTT TGGATGATTC CTTATCCGCC GTAGATGCCA AGACAGAGTA TGCGATTATC 7920 

G AC AACCTC A AGGAGATGCG AAAGGACAAG ACAACCATTA TCACTGCCCA TCGCCTCAGT 7980 

GCTGTTGTCC ATGCAGATTT TATTTTAGTT CTACAAAATG GTCAAATTAT CGAACGAGGC 8040 

ACGCACGAAG ACTTGCT AG C TTTGGATGGC TGGTATGCCC AAACCTACCA CTCTCAGCAC 8100 

TTGGAAATGA AAGGAGAAGA AGATGCAGAA TAAACAAGAA CAATGGACTG TATTGAAGCG 8160 

CTTGATCTCT TATCTCAAGC CTTATGGACT CCTGACCTTT TTCGCACTCA GTTTTCTCCT 8220 
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AGCGACGACG GTCATTAAAA GTCTCATACC CCTCCTGGCT TCCCACTTTA TCGACCAGTA 8280 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTATG CTCTCTACAT 8340 

CCTACAAACT GTAGTTCAGT ATGTCCGCAA TlT l' l TCm GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTGGGCA TGTCTTACTT 8460 

TGACAAGACG CCAGCACGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 8520 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8580 

CCTTTATACC ATGTTCCTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCC AAAAAACTCA CTGAAAATCA TCGAGAAAAC 8700 

CAGAACTCTC TTCTCAGATA TCAATAGTAA GCTGCCAGAG AATATCCAGG GAATCAGGAT 8760 

TATTCAGGCC TTTAATCAAG AGAAGCCCCT GCAGCCACAA TTTGATCAAA TCAACCAAGA 8620 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTTTTTGA GACCTGCCAT 8880 

GAGTTTCCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT ACCGTGGTTT 8940 

TTCTATCGGG ATAACGGTCG GGACCATGTA TGCCTTTATC CACTACATCA ACCCC C TTTT 9000 

TGACCCCTTG ATTGAGGTGA CCCAAAACTT TTCAACTCTG CAAACGGCTA TGGTTTCTGC 9060 

AGGTCCTGTC TTTGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAACTC CAAGAAGGCA ATATCCGTTT TGAACATCTC TGTTTCTCAT ATCACCCTAA 9180 

ACATCCCATT CTGGATGACA TTTCTTTCTC TGTTAATAAG GCTGAAACCA TTGCCTTTGT 9240 

AGCTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATCCCCT TTTATGAATT 9300 

CCAGTCAGGG AGAGTTCTCT TGGATGATCT GGATATCAGG GATTTCAGTC AAGAAGAGCT 9360 

CAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9420 

CAATATCCCC ATGTACCAAG AAACCAGTCA TGAGCACGTT CAGCCTGCGG CAGCCTTTGT 94 80 

GGATGCAGAT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGAGCG 9540 

TGGTTCGAGC TTCTCTACTG GCCAACGCCA GCTTCTTGCC TTTGCTAGAA CAGTCGCCAG 9600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC GACAGCCAAT ATTGACTCTG AAACACAAAC 9660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTC CTATCGCTCA 9720 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATCTC TTGGATAAGG GACGCATTAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTCCGAGCA ACCTATCACA AGATGTATAG 9840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9960 

TTACAGATTT CTTTCACCCC CTTTTCCATT TTGTCGTATA ATGAAAAATC TTGACAAATA 10020 
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GTATAATAAA 


AACAAAGGAG 


AACAGCATCC 


TGAAATGGGA 


AGACTTGCCT GTGGAAATGA 


1 AAflA 

lOOSQ 


AATCAACCGA 


GGTTCAGTCT 


TACTACCACC 


TTGTCTCTAA 


AAGGAAGGGT TCGCTGATTT 


1U14U 


TCAAGCGTTG 


CTTGGACTGG 


CTTTTCGCCT 


TGGTCTTACT 


GGTTCTGACC 


TCTCCCATCT 


10200 


TTCTCATCTT 


GAGCATTTCG 


ATCAAGTTGG 


ATAGCAAAGG 


GCCAGTGATT 


TACAAGCAAG 


1 ft ^ f A 

10260 


AGCCTGTGAC 


CCAGTACAAC 


CGTCGGTTCA 


ACATTTCGAA 


GTTTCGTACC 


ATGGTGACGG 


10320 


ATGCGGATAA 


AAAAGCAACT 


CTGGTGACTT 


CTGCTAACGA 


TAGCCGCATT 


ACCAAGGTTG 


10380 


GAAATTTCAT 


CCGACGTGTC 


CGTTTGCACG 


AACTGCCTCA 


GTTGGTCAAT GTCCTTAAAG 


10440 


GTGAGATGTC 


CTTTCTCGGT 


ACACGACCTG 


AAGTGCCACG 


TTATACAGAG 


CAGTATAGCC 


10500 


CTGAAATGAT 


GGCAACCTTG 


CTCTTGCAAG 


CAGGGATTAC 


CTCTCCAGCC 


ACCATCAACT 


10560 


ACAAGGATGA 


GGACACAATT 


ATCAGTCAAA 


TGACGGAGAA 


AGCTCTGTCA GTTGATCAGG 


10620 


CCTATGTGGA 


GCATGTTCTT 


CCTGAAAAGA 


TGCCCTATAA 


CCTCGCCTAT 


CTCCGAGAGT 


10680 


TTACTTTCTT 


TGGGGACATC 


AAAATCATCT 


TTCAAACCGT 


GTTTGAGGTA 


CTAAAATAAA 


10740 


GTACTCATAA 


CAAAATGAGT 


ACAGATAAAA 


GGAGCAAATC 


AATCCCAAAT 


TACAATATTC 


10800 


CATTTTCACC 


GCCTGATATC 


ACACAAGCAG 


AAATTACTCA 


AGTACTGGAT 


ACCCTGCGTT 


10860 


CTG GTTGG AT 


CACAACAGGT 


CCTAAAACAA 


AAGAACTGGA 


GCGCCGCTTG 


TCTCTTTACA 


10920 


CACAGACACC 


TAAGACTGTT 


TCTCTCAACT 


CTGCGACAGC 


CGCTCTGGAG 


TTCATTTTAC 


10980 


CCGTTTTGGA 


AGTGGGACCT 


GGTGATGAAG 


TCATCGTTCC 


AGCCATGACC 


TATACGGCTT 


11040 


CATGTACTGT 


CATTACGCAC 


GTCCCAGCAA 


CCCCTGTCAT 


GGTGGATATC 


CAAGCAGATA 


11100 


CGTTTGAGAT 


GCACTATGAC 


CTGCTTGAGC 


AACCTATCAC 


TGAGAAAACT 


AAGGTGATTA 


11160 


TTCCAGTAGA 


GCTCGCAGGG 


ATTGTTTGCG 


ATTATGACCC 


TTTGTTCCAA 


GTCGTGGAGA 


11220 


AAAAACGTGA 


CTTCTTTACC 


GCTTCAAGCA 


AGTGGCAAAA 


GGCCTTTAAC 


CGTATTGTCA 


11280 


TTGTCTCTGA 


TAGTGCCCAC 


GCTTTGGGAT 


CTATTTATAA 


AGGACAACCT 


TCTGGTTCTA 


11340 


TCGCTGACTT 


TACTTCCTTC 


TCATTCCATG 


CAGTTAAGAA 


CTTTACAACG 


GCAGAAGGTG 


11400 


GAAGTGCCAC 


TTGGAAAGCC 


AATCCAGTGA 


TTGATGACGA 


AGAGATGTAC 


AAGGAATTCC 


11460 


AAATCCTTTC 


CCTTCACGGG 


CAAACTAAGG 


ATGCTCTTGC 


CAAGATGCAA 


CTGGGGTCAT 


11520 


GGGAATACGA 


TATCGTTACA 


CCAGCCTATA 


AGTGCAACAT 


GACCGATATC 


ATGGCTTCAC 


11580 


TTGGTTTGGT 


ACAATTGGAC 


CGCTATCCAA 


GTTTGTTGCA 


ACGCCGTAAG 


CACATTGTGG 


11640 


ACCGCTATCA 


TAGTGGTTTT 


G C AGGTTCTC 


GCATCCATCC 


TTTCGCACAC 


AAGACTGAAA 


11700 


CTGTCGAATC 


TTCACGCCAC 


CTCTACATCA 


CCCGTGTAGA 


AGGAGCAAGC 


CTAGAAGAAC 


11760 
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GCAACCTCAT 


CATCCAAGAA 


TTGGCTAAAG 


CAGGAATTCC 


AAGTAATGTT 


CACTACAAAC 


11820 


CCCTTCCTCT 


CTTGACAGCC 


TATAAGAATC 


TTGGATTTGA 


TATGACGAAC 


TATCCTAAGG 


11880 


CCTATGCCTT 


CTTTGAGAAT 


GAAATTACCC 


TCCCTCTTCA 


TACTAAATTA 


AGCGATGAAG 


11940 


AAGTAGACTA 


TATCATTGAG 


ACTTTCAAAA 


CAGTTTCTGA 


AAAAGTCCTA 


ACTTTATCAA 


12000 


AAAAATCACA 


AACTACAGTC 


AAGCGAAAGT 


GATCCTGCCC 


L'TAAAAAGTC 


TAATTGAGTG 


12060 


TAAAAACTGT 


TGTTTTCAAT 


TGATAATAGT 


TTACACCTGT 


AGTTGAGGCC 


CCTTTCTCCT 


12120 


CAGAGAGAGA 


ATTTTTATAG 


GATTTTCCTT 


TCTTCTGGGA 


CTCCCGTGGT 


TTGAAATAAG 


12180 


ATGTGAGCAA 


TTTAGTGTAG 


CATTTAGAAT 


CCTTACTAGA 


CATCATTTAG 


AAAATCTAGT 


12240 


CTCTTGTTCT 


AGTTTTCAAT 


TCACCCTATT 


TTTTGAAAGA 


CGTGAGTTTC 


CATGAGTCAG 


12300 


ATTGTGGAAA 


CTCGCGTCTT 


TTTTTGTTTT 


CACAATATTG 


TTCAAAATTT 


TGTGCCTGTC 


12360 


TTTCATGTTC 


TAGTCATTCT 


TTTGCATGAT 


AGAATTTATA 


CCATGTTGAT 


ATTATAATAA 


12420 


TACAAATATT 


CTATATGTTT 


AGTGATGCTT 


GCTATACATT 


ATTAGATCTC 


CTGCGAGACA 


12480 


ATCTATAAAA 


CACTTCTCTA 


CGATTACCTA 


TATGCCCTAT 


TCCAGTATTT 


TAGAACCACT 


12540 


GCATCTATTT 


TTATCGAGGT 


TAAATCTAGC 


TTTTATAGAA 


GGTCTATTTA 


AGAAATATAT 


12600 


TGTAGTCTTT 


TAGTTTCAAT 


CCGCCATATC 


AGCGATATTC 


AGCTAAATAT 


CCCTGGCGAA 


12660 


TGCTTCTATG 


ACAAGGTATT 


TGTTCTTTCA 


TTTATAATTT 


ACAACATATC 


AACAAATTTA 


12720 


AATATACTAA 


ATGGGATATT 


TTATATTCAA 


GCTAAGAAAG 


ATAGCATCAC 


TTTTGAATGG 


12780 


AAGGCTAAAG 


AGCAAACTAG 


GAAGTTGGCC 


ATAGATAGCT 


CAAAACCCTG 


CTTTGAGGTT 


12840 


GTAGATATAG 


TAAAATGAAA 


TGAGAATAGG 


ACAAATTGAT 


CGGGACAGTC 


AAATCGATTT 


12900 


CTAACAATGT 


TTTAGAAGTA 


GAGGTGTACT 


ATTTTAGTTT 


CAGTCTACTA 


TACAACTGAC 


12960 


CAAGTCAGTA 


ACCTAGACTT 


AGGGCAAGGC 


GGCACTGACC 


TAGTTTGAAG 


AGATTTCCGA 


13020 


AGAGTATAAA 


TTTTAATATT 


TTCTTGTGTT 


ATTCCTTGAC 


AATTCAATTT 


GCAAAATATA 


13080 


TCATAAAGAT 


AATGACAGCG 


GTGTCATTCT 


ATCTATTTTA 


AGAAAAGTAA 


TAATCAATTG 


13140 


TTAAAAATAG 


TAAAAAAATT 


GGAGGTTCTG 


ATGAAATATT 


TTGTTCCC 




13188 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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Trirrrrnir. 


C ft AH CACCAG 


TGCGTCGGCC 


TCAGCAAGCA 


CCAGCGCGTC 


60 




•iv* & ft f* c a /^tvs 




AGCAAGTACC 


TCAGCATCTG 


AATCAGCATC 


120 




TCGncrrcAf 

1 UuuV. 1 1 v. AO 


CAAGCACAAG 


TGCTTCAGCC 


TCAGCAAGTA 


TCTCAGCGTC 


180 




*r^*ft trrtrTr 


rrrrrfV*TTr 

C w 1 1 lv 


AGCAAGTACT 


AGCGCCTCAG 


CATCAGCGTC 


240 


AACAAGTGCT 


TCGGtTTCAb 


f^ff* AftfGAG 

1 \«.AAI«.*jMVj 


TGCGTCTCAC 


TCAGCATCAA 


CGAGTACGTC 


300 


AGCCTCAGCA 


AGCACATCAG 


CTTCTGAATC 


iwv- ATC AACC 


AGTGCGTCAG 


CCTC AG CATC 


360 


GACAAGCGCC 


TCAGCTTCAG 


CAAGTACCAb 


f rvf**TV*Artf c* 

I 1*CVj i V- 




CCAGTGCTTC 


420 


AG CCTC ACCG 


TCGACAAGTG 


CGTCGGCCTC 


ft AfV*A#TGC A 


TCTGAATCGG 


CATCAACCAG 


480 


TGCGTCAGCC 


TCAGCAACTA 


CTAGCGCCTC 


AGCCTCAGCA 


1 LAALuMVJ 1 V* 


CGTCCCCTTC 


540 


AGCAAGTACT 


AGTGCATCAG 


CATCAGCATC 


AACUAUTvfLA 


1 CuuC I 1 l^/tu 


CAAGTACCAG 


600 


CGCCTCAGCT 


TCAGCAAGCA 


CCAGTGCGTC 


AGCC I C AuL A 


AGTACCAGCG 


CCTCAGCCTC 


660 


AGCAAGCACC 


AGTGCCTCAG 


CTTCAGCAAG 


TACCAGTGCG 


•Pf* ft /" r* r*^r* A 
TCAGCC 1 LAO 


l^ftTV^G A C A A G 


720 


TGCGTCGGCT 


TCAGCAAGTA 


CCTCAGCGTC 


*r*/^ft sTfirri 
TGAA I V.AULA 


Tf* ft ft.rr'ft.r'rTfi 


CATCAGCTTC 


780 


AGCATCAACA 


AGTGCTTCAG 


CTTCAGCAAG 


*r ft*pfTf* ft ffVT 




CATCAACGAG 


840 


TGCGTCCGCT 


TCAGCAACTA 


CTAGCGCCTC 


AGC A 1 l_Al>l-u 


Tf* ft ft r~ ft ftnTn 




900 


AGCGTCAACG 


AGTGCGTCTG 


AGTCAGCATC 


ft ftrv* ftr**p ft rv: 

AAv. bAb X MLV* 




CAACCACATC 


960 


ACc i~i C^tSAA 


TCTGCATCAA 


LUAb l uLu ll_ 


AVjViL i WnLitn 


TCGACAAGCG 


CCTCAGCTTC 


1020 


AGCAAGTACC 


AGTGCGTCAG 


CCTCAGCAAG 


4 AC CAO 1 4 


TCAGCCTCAC 


CGTCGACAAG 


1080 


TGCGTCGGCC 


TCAACcAGTG 


CATCTCAATC 


GGCATCAACC 


AGTCCGTCAa 


CCTCAGCAAG 


1140 






CATCAACGAG 


TGCGTCCGCT 


TCAGCAAGTA 


CTAGTGCATC 


1200 


& ft *^/* ft Cf* & 




CATCGGCTTC 


AGCAAGTACC 


AGCGCCTCAG 


CTTCAGCAAG 


1260 


CACCAGTGCG 


TCAGnCTCAC 


CAAGTACCAG 


CCCCTCACCC 


TCAGCAAGCA 


CCAGTGCCTC 


X 320 


AGCTTCAGCA 


AGTACCAGTG 


CGTCAgCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CTTCAGCAAG 


1380 


TACCTCAGCG 


TCTGAATCAG 


CATCAACGAG 


TGCATCAGCT 


TCAGCATCAA 


CAACTGCTTC 


1440 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AG CATC AACG 


AGTGCTTCAG 


TCTCAGCGTC 


1S00 


AACCAGTGCC 


TCTGAATCAG 


CATCAACAAG TGCCTCGGCT 


TCAGCAAGCA 


CCAGTGCGTC 


1560 


GGCTTCAGCA 


AGTACTACTC 


CATCGGCTTC 


AGCATCGACA 


AGTGCGTCTG 


AATCGGCATC 


1620 


AACGAGTGCT 


TCGGCTTCAG 


CATCAACGAG 


TGCGTCAGCC 


TCAGCAAGCA 


CATCAGCTTC 


1680 


TGAATCTGCA 


TCAACCAGTG 


CGTCCCCTTC 


AGCGTCAACC 


AGTGCGTCGG 


CTTCAGCGTC 


1740 



f 

/ - 

V-' 




RM^norm- ^wn am Ram a? i •> 
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GACAAGTGCT TCGCCTTCAC CATCAACGAG TGCGTCGGCC TCAGCAAGCG CAACTACCTC 
ACCGTCAGCC TCCGCCTCAA CCAGTGCGTC GGCTTCAGCA AGCACAAGTG CGTCACCCTC 
ACCAAGTATC TCAGCCTCTG AATCCGCATC AACGAGTGCG TCTGACTCAG CATCAACGAG 
TACGTCAGCC TCACCAACCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCACCCTC 
AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCT TCAGCCTCAC CGTCGACAAG 
TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTCCGTCAG CCTCAGCAAG 
TACTAGTGCA TCAGCTTCAG CATCAACGAG TCCATCGGCT TCAGCATCAA CCAGTGCCTC 
GGCTTCAGCG TCAACCAGTG CCTCAGCTTC AGCAAGTACC ACTGCTTCAC TCTCAGCATC 
AACAAGTGCT TCACCCTCAG CATCGACAAG TGCCTCGGCT TCAGCAAGCA CATCAGCATC 
TGAATCAGCG TCAACCAGTG CTTCGGCTTC AGCAAGTACC AGTGCTTCAG CTTCAGCATC 
AACCAGCGCC TCGCCCTCAC CAACCACCTC AGCTTCTGAA TCCGCCTCAA CCAGCGCCTC 
GGCCTCAGCA ACCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCAG CCTCAGCATC 
AACGACTGCT TCGGCTTCAG CAAGCACAAG CCCCTCGGGT TCAGCATCAA CCACTACGTC 
AGCTTCAGCG TCAACCAGTG CTTCAGCCTC ACCATCAACA AGTCCGTCAG CCTCAGCAAG 
TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCTGAC TCAGCATCAA CGACTACCTC 
AGCCTCAGCA ACCACCTCAG CTTCTGAATC GGCCTCAACC AGTCCGTCAG CCTCACCATC 
GACAAGCGCC TCAGCTTCAG CAAGTACCAC TGCTTCACCC TCAGCGTCGA CAACTCCGTC 
GGCCTCAACC AGTGCATCTC AATCCGCATC AACCACTCCC TCAGCCTCAC CAAGT ACT AC 
TCCATCGGCT TCAGCATCAA CCACTCCCTC CCCTTCAGCC TCAACCAGTG CCTCACCTTC 
AGCAAGTACC AGTGCTTCAG TCTCAGCATC AACAAGTGCT TCAGCCTCAC CATCGACAAG 
TGCCTCGGCT TCACCAACCA CATCAGCATC TGAATCAGCG TCCACAACCG CCTCACCTTC 
AGCAAGTACC AGTCCGTCAG CCTCAGCGTC GACAAGTGCC TCAGCCTCAC CAAGT ACT AG 
TCCATCACCT TCAGCATCAA CGACTGCATC GGCTTCGGCG TCAACCAGTG CATCAGAGTC 
AGCAAGTACC ACTGCGTCAg CTTCCGCATC AACAACTCCC TCGGCTTCAG CAAGCACCAG 
TGCCTCGGCT TCAGCAACTA CTAGCCCCTC AGCCTCACCC TCAACCAGTG CGTCACCCTC 
AGCAACTATC TCAGCCTCTG AATCCGCATC AACGAGTGCG TCGGCTTCAG CAAGTACTAG 
CGCCTCAGCC TCAGCCTCAA CAAGTGCATC GGCTTCAGCG TCAACCAGTG CGTCTGAATC 
GGCATCAACC AGTGCCTCCG CTTCAGCAAG TACTAGCCCC TCAGCCTCAC CGTCAACAAG 
TCCATCGGCT TCAGCATCAA CCAGTGCGTC CGCTTCACCA AGTACTACCG CCTCAGCCTC 
ACCGTCAACA AGTGCATCGG CTTCAGCCTC AACGAGTGCG TCTGAGTCAC CATCAACGAG 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 
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TCCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 
AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CCTCGACAAG 
TGCGTCGCCT TCAGCAAGTA CCACTGCGTC AGCCTCAGCA AGTACCAGTG CGTCAGCCTC 
AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCCGCATCAA CCACTGCGTC 
AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 
AACCACTGCA TCAGAGTCAG CAACTACCAG TGCGTCAGCT TCCGCATCAA CAAGTGCCTC 
GGCTTCAGCA AGTACTAGCG CGTCAGCCTC AGCGTCAACA AGTGCTTCAG CTTCCCCGTC 
AACCACCGCC TCGCCCTCAG CAAGTATCTC AGCGTCTGAA TCCGCATCAA CAAGTGCCTC 
GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 
CACCAGCCCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 
TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTGCTTCAG CCTCAGCAAG 
TATCTCACCG TCTGAATCCG CATCAACCAG TGCGTCCGCT TCAGCAAGTA CTACCCCCTC 
AGCATCAGCC TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC 
AACGAGTACG TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCACTGCGTC 
AGCCTCAGCA TCGACAAGCC CCTCAGCTTC AGCAAGTACC AGTGCGTCAG CCTCAGCAAG 
TACCAGTGCT TCAGCCTCAG CCTCGACAAG TGCGTCGGCC TCAACCAGTG CATCTGAATC 
GGCATCAACC AGTGCGTCAG CCTCAGCAAG TACTAGCGCC TCAGCCTCAG CATCAACCAG 
TGCGTCCGCT TCAGCAAGTA CTAGTGCATC AGCTTCAGCA AGTACTAGCG CCTCACCCTC 
AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CCTCGACAAG 
TGCGTCCGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACAAGTG CCTCGGCTTC 
AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCACCTTCAG CAACTACCAG 
TGCGTCCGCT TCAGCATCAA CGAGTGCTTC ACTCTCAGCG TCAACCAGTG CCTCTGAATC 
CGCATCAACA ACTCCCTCGG CTTCAGCAAG CACCAGTGCT TCCCCTTCAG CCTCAACCAG 
TGCGTCTGAG TCAGCATCAA CCACTGCGTC AGCCTCAGCA AGCACATCAG CTTCTGAATC 
TGCATCAACC AGTGCGTCAG CTTCCCCATC AACAAGCGCC TCGCCCTCAG CAAGTACAAG 
' TGCTTCAGCC TCAGCATCAA CCAGTGCATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 
ACCGTCAACC AGTGCCTCGG CTTCAGCAAG TACCAGTGCG TCACCTTCAG CAAGCACAAG 
TGCGTCAGCT TCAGCATCAA CCAGTGCTTC GGCTTCGGCA TCAACAAGTG CCTCAGCATC 
AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTACTGCA TCAGCATCAC CATCAACCAG 



3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4S00 

4S60 

4620 

4680 

4740 

4800 

4860 

4920 

4980 
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5220 
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TGCATCAGCC 


TCAGCAAGTA 


TCTCAGCGTC 


TGAATCGGCA 


TCAACGAGTG 


CATCAGCATC 


5340 


AGCATCAACG 


AGTGCATCGG 


CTTCAGCGTC 


AACCAGTGCA 


TCAGTCTCAG 


CAAGCACCAG 


5400 


TGCGTCGGCT 


TCACCATCAA 


CGAGTGCCTC 


AGCCTCAGCA 


AGTATCTCAC 


CCTCTGAATC 


5460 


GGCATCAACG 


AGTGCGTCAG 


CCTCAGCAAG 


TACTAGTGCA 


TCGCCTTCAG 


CAAGCACCAG 


5520 


TGCGTCGGCT 


TCAGCATCAA 


CCACTGCCTC 


AGCCTCAGCA 


AGTATCTCAC 


CCTCTGAATC 


5S80 


GGCATCAACG 


AGTGCGTCAG 


CCTCAGCAAG 


TACTAGTGCA 


TCAGCmTCAC 


CATCAACGAC 


5640 


TGCATCGGCT 


TCAGCAAGTA 


CCACCGCCTC 


ACCTTCAGCA 


AGCACCACTG 


CCTCAGCCTC 


5700 


AGCAACTACC 


AGCGCCTCAG 


CCTCAGCAAG 


CACCAGTGCC 


TCAGCTTCAG 


CAAGTACCAG 


5760 


TGCGTCAGcT 


CAGCATCAAC 


AACTGCTTCA 


GCTTCGGCCT 


CAACAAGTGC 


GTCAGCTTCA 


5820 


CCATCAACGA 


CTGCGTCGGC 


TTCAGCAAGC 


ACCAGTGCCT 


CGGCCTCAGC 


AAGCACCACT 


S880 


GCTTCAGCTT 


CAGCATCAAC 


AAGTGCCTCA 


GCTTCAGCAA 


GTACATCAGT 


TTCAAATTCA 


5940 


GCAAACCATT 


CGAACTCACA 


ACTTGGAAAT 


ACTTCTGGAT 


CGACAGGTAA 


ATCCCAAAAA 


6000 


GAATTGCCTA 


ATACAGGTAC 


TGAGTCCTCA 


ATTGGATCTG 


TGTTACTTGG 


AGTTCTAGCA 


6060 


GCTGTTACAG 


GTATTCGATT 


GGTTGCCAAA 


CGCCGTAAAC 


GTGATGAAGA 


AGAGTAAGAC 


6120 


AACCTCTAAA 


CTTAGGCTAA 


ACTAACTCGC 


GCACATAAAT 


C AAGG AG AAA 


ATTGCTACTG 


6180 


GATGATAAAA 


TAACAGTCAT 


TCTACCACTA 


TACAATCTGC 


AAAACTATCT 


GAGGAAGTGC 


6240 


CTAGATAGTA 


TTATTACTCA 


AACATATAAA 


AATATTGAGA 


TTGTTGTCGT 


TAATGATGGT 


6300 


TCTACGGATG 


CTTCAGGTGA 


AATTTGTAAA 


GAATTTTCAG 


AAATGG ATC A 


CCGAATTCTC 


6360 


TATATAGAAC 


AAGAAAATGC 


TGGTCTTTCT 


GCCGCACCAA 


ACACCGGTCT 


GAATAATATG 


6420 


TCCCGAAATT 


ATCTGACCTT 


TGTCCACTCG 


GATGATTGGA 


TTGAGCAAGA 


TTATGTAGAA 


6480 


ACTCTATA7A 


AAAAAATAGT 


AG ACT AT C AC 


GCTCATATTG 


CAC7TCC7AA 


TTATTATTCT 


6S40 


TTCAACGAAA 


GTGAAGGAAT 


GTTCTACTTT 


CATATATTGG 


GAGACTCCTA 


TTATGAGAAA 


6600 


GTATATGATA 


ATGTTTCTAT 


CTTTGAGAAC 


TTGTATGAAA 


CTCAAGAAAT 


GAAGAGTTTT 


6660 


GCTTTGATAT 


CTGCTTGGGG 


TAAACTCTAT 


AAGGCAAGAT 


TCTTTCAGCA 


GTTGCGCTTT 


6720 


GACATAGGTA 


AATTAGGAGA 


ACATGGTTAC 


CTCAATCAAA 


AGGTATATTT 


ATTATCAGAA 


6780 


AAGGTAATTT 


ATTTAAATAA 


AAGTCTTTAT 


GCTTATCGGA 


TTACAAAACG 


TAGTTTATCA 


6840 


AGAGTTTGGA 


CAGAAAAGTG 


GATGCACGCT 


TTAGTTGATC 


CTATGTCTGA 


ACGTATTACG 


6900 


CTACTAGCTA 


AT AT GGGTT A 


TC CTCT AG AG 


AAACACTTGG 


CAGTTTATCC 


TCAGATGTTG 


6960 


GAACTCAGTC 


TCCCCAACCG 


TCAAGCTAGT 


GCTTTATCTG 


ACACAGCAAC 


GTATAAAGAG 


7020 


TTTGAAATGA 


AACAAAGGCT 


TTTAAATCAG 


CTATCGAGAC 


AAGAGGAAAG 


TGAAAAGAAA 


7080 
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TCGCAGCAAA 


CT ATGGCT AT 


GTAGACCAAG 


TTTTAACGAC 


AATCAACTCT 


7140 


ATTTGTT ATC 


ATAATCGTTC 


GATTCGTTTT 


TATCTGATTC 


ATAGCGATTT 


TCCAAATGAA 


7200 


TGGATTAAGC 


AATTAAATAA 


GCGCTTAGAG 


AAGTTTGACT 


CAGAAATTAT 


TAATTGTCGG 


7260 


CTAACTTCTG 


AGCAAATTTC 


ATGTTATAAA 


TCGGATATTA 


GTTACACACT 


CTTTTTACGC 


7320 


TATTTCATAG 


CTGATTTCGT 


G C AAGAAGAC 


AAGGCCCTCT 


ACTTGGACTG 


TGATCTAGTT 


7360 


CTAACGAAAA 


ATCTGGATGA 


C "'PT*f iTTTGCT 

w * 1 VJ * A » * 


ACACACTTAC 


AAGATTATCC 


TTTGGCTGCT 


7440 


GTTAGAGATT 


TTGGGGGCAG 


ACCTT A'I'YTT 


GGTCAAGAAA 


TCTTTAATGC 


CGGTGTTCTC 


7500 


TTGGTAAACA 


ATGCTTTTTG 


fi AAAAAAG AG 


AATATGACCC 


AAAAATTAAT 


TGATGTAACC 


7560 


AATGAATGGC ATGATAAGGT 


flGATCAGGCA 


GATCAGACCA 


TCTTCAATAT 


GCTTTTTGAA 


7620 


CATAAATGCT 


TGGAATTGGA 




AATCATATTG 


TCATTCATAA 


ACAGTTTGCT 


7680 


GATTATCAAT 


TGCCTGAGGG 


TfAfiGATTAT 


WW m WW * •» * * *■ 


TTCACTATCT 


TTCTCATCGG 


7740 


AAACCGTGGA 


AAGATTTGGC 




TATCGTGAAC 


TTTGGTGGTA 


CTATCATGGG 


7800 


CTTGAATGGA 


CAGAATTGGG 




CATTTACATC 


CATTACAAAG 


ATCTCACATC 


7860 


TATCCAATAA 


ACGAACCTTT 


f* A rTTGTCTA 


ATCTATACTG 


CCTCACACCA 


TATTGAACAA 


7920 


ATTGAGACAT 


TGGTTCAATC 


CTTGCCTGAT 


ATTCAGTTTA 


AGATAGCAGC 


TAGACTAATA 


7980 


GTTAGTGATC 


GATTGGCTCA 


CATGACAATT 


TATCCAAACG 


TGACTATATT 


TAACGGAATT 


8040 


C ACT ATTTGG 


TAGATGTCGA 


TAATCAATTG 


GTAGAAACCA 


GTCAAGTACT 


TTTAGATATT 


8X00 


AATCATGGCG 


AAAAG AC AG A 


AGAAATTCTC 


GATCAATTTG 


CTAATCTTGG 


CAAGCCTATC 


8160 


TTATCCTTTG 


AAAATACTAA 


AACCTATGAA 


GTAGGTCAGG 


AGGCATATCC 


TGTTCACCAA 


8220 


GTTCAAGCAA 


TGATTGAAAA 


ATTGAGAGAA 


ATAACCAAAT 


CAAGAAAAAT 


CATTTAGTAO 


3230 


GAGATGCTCT 


GATTTTGACG 


GTTAGTGATC 


AGATTGAAGA 


GTTGGATTAT 


TTTTTATAAA 


8340 


ATTTCTCCGT 


TC ATCAT AT A 


TGAAACTTGT 


TCAAACATCA 


CACTGCTTTA 




a Ann 


ATAGACCTAA 


AGATATTTAA 


TATGAACTGC 


ACCCCAAAAG 


TTACACAGAA 


AAAATCTAAC 


8460 


TTTTTGGsGT 


CACTACAATA 


TTAGGGTGTG 


ATTAATTATC 


TTTTTAGGTG 


AAAATGATTC 


8520 


TATATTATAG 


CTGTTTCATA 


CGAAATTTAT 


TATAAGGAAA 


TTATGTTAAT 


GAATACAAAA 


8580 


TCTATACTTT 


TTAATGCAGA 


TAATGATTAT 


GTAGATAAAT 


TACAAACTGC 


AATTAAATCT 


8640 


ATTTGTTGTT 


ATAATAATTG 


TTTAAAATTT 


TATGTATTTA 


ATGATGATAT 


TGCGTCAGAG 


8700 


TGGTTTTTGA 


TGATGAATAA 


GCGATTGAAG 


ACTATACAAT 


CTGAAATCCT 


TAATGTAAAG 


8760 


ATTGTAGATC 


ATGTTCTTAA 


AAAGTTTCAT 


TTACCGTTAA 


AG AATTT AAG 


TTATGCCACT 


8820 
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TTCTTTCGTT ATTTTATACC TAATTTTCTC 
GACATCATTG TTACAGGAAG TTTAGACTAT TTATTTGATA 
TTCGCAGCAG TAGAAGATTC TTTTGGTGAT GTTCCTTCTA 
TTATTAGTTA ATGTAGATAC TTGGAGACAT GAAGATGCTT 
ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGCAA 
GATAGATGGA AAAGATTAGA CCGAAATTTT AATTTTATGG 
CACATAGAAG GAAATCATAA ATGGTATGAG ATTTCTGAGT 
AGTGTTATAC ATT AT ACT GG GGTAAAACCT TGGGAAATAA 
GAAGTTTGGT GGTTTTATAA TCTGTTAGAA TGGTCTGATA 
ATTAGTCCTA GTTTCGAAGA ACTTGTATAC AGTCCTAAAG 
CCTACTTGTG AGATGGAGCA TCTACAATAT TTGATAGAAA 
TCT AT ACT AG CACATACATA TTTTGCGTCT AGTGTCGTTG 
GTTACGATTT ATCCTTGTTT TTCTCCATTT GATTATCGAA 
TTTTATTTAG ATATTAATCA TTATAAAGAA GTCGATAATA 
CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTACTCATC 
ATATTTTCTT CAACCGAACC AAACAAAATG GTACAGGCTA 
TAAGTTTATG GCAGACGAAC TAATTAGTAT TGTACTTCCA 
TTTGCGAATG TGTTTCGATA GCATTCAGAA TCAGACGTAT 
AATCAATGAT GGCTCTCCAG ATCATTCATC CAAAATATCT 
TTCTCGTTTC AAATATTTTG AGAAAGCAAA CGGCGGTCTT 
7ATTGAA7ST TCGGGGGCGG CCGTACATTA CTTTTGTAGA 
ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA 
GGCGTTATAA TTCTTATGAT GAAACACGCT ATGTGTATAT 
ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATGGA 
TCAGAAATGG GAACTGGACT GTAGCTGTCT TGAAGTTATT 
ATTTACCATT TCCTATAGGA AAAATTGCAG AGGATACTTA 
TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA 
CTCATACTTT ATCGAATACA TGGAGTGAAA AGCGTATGTA 
AAGAAAAGAT AGCTATTTTA GCAAGTTCAG ACTATGACTT 
ATAAAAATAG ATTACAAAGA GTGATAGCAA AATTAGAAGA 



GTGCTTTATA 
TAGAACTAGA 
CCAATTTTAA 
GTTCGAAACT 
TTTTAAATAT 
TGGGGATGGA 
TGAAAAATGG 
TTTCCAATAA 
TTTTATTGAG 
CTCATACAGC 
ATTTACCAGA 
CTTTATTAAG 
AAATTTTGGA 
TTGTATCCGT 
ATATAGGCAA 
TTAGACAATT 
ATCTACAACG 
CAAAATTTTG 
CAAGAATTTG 
T CATC AG CTC 
CTCTGATGAT 
AAACGCAGAT 
GACTTATGTT 
TAGGGAAGCT 
CAAGAGAGAG 
CTGGACATGG 
CTGCTACCCT 
TGATGAAATT 
GACCAATCAT 
ACAAAATATG 



CCTAGATTCT 
TGGTTATGCC 
CTCCGGAATG 
GTTAGAACTG 
GTTATTCCAT 
TAGCGTCGCA 
ACATTTACCT 
TCGCTTTAGA 
AAAAGACATT 
AATTTTTACA 
CGTACATTTT 
ATATAGCAAT 
TAATTTAGAT 
TCTTCAACAA 
TCAAACTAAT 
TATAGGAGAA 
TTGAGAATTA 
AGTCTTTATT 
TAGAGAAAGA 
CTAACCTAGG 
TGCTTGGAAC 
ATTAGTATCG 
ACGGATCCAC 
GTCGAAGAAG 
TTACTACAAG 
AAGGTACTTC 
CTTGCTTTAT 
GGGGCTAGGG 
ATTTTCATTT 
CACTTCACAG 



8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
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9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
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10S60 
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ACATTT AC AG AACAATGATG CAAAAATTCT CTTTACTTCC GTAGATACTA ATAAAAAATG 10680 
AGATAGCGTA ATATGAAACT ACATTTAACA AATTTATACC GCATGGCTGC TGATAGTACG 10740 
GTTATCTTAG CTCAAAATGC TGTTCAAAAG ATACCTAGTC AACTGGGATT TAGAGAGGTT 10800 

GGTATTTATT TTTACAACAT TGCTTCAGAT ACTCCTTCTG AAATGAATAA CCGTCTGGAT 10860 

GGTATTATGG CCAGTATCTC TATTGGGGAT ATTTTAGTCT TTCACTCTCC AACCTGGAAT 10920 

GGTTTTCAAT TTGATCCTCT CTTGTTTGAT AACCTAAAGG ATATGCAGGT GAAAATTATT 10980 

TGCTTTATCC ATGATGTTCT TCCCCTCATG TTTGATAGTA ACTATTATCT CATGAAAGAT 11040 

TATCTGTATA TGTATAATCT ATCAGATGTT TTGATAGTGC CCTCAGAGAG AATGAAAACA 11100 

CGCCTGATGG AAGAAGGATT GACGACTAAG AAGATTCTTG TTCAAGGGAT GTGGGATCAT 11160 

CCTCATGATT TATCCTTATA CACCCCTGCT TTTAAAAAAG AACTTTTTTT TGCTGGAAGT 11220 

TTAGAGCCTT TTCCACACTT ACAAAATTGG TCTCAAGATA CGCCTTTGAG AGTATTTTCA 11280 

AATAAAGGGG AAGCTACTTC TAGTGCTAGA ACTCTCACCA TCGAACGATG GAAAAAAGAT 11340 

CACGAATTGT TCCTAGAATT ATCAAAGGGT GGATTTGGCC TTCTCTGGGG AACCcATCAA 11400 

AATGAGGGAC AAAGTAACCA ATACTATACC TTGAATATAT CTCATAAGGT CAGTACCTAT 114 60 

CTAACAGCCC GC ATT CCACT CATTGTACCA ACTAGCTTGT CAACTGCTAA ATTTATAGTA 11520 

GATCAAGGCT TGGGCTTTAT GGCGGATAGT CTGGAAGAGG TTCATGAGAT AGTTGATAAA 11580 

ATGAATCTAC AAGAATATCA AGAAATGACG AATCGTATCA AGACCTTTAG CTATTTGTTA 11640 

AAAGAGGCCT ATTTCACTAA AAAGTTATTG GTAGATGCAA TCTATCACTT GGGAATTGAT 11700 

TAAGGGAATG AAATCAACAA AACAATTGTA CTAGCAGCGG ATCGCAATTA CACCACGCAG 11760 

TTAGAAACAA CGATAAAATC TATTTTATAC CACAATCGAG ATGTTAACAT TTATATTTTG 11820 

AATCAAGATA TCATGCCAGA TTG C TTTCGC AAACCACCAA AAATAGCTCG CATGTTAGGT 11880 

AGTGAGATTA TCGATGTTAA ACTACCTCAA CAAACTGTGT TTCAACATTG GGAAAAGCAA 11940 

GATCACATTA GTAGCATTAC TTATGCTAGA TATTTTATTG CAGATTATAT CCAAGAAGAT 12000 

AAGGTTTTAT ATTTAGACAG TGATTTGATT GTAAATACTT CTTTAGAGAA ATTATTTAGT 12060 

ATTTGTTTAG AACAAAAATC ACTCGCACCA GTTAAAGATA CAGATGCAAT TACATTTAAT 12120 

GCAGGTGTTT TATTAATCAA CAATAAAAAA TGGCGTCAAG AGAAATTAAA AGAACGACTA 12180 

ATTGAACAGA GCATTGTTAC AATGAAGGAA GTTGAAGAAG GCCGTTTCGA GCATTTTAAT 12240 

GGTGATCAAA CGATTTTTAA TCAGGTCTTC CAAGATGATT GGTTAGAACT AGGTCGAGCT 12300 

TATAATTTAC AAGTACGGCA TGATATTGTG GCTTTGTATA ACAATTGGCA GCAACATCTG 12360 
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GCTTTTAATC 


ATAAACCAGT 


CGTGATTCAT 


582 
TTTACGACCT 


ACAGAAAACU 


*«. I uOMl. 1 r*\~\- 


12420 


TTGACAGCCA 


ATCGTTATCG 


TGATTTATGG 


TGGGAATTCC 


ATGATTTGGA 




12480 


ATTTTACAAC 


ACCATATGGG 


AGAATTTGAA 


CTAATATCGC 


CTCTAGATAA 


GGAA n nui 


1 2S40 


TGCTTAACCT 


TAACGAATTC 


CCAAGATTTA 


GAAGGAATAG 


AAGAGCTAGT 






CCTGAGCTGG 


TATTTCATAT 


CGCAGCTTGC 


ACGGATATGG 


GAGATAAATT 


ft It It ft ti K ft ^n^M ft 

AAAAAAATTA. 


1 


GCTGTATATA 


ATAATGTGAG 


ATTGCATCCA 


CAAATTGTTC 


CACCGGTCTT 


■h a ft tm ft ft Jftj 

AGATAAGCTG 


i ?75n 
1« / «u 


AAAAAGTCAA 


CAAATCTATA 


TTTGGATATC 


AATCATGGTA 


CTGCAGATGA 


GAAui i i ii A 


l£ / OU 


AAATCTTTGC 


AAGAACAAGA 


AAAAACGCTA 


CTAGCTTTTC 


AATCGACTCA 


GCACGGAGAC 


1 •049 


TTAGGACAAA 


TCGTTTTCGA 


AAATGGGAAA 


GTTTCCTTTA 


TGATTGATAC 


GATTAAAGAT 




TTTAAGAAAA 


ACGGACATCT 


TACCTGTTTT 


CGACAACTTC 


CAAGTTTAAC 


'ITGT rTAACG 




TTTACGGCTT 


CTCAGTATAT 


CGAACAATTG 


GATTACTTCC 


CTCGACAGTT 


GCCAAATGTT 


13020 


GTTTTTCAAA 


TTGCTGCTTG 


GACAGCTATG 


GGCCCAAAAT 


TATATGATTT 


CTCTAATCGT 


13080 


TATCCTAATA 


TTCAGCTCTA 


TCCGGCAATT 


TCTAGACATA 


AGCTACACGA 


GTTGAACGAG 


13140 


AAGATGGATG 


CTTATTTAGA 


TATCAACCTA 


CTGACTTCAA 


CATCCGATAT 


CGTTGCAGAA 


13200 


ATGCCTCATC 


TATCTAAACC 


TATACTAGCC 


TTTTATAAAT 


CTCAAAATCG 


GAATAATGGC 


13260 


CAAAGGTTGT 


ATTCAACTGA 


ACATCCTGAA 


CGAATCTTGG 


CTCATTTGCA 


AAAATTGATA 


13320 


ACTAAGCATA 


TGCTAGAAAA 


ACCGCTTGAT 


ATAATCCAGG 


TGAAAGGGAT 


AGATGAAACC 


13 380 


TTGGATTATA 


TTATTGAACA 


CAACTCTTCT 


TTAGTTCCTT 


TTGGAGATGG 


GGAAATCAAT 


13440 


ATGCTTGCAG 


GGCATTCAAT 


TCCCTACCAG 


GATTATGATG 


AAGAGTTGGT 


TTCAATCATG 


13500 


AGGGACATTA 


TCGGCCAAGA 


AAGTCGAGAA 


GATTTAGTAG 


TGTGCCTTCC 


TGATGCTTTT 


1 T C £ A 


ACAGATCCTT 


TTAGGTTTAC 


ATCCTGCCCC 


ATTCCATTTT 


GGAAAGATCA 


CATGGATCAT 


1 1 £9 A 


TATATGGATT 


TTTACAGAGA 


GTTATGCAGT 


GATTCATGGT 


ATGGCTCAAC 


CrTTOTATCT 


IJOtfU 


CG CCCTT AT A 


TCCATTTTGA 


ACACAAGAGT 


CAAGCTAAAG 


CTCAATTTGA 


AAAATTGAAA 


1 J / 4U 


AGCATTTGGG 


AAAACCCTGA 


CTTACTGATA 


GTCGAAGGTG 


CGACTTCTCG 


TTCAGGTCTC 


13800 


GCAAATGATT 


TATTCCATGA 


GGCAAATTCT 


ATTAAGCGAA 


TTATCTCTCC 


TTCTCATACT 


13860 


CCCTTTTCTA 


GAGTTCATGA 


ACTTGAACAA 


GAAATTGAAA 


AGTATGCTGG 


TGGTCGCTTG 


13920 


ATTTTATGTA 


TCCTTGGACC 


TACAGCAAAA 


GTTCTGACTT 


ATAATCTATG 


CCAGATGGGC 


13980 


TATCAAGTTT 


TGGATCTAGG 


CCATATTGAC 


TCAGAGTATG 


AATGGATGAA 


AATGGGAGCT 


14040 


AAAACTAAGG 


TTAAATTTTC 


TCATAAACAT 


ACTGCAGAAC 


ATAATTTCGA 


CCAAGATATT 


14100 


GAATTTATTG 


ATGATGAAAC 


CTATAACAGT 


CACATTGTTG 


CACGAATATT 


AAACTAGACT 


14160 
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ATTTAAAATA 
CCTGATTACC 
AGTCATTTGA 
AGAATATATT 
TTTCAAATGA 
TTTATTCCAG 
ACTGGTGATT 
CGTTCTTGCT 
AAATGGCCAT 
AATGTGGAAG 
CTTCAAGATC 
CAATTCATTT 
GATAAGCCTT 
TTGATCGATT 
AAATCACAAA 
ATCCATTATT 
ATGGCAAATG 
TTACCAATCT 
GATCGAAAAT 
TTCGACAATA 
TCCATTCCGA 
GAAAATCAAT 
AGTCTGTATT 
CTGAATGGTT 
TACACATTGA 
CTTACTTTAG 
CCGATATCAT 
CAATTGGTGC 
GTATGTTACT 



AATGATAAGG 
CTTATATTCG 
AAATTTATCT 
TACAAGAGAT 
ATTGCTCTAA 
ATTTTCTAAC 
TGACCGATTT 
TTGGACCAGG 
CTGAAACTAT 
AAGGAGACCA 
AATATAATTT 
TTGATATTCC 
GGAATCAATT 
CCTCTGTTAT 
TATTTAAGTT 
TGGCGGAGCA 
AACTACTAGC 
TATTGGAACA 
TAGAAGATGC 
CTTGCTCTGA 
AAAAAATGGT 
ACTATTAGCG 
GTATCACAAT 
TAAATTATTG 
TAAAGAACTT 
ATTTTTTGCG 
TCTAACTGGG 
TGTTGATCAT 
AATCGATCTT 



583 

ATTTAAAATG ACAAATACCA 
ACAAATCGAA ACGGCGATGA 
GCTAAATCAG GACATTCCTC 
GGGGGCCGAC TTGATTGACT 
TAAATTACCT CATATCAATC 
AGAAGATAAA GTTCTCTATC 
GTTTGAATTA GACTTAGGTG 
AGTCGGCTTC AATCCTGGTC 
TCGACAAAAA TTGATTGACT 
CTCAATTTTG AATATCTTGT 
TCAAATAGGA TATGATTATG 
GCTCGAACCA CTCCCACTAA 
TTCTGTTGGA CGTCTAAGAG 
TTTAAATGAA TGGTTTTCAA 
GCAATCTCTT AATTTAACGA 
ATTGCCAGAA CTTCATTTTC 
TTTAACGCGT TTTCCTAATC 
AATAGTAATA GCTTCACATT 
ATATGAGTTT GTGCTTAAGT 
AAATCTTTCT GAGATTTCAT 
TGCAGCAATC AGATCTTACA 
GCAGATAATG CCTATCTTAT 
AGAGATGTTG ATTTTTATAT 
GGGAGAAAAA TGGAAGTTGT 
TTTGAAAGCT ATAAAACAGG 
ACAGAACTGG TTGAATCTGA 
GAACTAGCTA CTTTCTTTGA 
GTCTATCCCT ATGAAGGACG 
G C AAAGTGG A AAGAACATTC 



AACGCGCTGT 
ACTCACTCTG 
AGGAATGGTT 
GCAAGTTAAT 
ATATGACATT 
TAGATACTGA 
AAAATTATTT 
TTCTCTTGAT 
TAACAGAAAA 
TTAAAGATCA 
GGGCGGCAAC 
TTTT AC ACT A 
AAGTTTCCTG 
AGAGTGTGAA 
ATTCTTCCTG 
ATATTGTTGC 
TTACCCTATA 
TCTATTTGCA 
ACAAAAAACC 
ATGAAGCTAT 
TGAGGTAGAG 
TCCTTTAGAG 
TCTCAACAGT 
GAATTCTACA 
ACCTCATATA 
TACGGTATTC 
GATAGATCTC 
AAAATCTCCA 
TATTGTCAAT 



AGTATTT CC A 
TAGACACAAT 
TAGTCAAATA 
TGGCTCACAG 
TGCACGCTAT 
TTTGATTGTG 
GGCAGCAGCT 
TAACAACAAA 
AGAACATGAG 
ATATAGTTCC 
CTTTAAACAT 
TATTTCTCAG 
GGAATACTCT 
GTACCCTACT 
TGTCGAGAAA 
TTATACAAAT 
TCC AAATTCC 
TTTGAATCAT 
AATGATAGCT 
CTATCCAACC 
AACAGTATGA 
ACG ACT AT *iA 
CATATAGCTC 
ATTCGCAGTG 
AATTATGCTT 
TATCTGGATT 
AAAGGATATT 
TTTAATACTG 
ACTTTATTGG 



14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
154 80 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
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AATTAGCGGC 


CGAGCAGAAT 


CAAGTTGTTC 


ATCTTCCGGA 


TCAGAGTATT 


TTAAATATTT 


15960 


ATTTTGAGGA 


TAATTGGCTA 


GCCTTAGATA 


AAACATATAA 


TTATATGGTG 


GGTATTGATA 


16020 


TTTATCACCT 


TCCTCAACAA 


TGTGAACGTC 


TAGATGACAA 


TCCACCTACA 


ATTGTTCACT 


16080 


ATGCTAGTCA 


TGATAAACCT 


TCGAATACAT 


ATAGTATATC 


TAGACTACCT 


GAATTATGGT 


16140 


GGGTTTATAG 


AGATTTGGAT 


TGGTCAGAGA TTGCTTTTCA 


ACCTTCCGAT 


TTAAATTATT 


16200 


TTGAAAGAAG 


CAATCAGTCT 


AAAAAACAAG 


TGATGCTTCT 


GACATGGAGT 


GCAGATATAA 


16260 


AACATTTAGA 


GTATTTAGTA 


CAACCGTTAC 


CTGATTGGCA 


TTTTCATTTG 


GCTGCACCGT 


16320 


GTGATTGTTC 


TGAGGAGCTG 


ACCTCTCTAT 


CACAGTATAC 


GAATCTAACA 


CTATATCAAA 


16380 


ATGTATTACA 


TAGTAGAATT 


GATTGCCTAT 


TGGACGATTC 


TATAGTTTAT 


TTAGATATTA 


16440 


ATACAGGTGG 


AGAGGTTTTT 


AATGTAGTTA 


CAAGGGCACA 


AGAAAGTGGC 


AAGAAAATCT 


16500 


TCGCTTTTGA 


TATCACACGT 


AAAAGTATGG 


ATGATGGACT 


CTATGACGGT 


ATTTTTTCTG 


16560 


TGGAGAGACC 


AGATGATTTA 


GTGGATAGAA 


TGAAGAATAT 


AGAGATAGAG 


TAATGAGTGA 


16620 


ATTAATTAGT 


GTTCTGCTAC 


CGATATACAA 


TACGGGAAAA 


TATTTACTGC 


ACTGTGTCGA 


16680 


GCATATTCTG 


AAGCAAACCT 


ATCAAAATAT 


AGAAATTATT 


TTAGTTGATG 


ACGCTTCTAC 


16740 


GGATAATTCT 


GCGGAAATTT 


GTGATGCTTT 


TATCATGCAA 


G ATAATCGTG 


TGCGAGTATT 


16800 


GCATCAAGAA 


AATAAGGGGG 


GGGCAGCACA 


AGCTAAAAAT 


ATGGGGATTA 


CTGTAGCTAA 


16660 


GGGAGAGTAC 


ATCACCATTG 


TTGATTCAGA 


TGATATCGTA 


AAAGAAAATA 


TGATTGAAAC 


16920 


TCTTTATCAG 


CAAGTCCAAG 


AAAAGGATGC 


AGATGTTGTT 


ATAGGGAATT 


ACTATAATTA 


16960 


TGACGAAAGT 


GACGGGAATT 


TTTATTTTTA 


TGTAACAGGG 


CAAGATTTTT 


GCCTCGAACA 


17040 


ATTAGCTATA 


CAAGAAATTA 


TCAACCGTCA 


AGCAGGAGAT 


TGGAAATTCA 


ATAGCTCGGC 


17100 


CTT7ATATTG 


CCGACATTTA 


AGTTGATTAA 


AAAACAATTA 


TTCAATGAAG 


TTGACTTTTC 


17160 


AAATGGTCGC 


CGCTTTGATG 


ATGAAGCAAC 


TATGCATCGC 


TTTTATCTTT 


TAGCCTCTAA 


17220 


AATCGTCTTT 


ATAAACGATA 


ATCTCTATCT 


GTATAGAAGA 


CGTTCAGGAA 


GCATCATGAG 


17280 


AACGGAATTT 


GATCTTTCCT 


GGGCAAGAGA 


TATTCTTGAA 


GTGTTTTCTA 


AGAAAATATC 


17340 


GGATTGTCTC 


TTGGCTGCTT 


TGGATGTCTC 


CGTTCTGCGT 


ATTCGATTTG 


TCAATCTTTT 


17400 


AAAAGATTAT 


AAGCAAACTT 


TAGAATACCA 


TCAATTAACA 


GATACTGAGG 


AATATAAACA 


17460 


TATTTGTTTC 


AGATTAAAGT 


TGTTTTTTGA 


TGCAGAACAA 


AGAAATGGTA 


AAAGTTGAAA 


17520 


TAAAACAATT 


CTTATTTACC 


AT ATC ACAAA 


CAATGAAGGT 


GAGGGGAGTC 


TTTTATGACT 


17580 


AAGATTTATT 


CGTCAATAGC 


AGTAAAAAAA 


GGACTATTTA 


CCTCATTTCT 


ACTGTTTATC 


17640 


TATGTATTCG 


GAAGTCGTAT 


TATTCTCCCT 


TTTGTTGACC 


TAAATACTAA 


AGATTTTTTA 


17700 
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ses 





tAW. tTATCT 


ACCCi i L 1 tA 


tzr^mrrr^v a A 




11.4 #\Aw A/%w A 


17760 


TTATCAATTT 


TTTCTGTTGG 


ATTATLV.Lv. 1 


flvy* ft ^v:*rr*f*ft 

luuAlu 1 




a Tv;fif~ tow; 


17B20 


"iri i i~l illii 


CTAAACuOTT 


CvAil 1 t AALA 




TAOAA A*TAf" A. 


AuA 1 IvVj^. A 


17880 


AAAATGTALL 


TGACCTTOCT 


AATTuL 1 v» I \j 


Al I LAA t LL i 


1 uuL A\j 1 1 Au 


1 IVfnVtnLi l 


17940 


CCAGTACAAT 


CCTCCTATTC 


TGCAATAi i\i 


O i 1\. 1 AA 


TT&ATAPA1T 
luAA X AL AA 1 


A 1 A OA A A 


1 8000 


m ft ft ^» ft rti 

GCAGGAACAT 


TTTTTCTTGT 


TTGGTTGTCA 


V»ATTT AAA i V* 


LLAb I A 1 VjLfVj 


A "nV!/-* A 
Vin 1 tlj^jnlAi * 


i anen 

lOVOv 


TCTATTCTAA 


TCCTCCTATC 


CAGTATGCTT 


TTAAATATTl 


LTCAGGATCT 


TTTuGAAALA 


1 Q1 1ft 


TTTCAGACAG 


TACACATTCC 


AACAGGGATT 


ATTGTGTTAC 


TTGCTTTATT 


AACCCllVJU 


i Ol OA 

loloU 


'ITTTCl'TATT 


TACTTGCCCT 


TATGTATCGA 


GCTCGCTATT 


TGCTTCCTGT 


m ft ft ft * A*f**I* 

TAATAAAATT 


i g<) Aft 


GGCTTACACA 


ATCGATTTAA 


ACGCTATTCT 


TATCTCGAAA 


TCATGTTGAA 


TCCTCCAGGT 


loJUO 


CGGATGCCTT 


ATATGTATGT 


GATGAGTTTT 


CTTAGTGTAC 


CAGCTTATTT 


GTTCATCTTG 


183 60 


TTGGGATTTA 


TTTTCCCTAA 


TCATTCAGGG 


TTAGCGGCTT 


TATCAAAGGA 


ATTTATGGTT 


18420 


GGAAACCCTT 


TCTCGGTCTA 


TGTTTATATT 


TCGCTCTTAT 


TTTTATTTAG 


TATCATTTTT 


18480 


GCTTTTGTTA 


CGATGAATGG 


AGAAGAGATT 


GCAGACCGTA 


TGAAAAAATC 


TGGAGAATAC 


18540 


ATTTATGGTA 


TTTATCCAGG 


TGCGGATACT 


ACTCGATTTA 


TTAATCGATT 


CGTCCTTCCT 


18600 


TTCTCACTCA 


TAGGTGGTCT 


CTTTAATCTG 


ATTATGGCAG 


GTGGTCCCAT 




18660 


TTCTTTGATG 


AAAAGTTATT 


ACGATTGGCA 


ATGATTCCAG 


GCTTATTTAT 


CATCTTCGGC 


18720 


GGCATGATTT 


TTACGATTAG 


AGACGAGGTC 


AAGGCTTTAA 


AAAh life m/k Jk 

GGCTAAATCA 


GACCTATAGA 


18780 


CCTTTGATTT 


AGGAGACTTT 


TATGTATTAT 


TTTATTCCAG 


LTTWTATCG 


f*fw*r* ft^* ft A ft /"•x 
bl^ACAAAGA 


X DB4U 


ACATGGCATG 


CAGATATCAC 


TCwATGG * AT 


1"! L 1 lATT 


TTLoTL • AoA 


GTrTGATGAT 


A 


ACCTTTCACC 


AGATTCGGCT 


L I H CAAOAG 


ft ft/*ft*I*A^ Ai^ 

LAACATATAu 


ATTCTCGTCT 


ATT ACT ATTP A 




GCTTACCAGC 


CTCATCTACG 


TTATTTTTTA 


TATAGACATG 


GTGTGTTAGA 


AATGGATACT 


19020 


TATTCCGTTT 


TTGATCTTAT 


GCAAGATTTT 


CATAATCTCC 


ATACCCAACT 


TTTAAGCATT 


19080 


AGACATATTG 


AGTGGGATGA 


TCACTGTCAA 


TTTATTTATA 


GTCCCTTTAC 


G ATT AT CGTT 


19140 


CAAAAAAATG 


GGAAGAAATT 


TGCTAAGGTT 


GAACATGGAG 


TTGAAGGCTT 


CATCAGTGAT 


19200 


ATACAGTATT 


TTGAACCAAA 


TGGTCAAATA 


CATATGCACC 


ATATCGTGGA 


TGATCGTGGG 


19260 


TTTGTATCGA 


GCATTATCTT 


TTTTGAAGAT 


GGGCAAGCAC 


CCTATCAACA 


ATATCTGAAC 


19320 


CTCAAGGGAG 


ACTGGCAATT 


CAGAG AG CGT 


TTAAAAGAAG 


GAGGACAGCT 


AGAAGTCAAT 


19380 


CCAATTTTGG 


GTTATCGCTT 


TAAAATGCTT 


ACCTATCAAA 


ATATGGGAGA 


TCTGGTGGCA 


19440 
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GAATTTTTTC 


AGAATTATCT 


CCAAACGTAT 


GTGAAGGATC 


AGG AT ATT TT 


TATGCTTCCT 


19500 


TCTCATTCTC 


ATCATGACCA 


GTTGGTACTA 


GATCGTTTAC 


CTACTACTAA 


TCCTAAACTG 


19S60 


TTGACTCTGT 


TCATTGCACG 


TAATCCTCAA 


GATACCTTTA 


GGGATTTAGA 


TGTAACTTTT 


19620 


GAAAAATCGC 


ATTTGATTTT 


GGTGGATAGA 


GAGGATAGTT 


TACGATTGTT 


GCAGGAGTTG 


19680 


TATCCTCAAC 


GAATGCATCA 


ATGTTATCAT 


TTATCATCTT 


TTGACACCCG 


ATTACGATTC 


19740 


GGACGAAGCC 


AAACTAAGAA 


AGAATCCATC 


ATTTATTTTC 


AACTGGATTT 


TGAGCAGGGG 


19800 


ATTGATAATC 


AAGCTCTGCT 


TCAAGTCTTG 


TCCTTTGTCG 


CTGAAAATAA 


GGATACTGAG 


19860 


CTGATTTTTG 


GAGCCTTTGC 


TGCTAGTCAG 


GAGCAAATGA 


ATGAGGTTGA 


AGCGATTGTT 


19920 


GACTCTTTCA 


TCCAAGAAAA 


CATTCAATCC 


GAAAATCTGG 


GAAAGGCGAT 


TGATTATCGT 


19980 


GATGCAGAAA 


ATCCTCTGGA 


AGAAAATCAA 


CACCAGGACT 


TACGCTTACA 


GTTTGTTAAC 


20040 


TTGAATGATG 


AGTTAGATTT 


GATAAAAACA 


CTAGAATTTG 


TCCGTTTGAT 


TCTGGATTTA 


20100 


AATAGACATC 


CTCATCTCTA 


CACACAGATT 


GCTCCGATTA 


GTCCAGGAAT 


TCCTCAAATC 


20160 


AACCTAGTTG 


AAACCGTCTA 


TGTTGAACAT 


TTAAAAAATG 


GTTATTTGTT 


AGCAGATGTT 


20220 


ACAGAATTTT 


CTAAGGCTCC 


ACATTATTAC 


ACAGATAGGT 


TGAAGGAGTG 


GAATGAGTCC 


20280 


TTGATATATT 


CAATTGATAA 


GATTAAGGAC 


CACACAGCAC 


AACAATTTCT 


TCGAAAATTA 


20340 


GAGAAATGGA 


TACAGGAGGT 


TAAAAATGTC 


AAACGAACTT 


AATATTTTAC 


AGATAGGACT 


20400 


TGCCAATTGG 


GAAAATCACT 


ATGACATACC 


TGAAAATATG 


AGTTGCTATT 


ATTTTTACCC 


20460 


AAACTCATCA 


AAAGCCCTTC 


CTGAAATAAT 


TGAAAAAGAC 


CATATTAACC 


GTTTTCATGC 


20S20 


AGTTTTAATA 


CAAGATCCTC 


AGTATTCCAC 


AGACTTATTT 


TCCTATGTAA 


AATATTTTGA 


20580 


ACCTTATACT 


TTATT TT AT A 


ACCAGAATCT 


ACAAATAAAT 


GATAGAGAGG 


TTGTGGATTT 


20640 


TCTAAAAAAA 


CCATGTGCAC 


AAGCAATTCA 


TTTTTTAACT 


CCCCAACAAC 


TAATCAATGA 


20700 


TTTAAGTAAA 


TCTCTTTTTG 


CCCCTCCCTA 


TGGTGATAAA 


CTCTTTCCTC 


CGACGATACA 


20760 


AGTCAATCCA 


AATTTTACAC 


GAGCTATTTC 


TTATCAAGGA 


TTGGATTATG 


TCAGTTTGGA 


20820 


ACGTGAGTTT 


GGGCAAGATT 


TTGCCCACCT 


TGCCTATTGG 


GCTTATAATA 


TTATCGTGCA 


20880 


AAAAACACTC 


CCTATTGAGT 


TGTGGCTTGA 


ATATGAGAAG 


GAACGCAATT 


GTGACTTTCG 


20940 


TTTAGTAATC 


CGTAAAATCT 


GGAGTCGCTC 


TGTTGATGAT 


TTCTTTGAAG 


AAGTAATAGT 


21000 


ATCTGAAAAA 


GACTTGGAGC 


AAGCACTTTT 


TATGGATACT 


CGAGACGGAG 


ACTACTTTCT 


21060 


CTCGATATCT 


GTTGAAGCAA 


GAGGTCGTGG 


AACTATCAAA 


CTAGGTAATC 


TTCACCAACG 


21120 


ATGGAGTCGA 


AAACAATTTG 


GTAAGTTTGT 


ACTTCCTGGA 


AATATCCTAC 


ATGATTCCAA 


21180 


GCGTCATGAA 


ATAAACTATT 


TCTTCCATCC 


AGGTGATTTT 


AAACCGCCTT 


TGACTCTCTA 


21240 
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TTTTGCAGGT 


TATCGACCTG 


LAunnOwn i 1 


CGAGGGTTAC 


TTTATGATGA 


AAACTCTTGG 


21300 


ATGTCCCTTC 


ATTTTATTTT 




TTTAGAGGGG 


GGAGCTTTTT 


ATCTCGGAAC 


21360 


GGATGAGCTA 


GAGGGAAAAo 


t* a a & rr/T A ^ A 


GATCACTCAC 


TATCTTGATT 


ATTTAGGCTT 


21420 


TGATCATAAG 


GATTTGATTT 


TATCAGGTtT 


TTCTATGGGA 


ACGTTTCCGG 




21480 


TCGTGCTTCT 


TTTGAACC CC 


ATGCCATCAT 


AGTTGCTAAG 


CCCTTGGCTA 


ATTTAGGAAC 


21540 


TATAGCTACT 


CGTGGACGTT 


TGGACCCACC 


GGGTGTCTCT 


AACTTAGCTT 




21600 


AATTCATCAT 


ACAGGTGGGA 


CAAGTTCTCA 


AGATATGACG 


GAGTTGGATC 


OwV. Va» k i. * & A Vff 


21660 


GAAAATTTTT 


AAACAAGCAA 


ATTTTTCAAA 


GACAACCTTT GGTTTATCCT 




21720 


TGAAGAAATG 


GATCCACAAG 


CCTATGAACA 


All Au lulti 


TATCTCTCTA 


IT A*~ AfiCTCf? 


21780 


GAAGATTTTA 


TCTAAAGGAA 


CTGCTGGACG 


irAraiTfiAT 

AUAWw\ 1 w#\ 1 


GATACACATA 


V,UAAlnl 1 1\. 


21840 


TTGGTTITTG 


CACTTTTATA 


GAATGGTCTT 


nunwnW A v»V* i 


TTTGGAAGGG 


AuAAAnuArii 


21900 


ATTATTACAC 


AGAGACAGTC 


TATTCATTGG 




GTGGGACTTA 


TATCTATvGA 


4. 17BW 


AC AACTGT AT 


CTTATTACCC 


TGACAAAAGT 




ATAATCCTCT 


A 1 iVLvn i\m 1 


22020 


GGTGAGATTC 


TAAAGACTTG 


GTTTTCTAGT 


GTCAATTACC 


AGGCTGCACG 


& a /"f*f^ A a<T**p 


22080 


CAGCTTCCCT 


TATTAAAAAG 


AAAGCAGGAG 


TATCAACTAT 


CACTGGTTTT 


•TV" A r"*Tf* TP & 


22140 


CCTGAAAATG 


GAGTTTATAC 


CAAGATAACT 


TTTTTTGACC 


GCTATGCTGA 


1 A 1 1 i A ^VJ^J^ 


22200 


AAAAAGGTAG 


AAAAAGTGAA 


AGATTTCATA 


TTTACTTATC 


CAGAAGATAG 


I 1 #\ A * • A 4 


22260 


CGAGTTTCTC 


TTTTAAGTGC 


TGGATTTGAG 


TCCTTAACTT 


TTTATCATTT 


TTCTATCAAG 


22320 


GAG AT C AGAA 


GTGTTTAGAC 


GTTTAGGTCA 


ACATTTCCAG 


CTTAGGAAAG 


TGAAAAACAT 


22380 


TTTAAAGCAG 


ATTAATCCCC 


TGAAAGGv-AA 


GATGTCCTCT 


CTTTCGGATC 


AAGAATTAGT 


22440 


AGCTAAAACA 


CTAGAGTTTC 


GTCAGCbTt* 


TTCCGAGGGA 


GAAAGTCTAG 


ACGATATTTT 


22500 


GGTTGAAGCT 


TTTGCTGTGG 


TGCGTGAAGC 


AGATAAGCGG 


ATTTTAGGCA 


TGTTTCCT^A 


22S60 


TGATCTTCAA 


GTCATGGCAG 


CTATTGTCAT 


GCACTATGCA 


AATGTTGCTG 


AGATGAATAC 


22620 


GGGGGAAGCT 


AAGACCTTGA 


CAGCTACCAT 


GCCTGTCTAT 


TTGAACCCTT 


TTTCACGAGA 


22680 


AGGAGTGATG 


CTTGTGACTC 


CTAATGAGTA 


TTTATCAAAG 


CGTGATGCCG 


AGGAAATGCG 


22740 


TCAAGTTTAT 


CCTTTTCTAG 


GATTGACCAT 


TGGTCTACCA 


TTTACGGAAG 


ATCCAAAGAA 


22800 


GGAGATGAAA 


GCTGAAGAAA 


AGAAGCTTAT 


CTATGCTTCG 


GATATCATCT 


ACACAACCAA 


22860 


TAGTAATTTA 


GGTTTTGATT 


ATCTAAATGA 


TAACCTAGCC 


TCGAATGAAG 


AAGGTAAGTT 


22920 


TTTACGACCG 


TTTAACTATG 


TG ATT ATT G A 


TGAAATTGAT 


GATATCTTGC 


TTGATAGTGC 


22980 
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ACAAACTCCT CTGATTATTG CCGGTTCTCC TCGTCTTCAC TCTAATTACT ATGCCATCAT 23040 

TGATACACTT CTAACAACCT TGCTCGAAGG AGAGGATTAT ATCTTTAAAG AGGAGAAAGA 23100 

GGAGGTTTGG CTCACTACTA AGGGCGCCAA CTCTGCTGAG AATTTCCTAG GCATTGATAA 23160 

TTTATACAAG CAAGAGCATG CCTCTTTTGC TCGTCATTTG GTTTATGCGA TTCGAGCTCA 23220 

TAAGCTCTTT ACTAAAGATA AGG ACT AT AT CATTCGTCGA AATGAGATCG TACTGGTTGA 23280 

TAAGGGAACA GGGCCTCTAA TCGAAATGAC TAAACTTCAA GGAGGTCTCC ATCAGCCTAT 23 340 

TGAAGCCAAG GAACATGTCA AATTATCTCC TCAGACGCGG GCTATGGCCT CGATCACCTA 23400 

TCAGAGTCTT TTTAAGATGT TTAATAAGAT ATCTGGTATG ACAGGGACAG GTAAGGTCGC 23460 

GGAAAAAGAG TTTATTGAAA CTTACAATAT GTCTGTAGTA CCCATTCCAA CCAATCGTCC 23 520 

GAGACAACGG ATTCACTATC CAGATAATCT ATATATCACT TTACCTGAAA AAGTCTATCC 23580 

ATCCTTGGAC TACATCAAGC AATACCATCC TAAGGGAAAT CCTTTACTCG TTTTTGTAGG 23640 

CTCAGTTCAA ATGTCTCAAC TCTATTCCTC TCTCTTGTTT CGTGAAGGGA TTGCCCATAA 23700 

TGTCCTAAAT GCTAATAATG CGGCGCGTGA GGCTCAGATT ATCTCCGACT CAGGTCAGAT 23760 

CGGGGCTGTG ACAGTGCCTA CCTCTATGGC AGGACCTGGT ACGGATATCA AGCTTGGTAA 23820 

AGGAGTCGCA GAGCTTGGGG GCTTGATTGT TATTGGGACT GAGCGGATGC AAAGTCAGCG 23 880 

GATCCACCTA C AAATTCGTG GCCCTTCTGG TCGTCAGGGA CATCCTCCTA TG AG T AAATT 23 940 

TTTTGTATCC TTAGACCATG ATGTTATCAA GAAATTTGGT CCATCTTGGG TGCATAAAAA 24 000 

GTACAAAGAC TATCAGGTTC AAGATATGAC TCAACCGGAA CTATTCAAAC GTCGTAAATA 24060 

CCGGAAACTA CTCGAAAACG CTCAGCATGC CACTGATAGT CCTGGACGTT CACCACCTCC 24120 

TCACACTCTG GAGTATGCTG AAAGTATGAA TATACAACGG GATATAGTCT ATAAAGAGAC 24180 

AAATCGTCTA ATACATGGTT CTCGTCACTT ACACCATGTT CTTGTCGATA TCATTCAGAG 24240 

ATATACAGAA GAGGTAGCGG CTGATCACTA TGCTAGTCGT GAATTATTCT TTCACTTTAT 24300 

TGTGACCAAT ATTAGTTTTC ATCTTAAAGA GGTTCCAGAT TATATAGATG TAACTGACAA 24360 

AACTGCAGTT CCTAGCTTTA TGAAGCAGGT GATTGATAAA CAACTTTCTC AAAAGAAAGA 24420 

ATTACTTAAT CAACATGACT TATATGAACA GTTTTTACGA CTTTCACTGC TTAAAGCCAT 24480 

TGATGACAAC TGGGTAGAGC AGGTAGACTA TCTACAACAG CTATCCATGG CTATCGGTGG 24 540 

TCAATCTGCT AGTCAGAAAA ATCCAATCGT AGAGTACTAT CAAGAAGCCT ACGCGGGCTT 24 600 

TCAAGCTATG AAAGAACAGA TTCATGCCGA TATGGTGCGT AATCTCCTGA TCGGGCTGGT 24 660 

TGAGGTCACT CCAAAAGGTG AAATCGTGAC TCATTTTCCA TAAAAGGAGA AAATATGACA 24 720 

ATTTACAATA TAAATTTAGG AATTGGTTGG GCTAGTAGCG GTGTTGAATA CGCTCAAGCC 24780 
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TATCGTCCTC 


GTCTTTTTCG 


GAAATTAAAT 


CTGTCCTCTA 


ACTTTATCTT 


TACAGATATG 


24840 


ATTTTAGCCC 


ATAATATTCA 


GCACTTAACA 


GCCAATATTG 


GTTTTGATGA 


TAATCAGGTT 


24900 


ATCTGGCTTT 


ATAATCATTT 


CACACATATC 


AAAATTGCAC 


CTACTACCGT 


CACAGTGGAT 


24960 


GATGTCTTGG 


CTTACTTTCG 


TGGTGAAGAA 


AGTCACAGAG 


AAAAAAATGG 


CAACGTTTTA 


2S020 


CGTGTATTCT 


TTTTTGACCA 


AGATAAGTTT •GTAACCTCTT 


ATTTGGTTGA 


TGAGAACAAG 


2S080 


GACTTCGTTC 


AACATCCCGA 


GTATGTTTTT 


AAGGGAAACC 


TGATTCGGAA 


CGATTACTTT 


25140 


TCTTATACGC 


GTTATTGTAG 


CGAGTATTTT 


GCTCCCAAGG 


ACAATGTTGC 


AGTCTTATAC 


25200 


CAACGAACTT 


TTTATAATGA 


AGACGGCACT 


CCAGTCTATG 


ATATCTTGAT 


GAATCAAGGG 


25260 


AAGGAAGAAG 


TTTATCATTT 


CAAGGATAAG 


ATTTTCTATG 


GAAAGCAAGC 


TTTTGTGCGT 


25320 


GCCTTTATGA 


AATCTTTGAA 


TTTGAATAAG 


TCTGATTTGG 


TCATTCTCGA 


TAGGGAGACA 


25380 


CGTATTGGAC 


AGGTTGTGTT 


TGAGGAAGCA 


CAGACAGCAC 


ATCTAGCCGT 


AGTTGTTCAT 


25440 


GCGGAGCATT 


ATAGTGAAAA 


TGCTACAAAT 


GAGGACTATA 


TCCTTTGCAA 


TAACTATTAT 


2SS00 


CACTATCAGT 


TTACCAATCC 


AGATAAGCTT 


GACTTCTTTA 


TCGTCTCTAC 


TGATAGACAA 


2S560 


AATGAAGTTC 


TACAAGACCA 


ATTTGCCAAA 


TATACTCAGC 


ATC ACCC AAA 


GATTGTT AC C 


2S620 


ATTCCTCT AC 


GCAGTATTGA 


TTCCTTGACA 


GATTCAAGTC 


AAGGGCGCAA 


ACCATTTTCA 


2S680 


TTGATTACGG 


CTTCACGTCT 


TGCCAAAGAA 


AAGCACATTG 


ATTGGCTTGT 


GAAAGCTCTG 


25740 


ATTGAAGCTC 


ATAAGGAGTT 


ACCGGAACTA 


ACCTTTGATA 


TCTATGGTAG 


TGGTGGAGAA 


2S800 


GATTCTCTGC 


TTAGAGAAAT 


TATTGCAAAT 


CATCAGGCAG 


AGGACTATAT 


CCAACTCAAG 


2S860 


GGGCATGCCG 


AACTTTCGCA 


GATTTATAGC 


CAGTATGAGG 


TCTACTTAAC 


GGCTTCTACC 


25920 


AGCCAAGGAT 


TTCCTCTGAC 


CTTGATGCAA 


GCTATTGGTT 


CACGTCTACC 


TCTAATTGGT 


25980 


TTTGATGTGC 


CTTATGGTAA 


tcagaccttt 


ATAGAGGATG 


GGCAAAATGG 


TTATTTGATT 


26040 


CCAAGTTCAT 


CTGACCATGT 


ACAAGACCAA 


ATCAAGCAAG 


CTTATGCCGC 


TAAGATTTCT 


26100 


CAATTGTATC 


AACAAAATCG 


TTTGGAACCT 


ATGCGTGCCT 


ATTCTTACCA 


AATTGCAGAA 


26160 


CGCTTCTTGA 


CCAAAGAAAT 


TTTAGAAAAG 


TGGAAGAAAA 


CAGTACAGGA 


GGTGCTCCAT 


26220 


GATTGAACTT 


TATGATAGTT 


ACAGTCAAGA 


AAGTCGAGAT 


TTACATGAAA 


GTCTAGGCGC 


26280 


TACTGGTCTT 


TCTCAACTTG 


GAGTGGTCAT 


CGATGCAGAT 


GGTTTTCTGC 


CTGATGGTCT 


26340 


CCTTTCTCCT 


TTTACCTATT 


ATCTAGGTTA 


CGAGCATGGA 


AAACCTCTCT 


ATTTTAATCA 


26400 


AGTTCCCGTT 


TCAGATTTTT 


GGGAAATTTT 


AGGAGATAAT 


CAGTCTCCTT 


GTATTGAAGA 


26460 


TGTCACGCAG 


GAGAGGGCTG 


TCATTCATTA 


TGCTGATGGA 


ATGCAGGCTC 


GCTTGGTTAA 


26520 
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ACAGCTAGAC 


TGGAAAGACC 


TAGAAGCTCG 


AGTACCTCAG 


CTTCACCACT 


ACAATCGCTT 


, 26580 


CGGAGCTTGT 


TTTGCTACAA CGACTTATAG 


CGCAGATAGC 


CAGCCGATTA 


TGACAGTTTA 


26640 


CCAAGATCTC 


AATGGTCAAC 


AAGTTTTACT 


GGAAAACCAT 


GTGACGGGTG 


ATATCTTATT 


26700 


GACTTTGCCA 


CCTCACTCCA 


TGCCTTACTT 


TGCAAATAAA 


GTTGAATTTA 


TCACCTTCTT 


267 60 


TTTGCAAGAT 


TTGGAAATAG 


ATACCAGTCA 


GCTTATCTTT 


AATACTCTAG 


CGACTCCTTT 


26820 


CTTGGTTTCC 


TTCCATCATC 


CAGATAAATC 


TGGCTCCGAT 


GTCTTGGTAT 


GGCAGGAACC 


26880 


TCTCTATGAT 


GCCATTCCAG 


CTAATATGCA 


CTTGATTTTC 


GAAAGTGATA 


ATGTGCGTAC 


26940 


TAAGAAGATC 


ATCATTCCAA 


ATAAGGCGAC 


TTATGAGCGC 


GCTTTAGAGT 


TAACTGACGA 


27000 


GAAATACCAT 


GATCAGTTTG 


TGCACTTGCG 


TTATCATTAC 


CAGTTCAAAC 


GTGATAATTT 


27060 


CCTAAGACGA 


CATGCCTTAA 


TCTTGACCAA 


TTCAGATCAG 


ATTGAGCAAG 


TAGAAGCAAT 


27120 


CGCAGGAGCC 


TTGCCTGATG 


TCACTTTCCC 


TATTGCAGCG 


GTGACAGAGA 


TCTCTTCTAA 


27180 


GCTCTTAGAC 


ATGCTTTGCT 


ATCCTAATCT 


GGCCCTTTAC 


CAGAACGCTA 


GTCCACAGAA 


27240 


GATTCAGGAG 


CTGTATCAAC 


TGTCGGATAT 


TTACTTGGAT 


ATAAACCACA 


CTAATGAGTT 


27300 


GCTACAGGCA 


GTGCCTCAGG 


CCTTTGAGCA 


CAATCTCTTG 


ATTCTTGGCT 


TTAATCACAC 


27360 


CCTGCACAAT 


AGACTTTATA 


TCGCTCCACA 


CCATCTATTT 


CAAACTAGTC 


AAGTTGCTGC 


27420 


TTTGGTTGAG 


ACCATTAAAT 


TGCCCCTTTC 


AGATGTTGAT 


CAAATGCCTC 


AGGCACTTGG 


27480 


CAAACAACGC 


CAACATGCAA 


ATTATGTTGA 


CTTGGTGAGA 


TATCAGGAAA 


CCATCCAAAC 


27540 


TGTTTTAGGA 


GGCTAACATG 


TCAGAGGAAG 


ATTTATTTTA 


CAAAGACGTT 


GAAGGCCGCA 


27600 


TGGAAGAGTT 


GAAACAAAAA 


CCCATCAACA 


AGGAAAAAGA 


AACCCGAGGG 


GAAAAGATTA 


27660 


GTAAGACTTT 


TTCACTTTTA 


CTGGGTTTGA 


TGATTCTGAT 


TGGTTTGCTC 


TTTACTTTGC 


27720 


TGGGAATTTT 


GAGGTAGATC 


TATGATTGAA 


ATACTAATTC 


TTTTAGCTAT 


TATCCTATCT 


27780 


CTTGCTTTGA 


TTGTATTGGT 


AACTATACAA 


CCCCGTCAAA 


ATCAACTATT 


TTCCATCGAT 


27840 


GCCACTAGTA 


ATATTGGTAA 


ACCAAGCTAC 


TGGCAGAGCA 


ACACCTTGGT 


CAAGGTGCTC 


2790O 


ACTTTATTGG 


TGAGTTTGGC 


TTTATTTATT 


CTACTATTAA 


CCTTTATGGT 


G ATT ACTT AT 


27960 


AAATAAAAGA 


AAACTTCAGA 


TATTCACCTT 


TTGTGGATTG 


GTCTGAAGTT 


TTCTTTTTTA 


28020 


TACTCAATGA 


AAATCAAACA 


GCAAACTAGC 


AAGCTAGCCG 


CAGGCTgCTC AAAACACCGT 


28G80 


TTTCAGGTTG 


TAGATATAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTAGAT 


28140 


ATAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


28200 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


AGAACTGACC 


AAGTCAGCTC 


AAAACACCGT 


28260 


TTTG AGGTT G 


TGGATAGAAC 


TGACGAAGTC 


AGCTCAAAAC 


ACCGTTTTGA 


GGTTGTGGAT 


28320 
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AGAACTuALVf 


AnuC LLAu 1 A 


Af" AT AT AT AC 

Aw A lAtAi Aw 


AG C AAGGCGA 


CGCTGACGTG 


CTTTGAACAG 


28380 


jm k f«HA«k *ft#fft^v«n^ 

TATTACTGTC 


TATATTTTTG 


Vj 1 AAAAA X L.n 


A fTTTTACTT 


GGATGAAGGT 


TTTGGCTTCA 


28440 


CGTAGGAGTT 




GGfGPGGG TT 


TCAAATTCTT 


CTCTTGTCTT 


GGCCAGACTG 


28S00 


CGGTTCCGGA 


ft ^ ft /«T«'1V*^*IIP 

AGACTTV.CA6 


A 1 AAUj I 1 W» 


ATTTCATCTA 


GCAAATCAGA 


AGCAGGATTG 


28S60 


GTCTGGCTCA 


GTTGACCTOC 


AATTTTTGAA 


AAGAGTTGCG 


CTAAGATCAG 


GCTTTCACTG 


28620 


GCGGCAACGT 


GACAAGTGTT 


AATCTGTTGG 




TPACGATACC 


r^v» A * A ^* * w^^w 


28680 


TGTCTCATCT 


CAAAGTAGTG 


^ m «it ft mi<pm ft /** 

GATATGGTaG 


TCTGTCTGGT 


GAAAGAGCTG 


GTCAGAGTGA 


28740 


TCCAAATAGA 


CCAGTCTGAG 


GGCTTCTTTC 


ft ft K KClff** IYt*T» 




TACCACCTGT 


28800 


GCTCGGTTGC 


GTCCGTCTCC 


TCTGGATAAA 


TAGT ATTTG A 


AGf*GPTCGAG 


GAT ATCTTTT 

\wFff» A « ft \* ft 4 ft ft 


28860 


AACTTTTCTT 


CCACCAGCGT 


GTGGTAGTGC 


TGGATTTLL T 


^^^*^^*T*ft r ™^^n^ A 


ACGCATATAG 


28920 


AGATTAACAA 


GCAAGGCAAA 


TCCTGTACCA 


ft IT* »> /""»/"• ft. K ft f* R 


UA/\UUA/\ 1 1 w 


ATTGACTAGA 


28980 


AGGTCTGGAG 


AGGTTGACTC 


TTGAACCAAG 


ft /» « nwMMm ft ft 

AuATCCCTAA 


IV h. & AI^Af^TP 
(.LAAAACAw 1 


firTTRfiTCTG 

V*V» 1 1 1U1VJ 


29040 


ATGCCAATTT 


CCCAGCCCAT 


CTTGTAGGCT 


ft ft & r*?* » ft ^r**t* 


Aw nAuuC wAv 


ATAGAGGCCG 


29100 


AGACTCCAGA 


TATGAAATCC 


GCTCAAGTGA 


AAAu^ lAynA 


f & Cf*CZ. AT AG f* 


CACACCTACA 


29160 


AGCATAGAAA 


AAAGACGATT 


GC G AG CCAGT 


i I I AaAIj 1 M\. 




ATCAGATAGG 


29220 


CTCAAGAGAG 


CGATAATTCC 


A S** S*^*** ft ft m ^K%ft 

AGCCGAAACT 


/~*f^C Kff* & A A 
wbTViAwAnA 


G ATTG AGAAA 
uAl A wAwwv" 


ATAAGCAACC 

A ^*^^ftJ ^m^^m^\J^m 


29280 


AGGCAGGCAA 


G ACAGGT AG C 


TAAGATGAGC 


n (Ail lvi i al. 


Oil uuU 1 AA 1 


AGACATAACA 


29340 


ATTTCCTAAT 


AACTTAGAAT 


AAAAGCGTAA 


AAunlbAAUnL 


ATGAGPACGC 


TTCC f'l^lXj At 1 


29400 


GAGTTATTTT 


TTACGGGTTG 


vTrOCOTATTt 




GTAAAGAGGA 


CATCTGTAGA 


29460 


AGAGTTAAGG 


GCTGTTTCAC 


A 1 ww 1 v» 1 1 VJ 


GATGACACCA 


ATCACAAAAC 


CAACCCCAAC 


29520 


AATTTGTATG 


UCAAIA I i 


1 AUfvM 1 ALw 


CAAAACGCTA 


CAAGCAACTG 


GGATAAGAAG 


29S80 


GAGGGAACCT 


CCGGCAATAC 


CTGAAGCATC 


ACAGGATGAG 


ATAGCTGCTA 


r+r* ft Aft ft /vft/"* ft A 


1 tlC A ft 


GACAAACGCT 


GTGGCAAAGT 


CAACAGGAAT 


TCCAAGAGTG 


TTAACTCCAG 


CAAGGGTCAA 


29700 


AACGTTAATG 


GTAATCGCTA 


CTCCAGCCAT 


ATTCATACTA 


GAACCGAGTG 


GGATAGAAAC 


29760 


AGAATAGCTA 


TCTGGGTTGA 


GTCCAAGCTC 


ATGGCACAGT 


TTCATGTTGA 


CAGGAATGTT 


29820 


AGTCGCAGAA 


CTACGAGTCA 


AAAAGGCTGT 


CACACCGCTG 


ACACGGAGGC 


AGTTCCAAAC 


29880 


TAGAGGGTAA 


GCATTGCCTC 


TCATAAACAA 


GAAGGCAATC 


AAAGGGTTGA 


CCACAGGGGC 


29940 


AACAAAAAGC 


ATAGTCGTTA 


CTAAT AG AAC 


CAATAAAATA 


CCGTAGTTGG 


CAAGGCTTCC 


30000 


GACTCCCTTG 


TCAGAAATGG 


TTTTAAAAAC 


AAGACCAAGG 


ATTCCAAATG 


GAGCCAGATT 


30060 
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CATGATCCAT TCGACAATTT TAGAAGTCAC GTCAGCGATA GTTTTTAGCA ATTCTTCACT 30120 

ATTTTTACTC GCTTCTCTCA TAGCGATTCC AAAAATGACT CCCCAAGATA ACATTCTAAT 30180 

ATAGTTAGCA GTAAGCAGGG CGTTGACTGG GTTGTCAACC A G TTTGAGCA AGAGGTTGCT 30240 

GAGAACCTGC CCAATCCCAT CTGGTGGTGC AATTTCAGTA TTGGCACTAT TTGGGGTAAT 30300 

TTCAATAGGG ACGATGAAAT TTGCTACTAC ACCTACAACA CCAGCGCCGA AAGT CCCTAT 30360 

CATAGGATAT ACAAGAAAAC AACAGTTTTC ATATTGCTAT CTTGTCCCTT TTCATCTTGG 30420 

GAAAGGGCAT TGGCAACGAG AGCAAAGACT AGGATAGGAG CAACAGCTTT TAGACCTCCA 30480 

ACGAATAAAT CCTCGAGTAG CCCAATCCCT GAG AG ATT AG GAAGGCTCAG TCCTAGGATT 30540 

CCCCACAAAG CATACCAATC AAGATACGCT TGACAAGGCT TCCCTTATTC CAAGCATGAA 30600 

TGATTCTTTT CATAATAATC TCCTTTTTGT CTACTGATTA TGATTATACT AT AAATGAT A 30660 

GACAAAATCA AGAATTTTCT GTCTATTTTT TGAATATTTA TGGAGAATGA GACTGATGAA 30720 

AATATGGTAT AATGAAATAA ACGAGTTTTA TATGCAAAAA TTTATTCAGG CTTATATTGA 30780 

AAAGCTAGAT GTCACAACCA TTATCGAGAA TATTCTAACC AAGGTCATTT CTCTTTTACT 30840 

GCTTTTAATT GTATTTTATA TTGCTAAAAA AATGCTTCAT ACCATCCTGC AGAGAATTGT 30900 

CAAACCTTCT CTAAAAATGT CTCCTCATGA TGTTGGACGC CAAAAAACCA TCTCACCTTT 30960 

ACT AGAAAAT CTGTTTAATT ATACGCTATA TTTCTTTTTA CTCTACTGCA TTTTCTCGAT 31020 

TTTAGGTTTG CCAGTTTCTA GTTTGCTGGC TGGAGCTGGT ATTCCTGGGG TAGCGATTGG 31080 

TATGGGAGCC CAAGGCTTTC TGTCTGATCT CATCAATGGC TTTTTCATCC TCTTTGAACG 31140 

TCAACTGGAT GTGGGAGATG AGCTCCTTCT GACAAATGGA CCGATTACTG TATCGGGTAA 31200 

GGTTGTCAGT GTGGGAATTC CTACCACACA GCTTCCTACC GAGGACCAAG CCCTTCACTT 31260 

TGTCCCTAAC CGAAATATCA CAGTTGTTAG CAATTTCTCA CGCACAGAC7 ACACCTGTTA 31320 

TTTTAAGTAA TTTGTGGTAC AATAGAGGGA CTTTAATAAC CACAAAAGAT GCTTTTAGAA 31380 

AAGCACTTCG GCAATCGTTC TACCTGGATA GACCTAGACC TAGGAAAGTT GAATAAACTA 31440 

CAACACCTTT CTGAAATTTA CGGTTTGGAC AAGGAAACCA TTGAATACGC ACTGGATAGA 31500 

AACGAGCGCG CCCACATGGA CTACCACCGT GAAAGTGAGA CGGTTACCTT TATCTATAAT 31560 

CTCTTAGACG TAAAAAAGCA CAAGGCCTAC TATCAGACTT TTCCCATCAC CTTTATTGTC 31620 

GACCATCGTC GCCTGATTAC CATTAGTAAT ACCAAGAACG CCTATGTCAT TGAACAGATG 31680 

ACTCGTTATC TGGAGAACCA TGACACGCTT TCGATTTATA AGTTTCTCTT TGCCAGTCTG 31740 

GAAATCATCA GCAATGCCTA CTATCCTGTC ATTGAGCAGA TGGACAAGAG TAGGGATGAG 31800 

GTCAATGACC TCTTGCGCCA CCGAACTACC AAGAAAAACC TCTTTGTCCT CTCTGATTTG 31860 



WO 98/18931 



PCI7US97/19588 



593 

CACACTGCTA TCCTTTATCT CACGGCAGCT GCCAAACAAA 
ATTCAAGGTC ATGCCTTGTA TCCTACTTTT GATGAGATTC 
GCCATGATTG AGGCTCATCA GCTGGTATCC ATGACAGACC 
CAGCTTTCAG CCTCTTACAA CAATATTCTA AACAATAATC 
TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCC 
AATGTTCCCT TACCTTTAAC AGATGAGCCC CATCCTTGGC 
GCAGGTTTGT CGATTGTTTT ATCCTTGTTA CTAAGGAAAA 
AAGGAGCCAG AATGGCGATT GAAAATTATA TACCAGATTT 
ATCTCACACT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC 
ATACCCTCAT TGCTTGGAAC AACCCTGATG GAACGCCAGA 
ACCTTCCGGA CGCGGGTATT GGCATTATCG TAGTGTCAAA 
AACGAGCACT TGAGAAATTT GGGATTGATT ACGTTTACTG 
TTGGTATTGA CCGTGCTATG AACGAATTCC ACTATGACAA 
CTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTCC 
TCAAACCCTT GGTCCAACAT GACTCAATCA AAACGCAGAT 
GTGTTATG 

(2) INFORMATION FOR SEQ ID NO: 72: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



ATCGGATTTT 
AGAGAGAACA 
TAATCTCTCA 
TGAATGACAA 
TGACAGGCTT 
TCTATATCAG 
TTGCGAAAAA 
TGCTGTGGAA 
TGTTTTGGTC 
GATGAACCAA 
TAACACCAAA 
GGCCTTGAAG 
AAAGGAAGTG 
AGGGATTCGG 
TAACCGAACT 



GTTAGAGCAT 
GTTTGATGAT 
GATTTTACAG 
TTTGACAACC 
TTTCGGAATG 
TTTGGCTAGT 
AAGTTAAGAA 
GCACTCTATG 
GATTTGGATA 
TGGCTACATG 
AAACGCGTTC 
CCCTTCACAT 
GTCATGGTTG 
TCAATTTTAG 
CGTGAGCGTC 



31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32S20 
32S80 
32640 
32700 
32760 
32768 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCAGTCACAA AGAAATTGAG CGCGTTCAGc TCAGGATCCA CTATGATGCA 
CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA 
AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG 
TTACCGTAGA AAAGGGTGAC TACGTTGCCA TCATGGGTGA GTCTGGTTCT 
CTCTTCTCAA TATTCTAGCT ATCTTGCATA AACCAAGTCG TGGTCAGGTT 
GAACTCACAC CGC AACT ATT AAAAATTCAC ACCCTTCTAG TTTCCGGCGT 
GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TCTTAAGGAC 



AGCTACATTT 
AAACACGTTC 
GATATTCACT 
GGTAAATCAA 
TACTTGAATG 
GAAAAGCTAG 
AATATCTTGC 



60 
120 
IBO 
240 
300 
360 
420 
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TTCCGCTTGT CTTGTCAAGA AGACCTATAA CGGAGATGAT GAAGAAATTG GTGGTGACAG 480 

CTGAGAATCT GGCTATTAAC CAATTGCAAG AGAAGTACCC TTACGAGATT TCTGGTGGTC 540 

AGAAACAGCG TGTAGCAGTA GCCCGCGCCA TCATCACAGA ACCTGAAATT CTCCTTGCGG 600 

ACGAGCCAAC AGGACCCCTT GATTCCAAGT CATCTGCAGC CTTACTTGAT GTCTTTAATG 660 

AAATCAATGA GCGTGGGCAA ACCATCCTCA TGGTAACCCA CTCAACAGCA GCTGCTAGCA 720 

GGGCCAAGCG TGTTCTCTTT ATCAAAGACG GCATTCTTTA CAACCAAATC TACCGTGGAG 780 

AGAAGACAGA GCCTCAGATG TTCCAAGAAA TCTCTGATAC CTTGACTGTC ATGGCAAGCG 840 

AGGTGAATTA GTATGTTTCG ATTAACCAAT AACTTAGCGG TATCGAACTT GATTAAAAAC 900 

CGCAAACTCT ACTATCCCTT TGCACTGGCT GTTCTCTTGG CAGTCACCAT CACCTATCTC 960 

TTTTACTCCC TAACCTTCAA TCCAAAGATT GCGGAAATCC GTGGAGGAAC CACCATTCAA 1020 

GCAACACTTG GATTTGGTAT CTTTGTCGTT ACCCTTGCGT CAcCATTATC CTCCTCTATC 1080 

CCAATAGTTT TGTCATGAAA AACCGTTCCA AGCAACTGGG TATATATGGC ATCTTAGGCT 1140 

TGGAGAAGCG CCATCTAATC AGTATGACCT TTAAGGACTT AGTGGTATTT GGGATTCTAA 1200 

CTGTTGGAGC GGGTATCCGT ATTGGAGCCT TGTTTCACAA GTTAATTTTC G C TTTCCTCC 1260 

TCAAACTAAT GAAACTGAAG GTTGAGCTGG TTGCTACCTT CCAAATGAAT GTTGTCATTG 1320 

CAGTACTTGT TGTCTTTGGA TTGATTTTCC TAGGCCTCAT GTTCCTGAAT GCTCTTCCAA 1380 

TCGCCCGTAT GAATGCCCTC CAGCTCTCGC GTGAGAAACC AAGCGGAGAG AAAACAGGTC 1440 

CCTTCCTACC TCTCCAAACG ATTCTTGGTT CCATAAGTTT AGGGATTGGC TATTATCTTG 1500 

CCCTTACGGT AACCGATCCT CTTACAGCCC TAACAACTTT CTTCCTAGCT GTTTTGCTGG 1560 

TTATCTTTGG TACTTATCTA TTGTTTAATG CAGGGATTAC AGTCTTCCTA CAAATCTTAA 1620 

AGAAAAACAA GAAATACTAT TACCAACCTA ATAACCTCAT ATCT G TTTCC AACTTGATTT 1680 

TCCGTATGAA GAAAAATGCG GTTGGACTAC CAACCATCGC TATTTTGTCA ACAATCCTTT 1740 

TCGTAACCAT GTCAGCAGCG ACAAGCATTT TCAATTCCGC AGAAAG C TTT AAAAAAGTTC 1800 

TAAATCCTCA TGATTTTGGG GTTTCAGGGC AAAATGTTGA AAAAGAAGAT TTGGACAAAC 1860 

TCTTGAGCCA GTTTGCAAGT GACAAAGGTT ATAGTCTCAA AGAGAAAGAA GTACTTCGTT 1920 

ACAGTAACTT TGGTATTGCA AATCAAGAAG GAACCAAGTT AACTATTTTT CAAAAAGGAC 1980 

AAAACCGTGT CCAACCCACA ACAGTTTTCA TGGTATTTGA CCAAAAAGAT TATGAAAATA 2040 

TGACTGGTCA AAAACTGTCT CTATCAGGAA ATGAGGTCGG TCTCTTTGCC AAAAATGACG 2100 

GACTGAAAGG ACAGAAAGCT CTAACTCTAA ATGATCATCA ATTTTCTGTC AAAGAAGAAT 2160 

TTAATAAAGA TTTCATTGTG AACCATGTTC CAAATAAGTT TAATATCTTG ACTACTGATT 2220 
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ACAATT AC CT 


iWi 1WW1 


GATTTACAAG 


CCTTTTTGGA TCAATTCCCA 


GATTCGGCTA 


2280 


TCTATAATCA 




CCTATGAATG 


TAAATCTCAG 


TGAACAAGAA 


CAACTCAAGG 


2340 


TCGCTGACGA 


OTA i\*#V»/w*W 


TACCTCAATC 


AATTTAATGC 


TCAATTAGAC 


ACAGAAGGTA 


2400 


GCTATGTTTA 


TGGTAbCAAl 


rTAGCAGATG 
w 1 now*w* * v 


CTACTTCTCA 


GATGAGTGCC 


CTCTTTGGTG 


2460 


GTGTCTTCTT 


TATCGI#TA1 1 




TTATCTTTAT 


GCTCGGAACT 


GTTCTGGTCA 


2520 


TCTACTACAA 


AC AAA ill U 


fl A A ttGCTACG 


AAGACCGTGA 


ACGCTTTATT 


ATCTTGCAGA 


2S80 


AAGTCGGTTT 


GGACCAAAAU 




AAACCATCAA 


CAAACAGGTT 


TTAACTGITT 


2640 


TCTTCCTTCC 


TTTGCTCTTT 


rrrTTTATAC 

UUL 1 1 W/\ 1 #*w 


ATCTCGCCTT 


TGCCTACCAT 


ATGCTTAGCC 


2700 


TCATTTTAAA 


AGTGATTGGT 




CGACTATGAT 


GTTGATTGTG 


ACCTTCTCTA 


2760 


TCTGCGCTAT 


CTTCCTCATC 




TGATTTTCAT 


GATTACTTCA 


AGAAGTTATC 


2820 


GCAAGATTGT 


GCAAATGTAA 


AAAAuni AUU 


TCGACTTCAA 


AATCGAGGTA 


TTTCTTGTAT 


2880 


TCTAAATGCT 


GAAAAGTTGT 


CCGACCAW»A 


AGGTAACTCC 


CATCCTCAAG 


AGACCAATAG 


2940 


CAAGCTTCCG 


AATCATAGCT 


GTTTTGGTTV* 


GCGCTTTTCC 


AAGTCTAGCA 


CTTGTGTAAC 


3000 


CAGTGAGAAG 


AAGGGCCACA 


CCCACAA 1 AA 


GGACGGTAGC 


AGCGATCCGC 


TAATCACTTG 


3060 


GAAAAATGGT 


CACTGACAGC 


ATI VjUMuuUM 


AACTTCTAAG 


GAAAAAGGCA 


ACGAAGCTAG 


3120 


AAATGGCAGC 


GTCCCAAGGA 




CTTCATACTC 


AATCCCATAT 


TTTTCCTCTA 


3X80 


CCAGACCCTT 


GACTGGATTT 


«n<*> » Af; AAAG A 


TCTTATTGGT 


CAAGAGTTGG 


G C AG AAGTTT 


3240 


TGAATTCTCC 


ATTTTGGATA 




AGACCGATTT 


TTTGGCTAGT 


TCCCTATCTT 


3300 


CGTCTAGCAA 


GAGTTTTTCT 


WWWWrWVtW V>V* 


CAGCTTCCTC 


GGTATCTTTT 


GGAGTTGAAA 


3360 


C CG AT AC AT A TTCTCCACCA 


nr**"A'^ r GAAA 


ACGCACCACC 


TAAGATAGCC 


GTAAAACCTG 


3420 


ATAAAAAGAT 


AATCCAGATA 


TTGGTCCTGG 


CACTGGCAAC 


TCCGATAACC 


ACACCAGCAA 


3480 


TGGAAATAAT 


TCCATCGTTA 


GCATCAAGAA 


CACCCGCACG 


CAGGATATTT 


AAACGACCTG 


3540 


CAAAATTTGA 


ATCAATTTCG 


TGATTTGTTT 


CTGACGCTAA 


ATTTCAAGTT 


CAAGTTAGCC 


3600 


ATCAAGAAGT 


CTTCTCTGGG 


TGACTTGTAG 


TCCAAGCATT 


TTTTAGGATA 


GTTCTTAATC 


3660 


CACTTTTCGA 


TGAATGCGAC 


TTCTTTGGGA 


GTCATTTTCT 


TGGTTCCCTT 


AGGTAACCAT 


3720 


CTACGAATGA 


GCCTGTTGTG 


ATTCTCATTA 


GTTCCCCTTT 


CCCAAGAGGC 


ATACGGATCT 


37 B0 


GCATAATAAA 


TGTGCTCCTC 


AGAAAATACA 


TTAGACAAGC 


GATTGAATTC 


CGTTCCATTA 


3840 


TCTGCCGTGA 


TGGAAAGAAT 


CTTGTGTTGT 


TTTAAGATGA 


GTTTTAGAGC 


CTGATTGACC 


3900 


ACATCAGCAC 


TTTTATTTGG 


AATCAATCGG 


ATGATCTGAT 


GTCTACTTTT 


' TCGATCCGTC 


3960 
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AACACAAGCA ACCACTAGTT TTTCGCTCTC GTAAGTAGAA CTGTATCAAT CTCATAATGC 4020 
CCATTCTCCA AGCGAAAATT GATAGCTTCA ACCCGCTGTT CGATGCATTC ACCAGCAGGT 4080 
TTAAAGTTGG TGCTGGCCTG TTTCTTAAGC CCTTTTCCTT TTCTAGCGTA AAGCAGATCC 4140 
TCTTTGCTTA ACCCCAATTT TCCATGATGA ATCCAATAGT AAATGGTTGA AATTCCCACG 4200 
TTAACCCCTT TAGCCATCAC CATCATTTCA GGCGAAAATT TTTGGTTATG ATAGTGGAGA 4260 

ATCTTTTCCT TTAGTTCCTT GGTCAAGCTT GATTTCTTGA CCGAGCGCTT GCGATT G TTT 4320 

TCATAACACT GTTGAGCATA GTCGGCAGAA TAAACCTCTT TGAAGCGCCC TTTTCCAACA 4380 

CATTGTCGGA CTGTCCCACG CTTGATTTCA GTGTGGATAG TTTGAGGAAC TTTTCCAAGC 4440 

AGAGAGGCAA TTTCTCTATT TGATTTCCCT TCTTTTTTCC ATCTTTCGAT TAAGCGACGG 4500 

CTATCGATTG TCAAATGTTC GCCTTTTGTA GTATAATGGT TTTGCATCTC TGTGCCTTTC 4 560 

TTGTGTTTGT CGTTGAACAA CAAGTATAAC ACAGAGGTGT TTTCTTATGC CTACAAGAGC 4620 

TATCGGCTAG TTGAACCATC TAATTTTTAG GACGGCTGGC TGCCTAACTT CATTATAGAA 4660 

CTTTCATTTA CGAACATATA GTAAAATCAA ACAAGAACAG AACAAATCCA TCAGGACAGT 4740 

AAAATCTATT TCTAACAATC TTTTAGAAGC AGAGGTGTAC TATTCTACTT TCAATCTATT 4800 

ATATTTTTGT TTTTTATCAA AAAATACTTT ACAAGTTCTT AAAAACATGA T AT AGTAAT A 4 860 

AAGCTTAGAA AATGAGATGA TGTTTTCTAG CAAATATAAA CCCCACTAAA AAATGCCTAC 4920 

GCACAGGCAG GGTTGAATGC CGAAGCCTGG TTGAAAAGCC ACATTATTGA TAGGGTTAAA 4 980 

ACCCT AC TTT TATAACTTGA TGTTAGGACA CTTGTCCTAA TTCATAAATT TTTAGTGTGG 5040 

TGAAAGCACA CGTCATCTTG TGAAACGATC AATAAACTAC CTAATATTTG CTACTAGAGA 5100 

CTTAGCAAAC ATCCGCAACA GACATACTCA ACAGAAACCA AAATAAACAC GTCACAAGAT 5160 

TCCAGAGCAG GTGAAAACCT GCTCTTTTTT CATGAGTCAA CCTTTAGTTC CTTAGTTTTC 5220 

ATAACGTCCT AAAAATATTG AAAGGAGTAT CTTTTCAAAG AGTTAGATCA AAACCAAGCC 5280 

CCAATTTATG AGGCCTTGGT GAAGTTACGC AACAAAAGGA TTCTTCCCTT TGATGTTCCA 5340 

GGTCACAAGC GTGGACGGGG AAATCCAGAA CTTGTCGAAC TCTTAGCAGA AAAATGTGTA 5400 

GGCATTGATG TCAATTCGAT GAAACCTTTG GATAATTTAG CCCATCCTAT TTCGATTATT 54 60 

CCTGATGCAG AGGAGCTCCC TGCAGATGCT TTTGGAGCTA CCCATCCCTT TCTAATGAT? 5520 

GGTGGAACAA CTT CATCGGT GCAGACTATG ATTCTGGCAA CCTGCAAGGC AGGAGATAAG 5580 

ATTATTCTGC CACGAAATGT CCATAAATCT GCTATCAATG CGTTGCTTCT ATGTGGTGCC 5640 

ATTCCCATCT ATATCGAGAT GAGTGTAGAT CCTAAGATTG GTATCGCTTT AGCTCTTGAA 5700 

AATGACCGAG TACCACACGC CATAAAGGAC CATCCAGATG CTAAGGCTAT CCTAATCAAC 5760 



li 

{ 
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AATCCTACTT 


ACTACGGCAT 


CTGTTCAGAC 


CTAAAGGGGT 


TGACAGAAAT 


GGCTCATGAA 


5820 


GCTGGCATCA 


TGGTTTTAGT 


AGATGAAGCC 


CACGGACCGC 


ATTTGCATTT 


CACTGATAAA 


5880 


CTTCCAA*TTT 


CTGCTATGGA 


TGCAGGGGCT 


GATATGGCAG 


CAGTTTCCAT 


GCATAAGTCT 


5940 


CCTGGGAGTT 


TGACCCAAAG 


CTCCATTTTA 


CTTATCGCGG 


AGCAGATGAA 


TTCTGAATAC 


6000 


CTTCGTCAGA 


TAATTAACCT 


GACCCAGTCT 


ACATCTGCCT 


CTTACTTGTT 


CATGCCTAGT 


6060 


TT^HATATTT 


CACGTCGCAA 


CTTGGCCCTT 


CGTGGTAAAG 


AGTCGTTTGA 


GAAAGTCATT 


6120 




ACTATGCCCG 


CCGTGAAATC 


AATGCTATCG 


GTGGCTACTA 


TCCCTACTCA 


6180 


AAAAACTTAA 


TAGACCGTGT 


TTCGGTTTGC 


GATTTTGACG 


TAACTAAGCT 


GTCACTTTAC 


6240 






AGGTATCGAG 


GTTTATGACC 


TCTTGCGAGA 


CGAATACGAC 


6300 


ATTCACATCtt 






ATCTTGGCCT 


ATATTTCCAT 


CGGCCACCGC 


6360 


ATC\>AAunwn 






CTGGCTGATA 


TTAAGAGACT 


CTATTCAAGA 


6420 


CATWjAAAAIj 


Ail I\iA 1 AwW 


a ftr: AC AAT AT 


ATTCAGCCCG 


AGTTAGTGCT 


GTCTCCGCAA 


6480 


CAAGCCTTCT 


Al i v.«wAnA« 


AAAAACTTTA 


ACTTTGGATG 


ATTCTGTTGG 


ACAGGTCTGT 


6540 




TTATCTnTTA 


rCCTCCAGGT 


ATTCCTATCT 


TGGCTCCTGG 


TGAACGCATT 


6600 




TTCTCCACTA 

1 l w 1 WU#\W * r\ 


T ATCC AATTC 


GCCAAGGAAC 


CTGCTTGCTC 


CCTCCAACGC 


6660 


A^tjuMrtwi I V. 


TACACCTCAA 


TCATATCAAC 


GTTATTAAGA 


GAAAGACAAA 


CTATAAGAAA 


6720 




TTTATPTAAA 




TTTCAATTTG 


ATGATTTGGC 


GATGATTTTA 


6760 


ftAfZf*Af*ttCICA 


AAAAGCCCTT 


CAATTAGAAG 


CGCTCAATCG 


CTTAATTTCT 


ATCAGCTTAT 


6840 


TAAATCCTCC 


CTCAAGCCTT 


TTCTGAGGAT 


TAGGGTAGCG 


TCTCAAGAGT 


TGGTACGTAT 


6900 


ATTCTGAATG 


CTTTCCAACG 


ATTTTATCCA 


ACTCAGGAAA 


GATGATATCA 


ACACAACGAG 


6960 


TGTATTGTAC 


TTTCCAATCA 


GACTCTTTTT 


TCTTGAGACG 


ATGAATATGT 


CTAGCCAGTA 


7020 


TTTTTAGTTC 


TACTTGCCGA 


TTATCGTGTT 


GAAATTGTTC 


ACGATTGGGG 


TWAbAAAuAA 


/vow 


GTTTAAGAGC 


GATGCCATGA 




TATCCGTTTT 


AGTTTTGCGA 


AGTGATAATG 


7140 


ATTTGGCAAA 


TTTCTTGATG 


AGCAAAGGAT 


TGTAGGTCTA 


AACTTTATAT 


CCTTGTTCAT 


7200 


GCAGGAAGTT 


CAGTAGATTA 


AAGGCATAAT 


GTCCGGTATT 


TTCAAGAGCG 


ATGAGACAGT 


7260 


CTTGCTTGAG 


CTGTCGAAGA 


GACAGATCTA 


AGAGTTCAAA 


ACCAGCTTTA 


TTATTTGAAA 


7320 


AAGTGAGTGG 


TTTAAGAACA 


GTTTTTCCTG 


GAACATTCAA 


GGCTGTAACA 


TCGTCTTTAT 


7380 


TTTTAGCGAC 


ATCAATGCCC 


ACATAAAGCA 


TGGGAGTATC 


TCCAGATATA 


GTATTTCAAG 


7440 


TCTACTGGGT 


TATCCACGAA 


CTTTTTGCCT 


TGTTACCTTA 


GACGAGATAA 


AACGTCTATC 


7500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTCG TTAGAGCCTT TCGGAAATCO 
TCAAGCCATT GGACGAAATG AACTAATCCA CACTGGCTTA TTCCAAGTAT ACCACTTGGG 
CTTTGGCACT AGCTAACTGC CCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 
rrCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 
TGGAAAAAGT GAATCGCAGG ATATCGAAGT CTTGGATACG CCACCTTTTG GGAAAATACT 
CATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 
CGTTCACGTT CCCATGGCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 
TGACGGCGGT GITCCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 
GGAACCCGAT GAGATGTTGG TCGAGGTCTG TCCTGACTAT TTCCCAGACT TTGCTGCAGG 
GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 
CGAAGATGAT TACGATATTA TCATCAACGA TGCCACAGAT CCATTTGCCC ATACCGAACG 
ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG ACGCCATCAT 
GATTTACCAG CATCGGACTC CCTTCTTTGA CGAGGATGAG TCGGCCTCCC GAACCATGCA 
CCCCAAGGTC AATCAAGCCT TTCCAATCAG TCCCGTTTAT CACGCCCATA TTCCAACTAG 
CCCAGCTGGC T A TTGGTTGT TTGCATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 
TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 
CCTGGGAGCC TTTATGTTGC CCAAGTATGT TCAGGACATT TTAGAAGAAG AGGAAGGAAA 
AAAATGAGTC GTTTACTAGT TATTGCTTGT GGGGGCGTTG CCCAAGTTGC TATTTCAAAG 
ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATGATTC CTAGCCCTAC CAAGTCAAAA 
TGCGATGACT TGAAAGCGAA GCTAGAAGCC AAAACAAGTA CTAAAATTGA AACTGCAGCA 
CTTGATGCTG ACAAGGTTGA AGAACTGATT GCCCTGATTG AAACCTACAA ACCACAAGCT 
CTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTCGCAACA 
GGTCTTCACT ATATCGATAC AGCCAACTAC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 
CGTGCTATCT ACGAAAAACG TTGTAACCAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 
TGGCAGTGCG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 
GGTTTTGACC CAGGTGTAAC TAGTGTCTTT TCAGCTTATC CCCTCAAACA CTATTTTGAT 
GAAATCCATT ATATCCACAT TTTAGACTGT AATGGCGGTG ACCACGCTTA TCCATTTGCA 
ACCAACTTTA ATCCAGAAAT TAATCTCCGT GAGGTTTCTG CCCCAGGTTC TTACTGGGAA 
GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 
GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT CGCCAAGAAC 



7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

6400 

8460 

8520 

8580 

8640 

8700 

8760 

6820 

8880 

6940 

9000 

9060 

9120 

9180 

9240 

9300 
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ATTCCAGCTG 


TC AAACGC AT 


TCGTTTCTTT 


ATGACTTTTG 






9360 


ATGAAATGTC 


TTGAAAATGT 


TGGACTCCTT 


CGTACGGATA 






9420 


GAAATTGTTC 


CAATTCAATT 


TTTGAAAGCC 


TTGCTTCCAG 






9480 


CGTACACTCG 


GAAAAACCAA 


TATTGGATGT 


ATCTTTACAG 


GTGTCAAAGA 


CGCTGTCAAA 


9540 


AAGACTATCT 


ATATCTACAA 


TGTCTGCGAC 


CATCAGGAAT 


GTTACGCAGA 


GGTTGGTTCG 


9600 


CAAGCTATTT 


CTTATACGAC 


AGGACTTCCA 


GCCATGATTG 


GGACAAAATT 


AGTCATGAAC 


9660 


GGAACTTGGA 


AACAAGCTGG 


AGTGTATAAC 


CTTGAGGAGT 


TAGATCCAGA 


TCCATTCATG 




GAAGCTTTGA 


ATGAGTATGC 


TTTGCCATGG 


GTTGTGGTTG 


AAAATCCACA 


AATGGTGGAC 


7 / O \J 


TAATGAACTT 


AGAACAAGTA 


CCAACACCAG 


CCTATGTTAT 


TGACTTGGCC 


AACTTAGAAG 




CTAATTGCCG 


CATTCTACAA 


TATGTACAAG 


AAGAGGCCGG 


TTCCAAGGTC 


TTGCTTGCCC 


oann 


AGAAGGCATA 


TTCCCTCTAC 


AAAACTTATC 


CCTTGATTAG 


CCACTATCTA TCAGGTACGA 




CAGCTAGTGG 


ACTCTATGAG 


GCCAAATTGG 


CAAGGGAAGA 


ATTTCCTGGT 


GAAGTCCATG 




TATTTCCGCC 


TGCTTTCAAG 


GATGCAGACT 


TGGAGGAATT 


GCTAGAGATA 


ATGGACCATA 


lUUoU 


TAGTCTTTAA 


CTCAGACACA 


CAGTTGCCTA 


AACACGGTCC 


GCGTTGTCCA 


GAGGCTGGTG 




TCAGTGTTGG 


TTTGCGCCTC 


AACCCTCAGT 


CTTCAACTCA 


AGGCAGATCA 


CGCGCTCTAT 


i n*> no 


GACCCTTGTC 


CACCAGGTTC 


TCGCTTTGGA 


CTTACTATAG 


ACAACATTCC 


GAGTGATTTG 




CTAGATTTGG 


TTGACGGACT 


TCATTTTCAT 


ACCCTTTGCG 


AGCAGGGAGC 


AGATGATTTA 




CAAACAACTT 


TGAAAGCACT 


AGAAGAACAG 


TTTCGTCCCT 


ACTTACATGA 


GGTAAAATGG 


10380 


CTCAATATGG 


GTGGTGGTCA 


TCATATTACA 


AGAGAAGGTT 


ACGATGTGGA TTTGCTGATT 


10440 


TCACAAATCA 


AGCGTATCCG 


AAAAACTTAC 


AATCTTGAAA 


TCTATATCGA 


GCCTGCTGAA 


10500 


CCCATTGCGC 


TTAATGCGGG 


TTATTTAGCA 


ACTGAGGTAT 


TAGATATTGT 


AGAAAACGGT 


10560 


ATGGAAATCT 


TGGTTTTAGA 


CGCCTCTGCG 


ACCTGCCATA 


TGCCTCATCT 


ACTTGAGATG 


10620 


CCCTATCGTC 


CACCTTTGAG 


1 AAATGGCTTT 


GAGTCACAGG 


AAAAAGCCCA 


TACCTACAGA 


10680 


CTTTCTTCTA 


ATACCTGTCT 


GACGGGCGAT 


GTGATTGGTG 


ATTATAGTTT 


TGAAAATCCA 


10740 


GTCCAAATCG 


GAGACAGACT 


TTATTTTCAA 


GACATGCCCA 


TTTATTCTTT 


TGTCAAAAAT 


10800 


AATACCTTTA 


ATGGTATTGG 


ATTGCCAAGT 


CTCTATCTCA 


TGGACGAACA 


GGGAGACTCT 


10860 


AGCTTACTCA 


AAGCTTTTGG 


CTATCAAGAC 


TTTAAAGGGA 


GATTATCATG 


ATGGACAGTC 


10920 


CAAAAAAATT 


ACGCTATCAC 


ATGCCAGCAG 


AGTACGAACC 


CCATCATGCT 


ACCCTCATGA 


10980 


TATGGCCCAC 


TCGACCAGGA 


TCATGGCCTT 


TTCAAGGAAA 


GGCTGCTAAA 


AGAGCATTTA 


11040 



WO 98/18931 ~ PCT/US97/19588 



600 



CTCAGATTAT 


CGAGACCATA 


GCAGAAGGGG 


AAAGAGTCTA 


TCTTTTGGTG 


GAGCAGGCCT 


11100 


ATCTATCTGA 


AGCCCAATCC 


TATCTTGGAG 


ACAAGGTTGT 


TTATTTAGAC 


ATTCCCACCA 


11160 


ATGATGCCTG 


GGCGCCTGAT 


ACTGGCCCAA 


CCATTCTCGT CAATGATAAA 


GGTAAGAAAT 


11220 


TAGCCGTGGA 


TTGGGCCTTC 


AATGCTTGGG 


GAGGCACCTA 


TGATCGTCTT 


TATCAAGATT 


11280 


ATGAAGAGGA 


TGACCAAGTA 


GCCAGTCGTT 


TTGCTGAGCC 


CTTGGAAAGG 


CCTCTCTATG 


11340 


ATGCTAAACC 


TTTTGTACTG 


GAAGGAGGCG 


CAATCCATAG 


CGATGCTCAA 


GGAACTATTC 


11400 


TCGTAACTGA 


AAGTTGCTTG 


CTTAGTCCTG 


GTCGCAATCC 


TAACTTGACT 


AAAGAGGAGA 


11460 


TTGAAAACAC 


ATTATTAGAA 


AGTCTTGGTG 


CTGAAAAAGT 


TATTTGGCTT 


CCTTATGGTA 


11520 


TTTATCAGGA 


TGAAACCAAT 


GAACACGTCG 


ATAATGTTGC 


TGCCTTTGTT 


GGTCCTCCTG 


11560 


AGCTTGTTTT 


GGCTTGGACA 


GATGACGAAA 


ATGATCCCCA 


GTATGCCATG 


TCAAAAGCAG 


11640 


ATCTCGAACT 


CTTAGAACAG 


GAAACAGATG 


CAAAACCTTG 


TCACTTCACC 


ATTCATAAAT 


11700 


TGCCTATCCC 


TGCAGTTCGA 


CAAGTTGTGA 


CAGAAGAAGA 


TTTGCCAGGC 


TACATCTATG 


11760 


AAGAAGGAGA 


AGAAAAGCGA 


TACGCAGGTG 


AACGACTAGC 


AGCTTCCTAC 


GTAAACTTTT 


11820 


ATATCGCCAA 


CAAGGCTCTC 


TTGGTTCCAC 


AGTTTCAGGA 


TGT AAACGAC 


CAAGTGGCCT 


11880 


TAGATATCCT 


CAGCAAGTGT 


TTCCCAGACC 


GTAAAGTTGT 


CGGAATACCA 


GCCAGAGATA 


11940 


TTCTCTTACG 


TGGTGGCAAT 


ATCCACTGTA 


TCACCCAACA 


AATTCCACAA 


TAGGAGAAAA 


12000 


AGATGAGAAA 


TGTAAGAGTT 


GCAACCATTC 


AGATGCAATG 


CGCTAACGAT 


CTGGCAACAA 


12060 


ATATCCAAAC 


CGCAGAGCGT 


TTAGTACGTC 


AGGCTGCTGA 


GCAAGGAGCC 


CAAATTATTC 


12120 


TCTTGCCCGA 


GTTGTTTGAA 


CATCCCTATT 


TCTGTCAGGA 


ACGTCAGTAT 


GACTACTACC 


12180 


AGTATGCCCA 


ATCTGTAGCG 


GAAAATACTG 


CCATTCAGCA 


TTTTAAGGTG 


ATTGCTAAGG 


12240 


AACTACAACT 


TGTTTTACCA 


ATCAGTTTCT 


ATGAAAAAGA 


TGCTAATGTC 


TTGTATAACT 


12300 


CTATTGCCGT 


CATTGATGCA 


GATGGGGAAG 


TGCTGGGCCT 


TTATCGAAAG 


ACCCATATAC 


12360 


CACATGACCA 


TTATTATCAA 


GAAAAATTCT 


ATTTCACGCC 


TGGTAACACT 


CGTTTCAAGG 


12420 


TCTGGAATAC 


TCCCTATGCT 


AAGATTGGTA 


TCGGTATCTC 


TTGGGATCAA 


TGGTTCCCTG 


12480 


AAACAGCGCG 


CTGTCTTGCA 


TTGAATGGTG 


CTGAATTGCT 


CTTTTATCCT 


ACACCTATCG 


12540 


GTTCAGAGCC 


AATTTTGGAT 


ACAGATAGTT 


GTGGTCACTC 


GCAACGTACT 


ATGCAACGGC 


12600 


ACGCAGCAGC 


GAATATTGTT 


CCAGTCATCG 


CAGCCAATCG 


TTATGGTTTA 


GAGGAGGTTA 


12660 


CTCCTAGTGA 


GGAAAATGGC 


GGACAGAGCT 


CCAGTCTTGA 


CTTCTACGGT 


TCCTCCTTTA 


12720 


TGACGGATGA 


AACAGGAGCT 


ATT CT AG AAC 


GAGCTGAAAG 


ACAAGAAGAA 


k GCTGTTCTGT 


12780 


TAGCTACTTA 


TGACCTAGAC 


AAGGGAGCAA 


, GTGAACGCCT 


AAACTGGGGC 


: TTGTTTCGAG 


12840 
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ATAGAAGACC AGAAATCTAT AGACAAATTA CAGATTAGTG 
TTCTGCTAGA CTAACTTCTT ATTACTAACT AT AAGAT ACT 
TTTTATGATT CGCTATTCTT GTCTATTGAT TACTCCCTAT 
GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT 
GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGCTAACGC 
TAATCGCTGG CATTTCCCAT TGCAATCCCA ACCCCTGCAA 
TTATTAGCAT CGCCCATGCC CATAATCTCT GAGGAATCAA 
CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG 
GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA 
G-m V ATCTT TTGTTCCTAA ACACATACCT TGGAACATCC 
TCTTCAAGAG AAATTTCACT CAGGTCTGAA AATACTACTT 
TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCCT 
TCACTCTTTT CAGCAAGGTC ATACACCTAT TCGATGTCAG 
TCAACTAGAC TCCAATCACT GGTCTGGTGA CTTGAACAAC 
TCGTTCTGGA GGTC AAGCTC CACTTTTTTG TAGTAGGGCA 
CCCGTACAGA GAACCAGTTT GACACCTTTT TCAATGGCTT 
CCTTGTGGGA TTTCCTTGGC TTCATTGAGG ACCGTGCCCT 
TTAATCATAG GTCTTCCTCT TTATCTTTCC TATTATTATA 
TGATAGAAAG CTTGAGACTA ATTGATTTTA TACTTTAAGA 
ATTTGAACAC G AT ATTTCGC AAAGATATGC TATACTATGT 
CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTCTAA 
ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG 
AAATCTAACA AATGAAGAAT TAGAGCTGAT ACAAGCTGGA 
TCCTAATGGT AGGTACGATT GGCAAATAGA ACCAGTATTA 
TTGTCCCAGA GGCACCTATC ATTCACGATA TATTGGAGGA 
AAGTGCTGCG ACATTTTAAG TAAAATTTAT TAGGAATATG 
ACAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC 
TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG 
AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG 



TGGCAGAAAT 
ATGGCATCTA 
TTTAAAATAT 
ATAGAATATA 
CATCCGCAAG 
ACTGGACCAT 
TCTTCAAAAT 
CTTCATAAAT 
AACGCTGCTC 
GGAACTTTCC 
TAGCATCATT 
CAAAAAGTCT 
CTGGACTCAG 
CGTTGTTAAC 
GGACACCGAA 
TGTGAATAGC 
CCATATCCAA 
GCATATTTTG 
TGTTTTGATG 
TTGTCAATGT 
GTAGGTATGA 
ATTTTAATGA 
GCAGATCCAT 
ACTCTGCTGC 
GGTAATCATC 
AAGAAACAAG 
TAGCATTGTC 
CAAAGCCCTA 
GACTTCTTTT 



GAGAGATTCA 
GTAAATCGAT 
TAGCAAAAAA 
TTTCTCAATA 
AGATTTGATA 
TTCCATATCG 
CTCAGCTAGT 
GACAGGCTGC 
AAAATCGTCT 
ACTAGTCGCT 
TTCAATAACT 
CAACTGAACA 
TTCTTTCCAG 
AATAATATAT 
AAGGGGGCGA 
ACTAATGTGT 
GGCTAGTAGT 
GAGAAGAAAT 
AC AATTC ATG 
TCCAACTAGA 
CT AG CAGATT 
CAAAAAATAT 
ATGGTAAAAA 
TTCATGGATT 
TTTGCAAAGG 
GGGAGAAAAC 
CTTAATTTTT 
TCTACCAGCA 
CTTAGCGGTC 



12900 

12960 

13020 

13060 

13140 

13200 

13260 

13320 

13380 

13440 

13S0O 

13S60 

13620 

13660 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 
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ATCTATCTAC TGCTTTTTCC AGGAAAGAAA ATTTTTCATT TCAACTCGCA GCTGAGGTAC 14 640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT CATGGGGAGA 14760 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTC 14820 

ATTTATCGAG GtGTCCTTAT GACAACCTGT TCCAAAAACT CACCTTGGTA CC 14872 
(21 INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10223 base pairs 

(B) TYPE: nucleic acid 
IC) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi» SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



1 in* IA* 




CAATCTGGTC 


GCTATGGTCA 


AATCCAGTTG 


GAAAATCCAT 


60 


TCTTCTTGGA 


GCCATCTGCT 


GGATTGCCAT 


CATCCTCACC 


ACTCTTGGTA 


TGCACACCCT 


120 


TATCGGCATT 


TTCTAATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAACG 


TCGCCTTGCC 


180 


CTAGGTATAT 


GTTACTGACT 


TCGTCAGTTC 


TATCTGCAAC 


CTCAAAACGG 


TGTTTGAGCT 


240 


GACTTCGTCA 


CTTCTATCTG 


CAACCTCAAA 


ACGCTGTTTT 


CAGCTGACTT 


CCTCACTCGT 


300 


ATCTACAACC 


TCAAAACAGT 


GTTTTGAGCT 


GACTTCGTCA 


CTTCTATCTG 


CAACCTCAAA 


3 60 


ACAGTGTTTT 


GAGCAGCCCG 


TCGCTACTTT 


CCTAGTTTGC 


TCTTTGATTT 


TCATTGAGTA 


420 


TAACACAAAA 


GGTAGCCCAT 


CAGCTACCTT 


TTTCTTATGC 


TTCCTCAATC 


AAGCGACTAT 


480 


GTTCTCTCTT 


GATACAGCGA 


TTCATCACGA 


TATCATCACA 


TCCACCATCA 


CGCAAAATCT 


540 


CTTTCGCTTC 


TAAACTTTCA 


AGTCCTACCT 


GTGCCCAAAA 


AATCTTGGCA 


TCAGCTTTCA 


600 


GAAAATCACG 


CGCCACATCG 


GGCAGAAATT 


CACTGCGACG 


ATAAACATTG 


ACAATATCTA 


660 


CAGCAAAAGG 


AATTTCAGCG 


AGGCTAGCAT 


AAGCCTTTTC 


ACCCAAGATT 


TCCCCACCTG 


720 


CCGCCTTGGG 


ATTGACTGGG 


ATGATTTTAT 


AGCCCCGAGC 


CTGCATTTCC 


TTTGTTACTC 


780 


GATTCCTGGT 


TGTTTCTTCA 


CGCTCAGACA 


AACCCACCAC 


AGCAAGGGTT 


TTACTCGTTG 


840 


CGAGATACTG 


ACGAATCACG 


CCATCACTTG 


GATTGATAAA 


TTCTTGACTC 


ATAGAAATCC 


900 


TCCTTTTTCA 


TCAGTATAGC 


ACATTTTGAA 


AAGGTTTGCA 


GAATTATACT 


ACAAAAAAGG 


960 


AGGACTAGCC 


CCCTTTTTAT 


TTAGCCTCGT 


ACCAGGTTGC 


CCCTTCATTC 


TCATCTGCGA 


1020 


TAACAGGAAC 


ACTGAGTTGA 


ATGGCTTCTT 


CCATCGTTTG 


TTTCACCAAT 


TTTTTCATCT 


1080 


CTACCAATTC 


AGATTTAGGC 


ACTTCAAGGA 


CGATTTCATC 


GTGCACTTCT 


AACAGCATCT 


1140 
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TACTCTCATA ACCACCTGCA ACCAAGGCTT TATCCACCTC AATCATGCCA ATCTTGACAA 
TATCTGCTCC CGAAC CCTGG ATAGGTGAGT TGATAGCACT TCGCTCCGCA AAACCACGAA 
TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 
CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGCAA 
AACGTTCAAA GTACCTATCA ATCTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 
TAGACAAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 
GACGGTCGTT TGCACTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 
AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA CTGCTCATCC TTAGAAATAT 
CCGCCAAAAC GCGCAATTCA ATCTGTGAAT ACTCAGAGCT GAGTAGCACA CTATCCTCCC 
ACTCTCGCAC AAAACCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 
CCAAGTTTGG ATCCACACTA GACAAACGCC CCGTCTGGGT CAAATCCTGC ACATAGCGAG 
T ATG AATCTT TCCATCAGCC AAAATCCAGT CCTCCAAGCC AATTACATAA GTAGATTCAA 
TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGACCAATA GGAGCCAGAC 
CCTCTAAAAC ATCCACTGCT GTCGAATAAC CTCTCTTGCT TTTCTTAGTG TATTCTAGAG 
GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTCCTT AGGCGACTTG AC AT T AAACT 
CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAACC TCATTTTCAG 
CCTGCATCTC AAGCAAGCTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 
CGACAAAAGC CAGAGCTTGC TCCATATCAT AAACAAGCTC TAATTGCCCA TTTTCGCTGA 
GTTTTTCAAG TAAAATAGCC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 
CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCGTAG AAAGTTTCAT 
CATCAACCAA GTAACTCTGA CCATAAAGAC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 
CACTCGAAAG GAGGTATTTA GCCAAACGCA TCTCAAAAGC AGGCGCCTGC AAATCCACAC 
CAAAACGTTG CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATCTTTTTT 
CTAAGAAATC CTTGAAAATC GGGTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 
TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGCTAATTC TCACCAAAAA 
GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTCG TCAACAATAG 
TAAAATCCAA ACTCTCAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTCTT 
TGAAGCCCAT CTCATCGTAG AATTTCCCAA G ATTTT C AAC ATCTGGACCA CTATAGACCA 
AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAAT GGTCGCTAGT GTTTTAGACA 



1200 

1260 

1320 

1390 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2820 

2880 
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AAAAGGCCTG 


TTCCTTGTCA 


TTGATGAGAT 


TTTCCTTCAT 


CTTAGAAGTC 


TTCATTCCAT 


2940 


CAATATTTTC 


ATAAATCCCC 


TCAAGCGAAC 


CATGCTCCAG 


CAAGAGCTTA 


ATACCCGTCT 


3000 


TTTCACCGAC 


TTTGGTCACC 


CCAGGGATAT 


TATCCGACTT 


ATCACCCATG 


AGCGCCTTGA 


3060 


GATCGATAAA 


CTGAGCTGGT 


GTGAGGCCCA 


TTTCTTCCAT 


GAGGTAATCT 


GGCCTAAAGG 


3X20 


CCTCAAACTC 


AGCCACACCT 


TTCTTGGAAA 


TTTCAACCAC 


CGTATGCTCA 


TCCGTCAGCT 


3180 


GAATCAAATC 


CTTGTCCCCA 


CTGACAATAG 


TAATATCAAA 


ACCATCCTGC 


TCTGCTAGCT 


3240 


TATCCACCGT 


CCCAATGATG 


TCATCCGCCT 


CATACTGAGC 


CAGATCATAG 


TGACGAATCC 


3300 


CCATATCATC 


CAGCAACTCA 


CGAATGAAAG 


GAAATTGCTC 


ACGAAACTCA 


TCAGGACTCT 


3360 


TGGCCCGACC 


ACCCTTATAG 


TCCGCATACA 


TCTCTGTCCG 


GAACGTCCTC 


TTTCCCGCAT 


3420 


CAAAAGCCAC 


CAAAATATCA 


CTCGGCTCAA 


CCCGCTCCAA 


TAAATGACTC 


AACATCAACT 


3480 


GAAAACCATA 


AATCGCATTG 


CTATGCAAAC 


CAGCCACATT 


CTTAAAACCG 


TCCAACTGCT 


3540 


GATACAGCGC 


AAAAAACGCC 


CGAAAAGCTA 


CAGAAGACCC 


ATCAATCAAT 


AATAATTTTT 


3600 


TCTTATCCAT 


ACACCCATTA 


TAAACGAAAG 


AATCAAAAAA 


TACCATTGGG 


AAGAGCTAGA 


3660 


GCAAGTATTT 


TTCAAAC1 TT 


TTCCGAATAA 


ATAGATAGAG 


CCAGAGAATT 


TAGTAAACCT 


3720 


AGATTTAAAA 


ATGTGCTATA 


ATATAGTATA 


TTGAATCTAT 


AA T ACT AC AC 


CTTGACTGCT 


3780 


AAAATATTTC 


TATAAATTAA 


TTTGACTTTC 


CTGATAGAGT 


TATTCACATC 


TTATTTCAAC 


3840 


TCACTATACA 


AGGAGGAATA 


GGAGGATTCT 


CAGACATCCG 


GGCATCAGCC 


CAACTAATGA 


3900 


TTTGATTGCT 


AAGAAAATAT 


TCAGCAATCC 


AGAAATCACT 


TGTCAATTTA 


TTCGCGATAT 


3960 


GCTGCACTTC 


CCAGCAAAAA 


ATGTCACCAT 


TTTGGACGGA 


AGCGATATTC 


AC GT ATT ACT 


4020 


CTCCATGCCT 


TACTCGGTGC 


AGGATTTTTA 


TACCAGTATA 


GACCTCTTGG 


CGGACTTGGA 


4060 


TAACGGTACT 


CAAGTAATTA 


TTGAGATTCA 


AGTCCATCAT 


CAGAATTTTT 


TCATCAATCA 


4140 


CTTGTGGGCT 


TACCTGTCCA 


CTCAGGTTAA 


TCAAAATCTT 


GAAAAAATTC 


CTCAGCGAGA 


4200 


AGGTGATACT 


C ACT AGAGCT 


ACAAACACAT 


CGCTCCTGTT 


TACGCCATTG 


CTATCCTGGA 


4260 


TAGTAATTAT 


TTCTCAGATG 


ACCTGGCTTT 


TCATAGCTTT 


AGTATCCGCG 


AAGACACAAC 


4320 


AGGTGAGGTA 


TTGGCGATTA 


CCAACAATGG 


ACAGGAAAAC 


CATCTGGTTA 


AGATGGCATT 


4380 


CTTGGAATTA 


AAAAATACAC 


AGAAACCACC 


AAAGACAAGG 


TTCGCAAGCC 


A7CCTTGCAG 


4440 


TTTTTCGGCA 


ACAAGCCCTT 


TACCCAGCAA 


CCGCAACGAG 


CCATTACCCA 


AGCAAATCAA 


4500 


CTGCTGCACT 


ACAAGAGCTG 


GTCCGAGGAG 


GACAGGAAAA 


TGTTTAGTCA 


ACTACATATG 


4560 


CGAGAAGAAC 


AAGTCTTGTT 


AGCACAGGAC 


TATGCCTTGG 


AAACTGCTAG 


GGCTGAAGGC 


4620 


CTTGAACAAG 


GACTAGAGCG 


TGGGAAAGTT 


GAAGGAAGGG 


CAGAAAGGAA 


ACTTTTTGCC 


4680 
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TTCCTAGACA TAGTACCCCA AGCTCTTCTC ACTTCTCACG TTGCCAGCCA CCAATTAGGT 4740 

ATCTCACTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4 800 

AAATCCCCTA TGG AT ATT AT GGACCCTATT TTGTGTAGAA AAAAAGTCCC AT ATGACCTA 4860 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4 920 

ACAAAATTAC TAGACATTAA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 4 980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTCCCC TGAGTGCGGA S040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5X00 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCGATTCA AGTGCTATCA CTGTTCAAAA 5160 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 5220 

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 5280 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG CTCATTAAAT TTTTTTCTCA ATATCGAAAA 5340 

« GTGGAGAAAG TGGTCGTTTT TCATGAATAC GTACGATAGC ATCCCCTACG AGATGAGCGA 5400 

TTCAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CAGATACATG GT AT CCAAAA 5460 

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGCACCA GAAAGAACTG 5520 

GGTGCGTACA CCTTGCATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTCCCCCAT 5580 

GACAAATCGT TCCACCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 5640 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACCCTA CGACCTTTAT 5700 

CAATAATAGC CATAGATGTT TTCAAAAATT CTCCCAACTT ACGAGCACCA GTCACCCCTC 57 60 

CATGGTCCGG GCTCACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 5820 

CTGCAATCAG AGCAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 5940 

CGACAAGTTT TGAAGTCATT GGCTCACGCG CTCTCGCCTT TCTATCCTGA CGTGCATACC 6000 

CATAGTAAGG CATCACAACA TTGACAGATT CTGCACTCGC ACGCTTCAAA GCATCTACCA 6060 

TAATCAAAAT TTCAAGCAGA TTCTCATTTA CAGGCCAACT AGTTGATTGT AAGATAAAGA 6120 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTCAAT CTCTCCATCT GAAAATTGCC 6180 

GAACACTTGA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 6240 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

ATGTATAACT T G T G CTTTTC ACAAGATTTT CCATCTACCA TTGTAGCGCT TTTTGCACTA 63 60 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 6420 
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AAATATTCTA CGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TCATAAAAAC 6480 

CATAGAGAGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6540 

CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAACGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

CTTTCTTGCC AACCTTCTGT CCTTTATAGG CTCTGTAAAG CAAGTAAACA CCTACTACAC 6660 

CTACACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 6720 

AAATAGTAAA GTCAAACCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 6780 

TTTGCAATCC CAAACCCCAA CCAATTCCAA GTAAAAACAA ACATTTTAAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAACTGACA ATAATTTCAA GGCTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGCACTATC TGGCAAAAGA ACCCCCAGTA 6960 

AATCATGGAT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7020 

ATAAAGACAG AATCAAAACC TTCTTTGCCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7080 

ATAAAGAGCC GGAATCATGT AAAAGAGAGC TAGAAACATA GAAACTCCCA TTAGTCCATA 7140 

TTCCGCACCT GACCTCGAAC CGTCCGTATA CTACATCGTT TCATACTGTA AACCAATACA 7200 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC ATACACTTTC TTTCTAAATG 7260 

AACTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA AC ATT GT AAA TGGCACCAGA 7320 

GATAATGACA TCGTCTCGTA TTACGGGAGA CTCGATAAGC AGTTGCATAT CCTCGCGTAC 7 380 

ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT GACACATCCA CACCCAATTC 7440 

TTCCTTCAAA TACTCCTGAA AAGCTTCATT TTCAAAGGTC TGACCACCAC ACTCTGTATG 7500 

ATGCAATACC ACAATTTCTC TTCTCCCCAT TTGTTGCTGG GAAATAACTA GACAACGAAT 7560 

CATATCCTCA CTCACTCGAC CACCTGCATT CCCCAAAATA TGAGCATCCC CAAGTGCCAA 7620 

ACCTAGAGCT TGCGCAACGT GCAAACGTGA CTCCATACAC GTCACAATGG CTACTCTGGT 7680 

TTTAGGTTTA AGTCGCACAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGCCTTG 7740 

CATAAACTCT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 7800 

TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7860 

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7920 

GGAATTCTTA GGTACATAAA TCTTACTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 7980 

CTCAATACGA TTCACGCGCC CAATCTCTCC TGTCAAGCCC ACTTCTCCGA CAAAACATTC 8040 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 8100 

AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AGCCATAATC AAGCTAGCAC GGTTAAAATC 8220 
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AAGTC CTGTC 


CTAGTACGCT 


TGGCATTTCC 


AAACATCGTC 


GGTGTTACCA 


AAGCCTGAAC 


8280 




ATCGGACGCG 


TCCCTTCCAT 


GGTTACAACG 


ATGGAGGAAC 


CAGTCGCCCC 


8340 


ATPCAAACGC 


TCTTCTAGGA 


AAACTTGACT 


CGGATTGAGT 


ACCTCAACCA 


AGCCGCCCGA 


8400 




AAAATCCCAA 


TCTCATTAGT 


GGAACCAAAA 


CGATTTTTGA 


CCGCTCTCAA 


8460 




GTGTGGTGAC 


GCTCCCCTTC 


AAAGTAAAGC 


ACCCTATCCA 


CCATATGCTC 


8520 




GGCCCAGCCA. 

V W W w 


AGGTTCCTTC 


TTTGGTCACA 


TGACCTACGA 


TAAAGATGGC 


8580 






ACTGCATGAG 


TTCAGCGGTC 


ACTTCACGCA 


CCTCAGAAAC 


8640 




ArCCCTCAAA 


TCTCAGGAGA 


CATGATGGTC 


TGGATGGAAT 


CAATAATGAG 


8700 






CCACTTC TGC 


ACCAACACTC 


TGCATATTGG 


TCTCTGCATA 


8760 


uAGATAAAAL 




TATCACCTAA 


GCGCTCTGCA 


CCTAGTTTAA 


TCTCCTGCGC 


8820 






AGAGAACTGT 


CCCCACTTGG 


GACAACTGGC 


TTGAGACTTG 


8880 


TAGuACAAbA 


O 1 1 i 1 


CAATCCCAGG 


ATCCCCACCG 


ATAACCACCA 


CACTTCCTGG 


8940 


TACCACTCCG 


CUiWCAAUL « 




TTCCTCCATC 


TCCGTCTTGG 


TTCGATTCAC 


9000 


ATTGATGOAA 




CTAGTTTCAT 


GGGCTTGGTT 


TTCTCACCTC 


TCAAGGACAC 


9060 


ACGCCLAiT t. 


l 1 An^. 1 1 VjO 


C AACCTC AAC 


CTCTTCCACA 


AAAGAACACC 


AAGACCCACA 


9120 


GTTGGGGL.AA 




ATTTAGGGGA 


ATTATACCCA 


CAATTTTGAC 


ATACAAATGT 


9X80 


CGCTTTTTTC 


Ill wWun I Vj#% 


CAAACCTCTT 


TCTATATCTC 


TAACTCACAC 


TCAATCACTT 


9240 


wGCAAAAATU 


& ITfTTPTrA 
nA 1 L l 


TTTGGCACAA 


ACTGGCGCAT 


GAGCATTCCA 


TCAGCAACAA 


9300 






CGATACTTAG 


ACATACATTC 


TAGAAACCGA 


GACTTCATTT 


9360 




CTCATATTGA 


ATAGCACTAT 


TAGGAAGCAA 


CTCCCCCTTG 


TTTTCTAAAA 


9420 




A £1 TGTTT C A 

* w * 4 * 


AAGTTTTCTA 


TTCCTGTTTT 


ATAGACCTGC 


CATTCATGTA 


9480 


ATAAACGCTC 


TACTCTTAAA 


GGAAGACCCG 


TAGCACAGAC 


CAGATACCAA 


CCCGTTTCTA 


9S40 


AAGCTCTTGT 


GACTGCAGAA 


GATACGATTA 


TTTCAGCTGA 


CGAGAGTAAA 


GGATTTTTGC 


9600 


TCAATTTCTG 


GACTTGCTGC 


CCTCCCATCT 


CAGACAAGGG 


TGCCAAATCT 


ATCCCAAATC 


9660 


CTATATAAGA 


ACGCTCCTCT 


AACTCACGCT 


AATCTGCCTC 


CCCATGACGT 


ACAAAGATAA 


9720 


TCTTCATTCT 


AGTGCCCTGT 


CGATCCAAAT CCACCAGTTC 


GAACGCCATC 


AGCTGCATCT 


9780 


CCATCTGCAA 


TTAAGAAAGT 


AGCAAAAACA 


GCCTGGACAA 


TACGCTCCCC 


AACTTCAAGA 


9840 


AC AACCT CTT 


GGTCTGTGAT 


ATTCTTCATC 


TGCGCAAAAA 


TATGCCCTTC 


ATTTCCAGGA 


9900 


TTTCCATAAT 


AATCCCCATC 


AATCACTCCA 


ACTGAGTTAA 


TTAAAACCAA 


GCCCTTCTTA 


9960 
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CCAGGATTTG AAGAACGATC ATAGAGGTAC ACAACCTCAG TCGGCTGCAT ATAAGCCTTA 10020 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCCTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CTAGATTATT CTATACTTAT TTA 10223 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 
IB) TYPE: nucleic acid 
<C> STRAND EDN ESS : double 
(D) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCCGCG CCTTGTCTTG CTTGCCATCG CTACACCAAC TATCTCATCC 60 

GACGAAACTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG AT AC ACC AG C TGGCTCTGCT AAGAACGAAA GTCATATTTC TTCACCTGGA 180 

AATGCAAACC CTTCCCTACA CAAAACAGAA CAAAAACCTG CTGCAAGCCC AGCCGATCCA 240 

GCACCACAAA CTGGACAAGA TCCTTCAAGT GACCCAACTA CTTCTACTAG TCCACTAACA 300 

ACTGAAACTA AGCCAGAAGA GCCCATCGAA GATAACTACT TCCCTATCCA TGTCAAAAAA 360 

CTTCCTGAAG AAAACAAGGA TCCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 420 

TCTGAAAACT GGCCAAACCG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACCCC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAC AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG G T T TTCTCTT ACGAGCCACA GCCTCCACGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 720 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 780 

ACAGGCAAAT ATCGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840 

TTTTTATTAC TAGATGAGAG CAAACAAGCA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 960 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 1020 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCCGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCCAT 114 0 
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CAAGCTGGTA 


AGAAAGTCAC 


CTACAGCGGA 




acapaaaaca 


TCCTTATACT 


1200 


GTTAGCTACA 


ATTCCGACCA 


ATTCACTACC 






AGATGAGACA 


1260 


TACAGCTATG 


ATGGCAAACT 


GGGAGCTGAC 


a a a acs a Ac? 


AAfWAAAACA 


AGTTGATTTG 


1320 


ACCCTTTGGT 


CACCAAGTGC 


TGATAAGGTT 




TTT AmA{~AA 


GAATGACCCT 


1380 


GACAAAGTAG 


TTGGAACTGT 


CGCTCTTGAA 






CAAACAAACT 


1440 


CTAGACACCA 


CAAACAAACT 


CGGAATCACA 


CAl TiCALlTj 


<A.TAV. 1A1 in 




1500 


ATCGAGCGTC 


AAGGTAAAAC 


TGTTCTTGCA 


CTCGATCCTT 






1560 


TGGAATAGCG 


ACGATTCCAA 


GATTCACGAT 


GCCCATAAAG 


TGGCTAAAW. 


CGCCTTTGTA 


1620 


GATCCAGCTA 


AACTCGGACC 


TCAAGACTTG 


ACTTATGGTA 


ft ft WMHI- ft ^ fc. ft 

AGATTCACAA 


I 1 i\AAUAl. 1 


1680 


CGTGAAGACC 


CCGTTATCTA 


CGAAGCTCAT 


GTGCGTGATT 


TCACTTCAuA 




1740 


GCAAAAGACT 


TGACCAAACC 


ATTTGGGACT 


TTTGAAGCCT 


TCATTGAAAA 


ALlAbALTAi 


1800 


CTCAAAGACT 


TGGCTGTAAC 


CCATATCCAG 


CTCCTTCCAG 


TCTTGTCTTA 




1 RAO 


AATGAATTGA 


AAAACCATGA 


ACGCTTGTCT 


GACTACGCTT 


CAAGCAACAG 






TGGGGATATG 


ACCCTCAAAA 


CTACTTCTCC 


TTGACTGGTA 


TCTACTCAAG 


(AiATLV. 1 Anu 


1 980 


AATCCAGAAA 


AACGAATCGC 


AG AATTT AAA 


AACCTCATCA 


ACGAAATCCA 


CAAAtCTtitiT 


2 04 0 


ATGGGAGCTA 


TCCTAGATGT 


CCTTTATAAC 


CACACAGCCA 


AAGTCGATCT 


CTTTGAAGAT 


2 100 


TTGGAACCAA 


ACTACTACCA 


CTTT ATGG AT 


GCCCATGGCA 


CACCTCGAAC 


TAGCTTTGGT 


2160 


GGTGGACCCT 


TGGGGACAAC 


CCACCATATG 


ACCAAACGCC 


TCCTAATTGA 


CTCTATCAAA 


2 220 


TACCTAGTTG 


ATACCTACAA 


AGTGGATGGC 


TTCCGTTTCG 


ATATGATGGG 


AGACCATGAC 


2280 


GCCCCTTCTA 


TCCAAGAAGC 


TTACAAGGCT 


CCACGCCCCC 


TCAATCCAAA 


CCTCATCATG 


2340 


CTTGGTGAAG 


GTTGCAGAAC 


CTATGCCGGT 


CATGAAAACA 


TGCCTACTAA 


AGCTGCTCAC 


2400 


CAAGATTGGA 


TGAAACATAC 


CGATACTGTC 


GCTGTCTTTT 


CAGATGACAT 


CCGTAACAAC 


2460 


CTCAAATCTG 


GTTATCCAAA 


CGAAGGTCAA 


CCTGCCTT T A TCACAGCTGG CAAGCGTGAT 


2520 


CTCAACACCA 


TCTTTAAAAA 


TCTCATTGCT 


CAACCAACTA 


ACTTTGAAGC 


TGACAGCCCT 


2580 


CGAGATCTCA 


TCCAATACAT 


CGCAGCCCAT 


GATAACTTGA 


CCCTCTTTGA 


CATCATTGCC 


2640 


CAGTCTATCA 


AAAAAGACCC 


AAGCAAGGCT 


GAGAACTATG 


CTGAAATCCA 


CCGTCCTTTA 


2700 


CGACTTGGAA 


ATCTCATGGT 


CTTGACAGCT 


CAAGGAACTC 


CATTTATCCA 


CTCCGGTCAG 


2760 


CAATATGGAC 


GTACTAAACA 


ATTCCGTGAC 


CCAGCCTACA 


AGACTCCAGT 


AGCAGAGGAT 


2820 


AAGGTTCCAA 


ACAAATCTCA 


CTTGTTGCGT 


CATAAGGACG 


GCAACCCATT 


TGACTATCCT 


2880 
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TACTTCATCC ATCACTCTTA CGATTCTACT GATGCAGTCA ACAACTTTCA CTGGACTAAG 2940 

GCTACAGATG GTAAACCTTA TCCTGAAAAT GTCAACAGCC GTCACTATAT GAAAGGTTTC 3000 

ATTGCCCTTC GTCAATCTAC AGATGCCTTC CGACTTAAGA GTCTTCAAGA TATCAAAGAC 3060 

CGTGTCCACC TCATCACTGT CCCAGGCCAA AATGGTGTGG AAAAAGAGGA TGTAGTGATT 3120 

GGCTACCAAA TCACTGCTCC AAACGGCGAT ATCTACGCAG TCTTTGTCAA TGCGGATGAA J180 

AAAGCTCGCG AATTTAATTT GGGAACTGCC TTTGCACATC TAAGAAATGC GGAAGTTTTG 3240 

GCACATGAAA ACCAAGCAGG ACCAGTCGGA ATTGCCAACC CGAAAGGACT TGAATGGACT 3 300 

GAAAAAGGCT TGAAATTGAA TGCCCTTACA CCTACTGTTC TTCGAGTCTC TCAAAATGGA 3360 

ACTAGCCATG ACTCAACTCC AGAAGAGAAA CCAGACTCAA CCCCTTCCAA CCCTGAACAT 3420 

CAAAATGAAG CTTCTCACCC TGCACATCAA GACCCAGCTC CAGAAGCTAG ACCTGATTCT 3480 

ACTAAACCAG ATGCCAAACT ACCTGATCCG GAAAATAAAC CTAGCCAACC TACAGCTGAT 3540 

TCACAAGCTG AACAACCAGC ACAAGAACCA CAAGCATCAT CTGTAAAAGA AGCGCTTCGA 3600 

AACCAATCGG TAGAAAACTC TAGCAAGGAA AATATACCTG CAACCCCACA TAAACAACCT 3660 

GAACTTCCAA ATACAGGAAT CAAAAACGAA AACAAACTCC TATTTGCAGG AATCAGCCTC 3720 

CTTGCGCTCC TTGGTCTCGG TTT CTT ACT A AAAAATAAAA AAGAGAACTA AACTAGCCCT 3780 

CCTATAGAAA AATCCCCCAA GCATTATAGC TCGGGGGATT AATTTTTCTA CAATATTTGT 3840 

TGTCCTAATA AACTTGATTA GGATTTTTTA TTAAGCCTCT TTCATAGCAA AATAACCTCG 3900 

TACTTTGGGT GCAACTTGTG TTCCGAAGAG TTCAATAGCT CTCAGAACCT GGTCATGAGG 3960 

CATAGAACCA AGCGGTACAT GAAGCATGAA GCGGTCCAAT CCTAAATCCT CTATCATCCG 4020 

AATCAATTTT TCGGCCACCT GATCTGGATT GCCAACAAAC ATCCCGCCAT TTGGCCCTAC 4080 

CTGCTCCAAA TATTGCTCAT AACGCAATTC CTGCCAGTGC GGACGCTCTT TGGAAATAGC 4140 

ATCCACCACT TGCTTAGTCG GATGGAAATA ATCTTTCACC GCCTGCTCAC CATCTTCCGC 4 200 

AATCCACCCC CAAGAATGGG CTCCCACTTT CAAGTCTTTG TCAGCATGGC CCCTTCCCT7 4260 

CCAATCTCAC GATAAGCCTG AATCAACTTT TTAAAATAAC GTGGATTACC ACCAATAATA 4320 

GCATATACAA TCGGTAGACC AGCCTGAGCA ATCTTCACTG TTGATTCGAC ATGACCACCT 4 380 

GTAGCTATCC ACAAGGCCAA TTTGTCCTGA ACTGGACGAG GATAAACTTC TTTACCAGCA 4440 

ATCGTTTCAG TCAATCGACC TTGCCAGTCT AACTTGGTCT TTTCATTGAC TAACTGAAGC 4 500 

AAGTCTAATT TCTCATCAAA AAGAGAGTCG TAGTCTTTCA AGTCATAACC AAACAGAGGG 4560 

AAACATTCCG TGAAAGAGCC CCTTCCAGCC ATAATCTCCC ATCGTCCATT TGACAAAGCA 4620 

TCGATAGTGG CATACTGTTG GAACAAACGA ATCGGCTCCA TGCTTGACAG AATGCTGACT 4680 
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GCACTGCTCA 


AACGGATTTT 


CTTGGTATTG 


ACTGCCCCAG 






4740 


GCTGATACTG 


CAAAATCCCC 


CCGATCGTGC 


TCACCAATCU 


L.A IA1 ntft is. 




4800 


TTGTCAGCCA 


CCTCAATCTC 


TGCCACCAAC 


TGGCGAATUt 




ACTGTAAGTT 


4860 


TGTCCAGTCC 


CTTCAAGCTC 


CCTTATTTCC 


CCAAATGTTG 


AAATTCCCAA 


TTCTACCATT 


4920 


GTGATTCTCC 


TTATCTATCT 


CTGTACTTCA 


ATTTGAAAAA 


TTATTCTAAC 


ACGAATCTTw 




ACTACAAGCA 


ACCGATTTGC 


TCATTAGAAA 


AACCCTAGAT 


AACTAGACTT 


TTTTAGCTTA 


5040 


TTCTACCGTT 


ACTGACTTGG 


CAAGGTTACG 


TGGTTTGTCC 


ACATCGAGGC 


CACGGTGGAG 


c 1 no 


CGTTGCAAAG 


TAACCGACTA 


ATTGCGTTGG 


TACGACCATT 


GAAATTGGTG 


AGAGGTATGG 


!> lOU 


ATGTACGGTC 


GTAAGGACGA 


TATCGTCGCT 


ATCTTTGGCT 


ACATTCTCTT 


CTGCGATAGT 




GAGGACTTTG 


GCACCACGGG 


CTGCGACCTC 


TTGGATATTT 


CCACGAGTAT 


GATTGGCAAG 


c o an 
52oU 


AACTGGATCT 


GACAAGAGAG 


CCAAAACAGG 


CGTTCCTTCT 


TCAATCAAGG 


CAATGGTTCC 


5 J40 


GT GC T TG ACT 


TCTCCTGCAG 


CAAAGCCTTC 


ACACTGCATA 


TAAGAAATCT 


CTTTGAGTTT 


5400 


GAGACTTGCT 


TCCATGGCTA 


CGTAGTAATC 


TTGACCACGT 


CCGATGTAAA 


AGGCGTTACG 


5460 


AGTTGTTTCA 


AGAAGTTCAC 


GAACCTTGAC 


TTCAATGGTT 


TCTTTCTCTG 


AAAGAGTTGA 


5520 


TTCGATAGAC 


TGACCTACGA 


TTCACAATTC 


ATGAACC AGC 


TCAAAGGCTT 


GCCCTTTACC 


5560 


ATTACCATTT 


GCTTCTCCGA 


CTGCTTTTCC 


AAGGAAGGCA 


AGGGCTGCGA 


TTTGCGCTGT 


ft 


ATAGGCTTTA 


GTTGATGCCA 


CGGCAATTTC 


AGGACCTGCG 


TGAAGGAGCA 


TGGTATAGTT 


5'OU 


GGCTTCACCT 


CAGAGGGTTG 


AACCTGGAAC 


GTTTGTCACT 


CTTAAGCTTG 


ft ft WWIM ■ ft »•» 

GAATTCCCAT 


3 / DU 


TTCATTAGCC 


TTGACCAAAA 


CTTGACGACT 


ATCCGCTGTT 


TC AC CAG ATT 


GGCTGATAAA 




CATGAAGAGT 


GGTTTCTTGC 


TGAGAAGTGG 


CATACCGTAG 


C CCC ACTC AG 




300U 


AACTTCAACT 


CGTGTATCTG 


TCAATTCTTC 


CAACATTTTC 


■MB *k ft ft ft ft. 

TTAGAAGCAA 






GTAAGATCTT 


CCAGCTGCAA 


GGATGTACAT GCCGTCTGCG 


TCTTGAACAG 


CCTTAATGAT 


6000 


ATCTGGGTCT 


ACGACAACTT 


GACCAGCCTC 


ATCTGTGTAG 


GCTTGGATGA 


GTTTCCGCAT 


6060 


AACACTTGGT 


TGCTCGTCAA 


TTTCCTTGAG 


CATCTAGTAA 


GGGTAAGTTC 


CCTTAC CGAT 


6120 


ATCTGACAAG 


TCAAGTTCAG 


CAGTGTAGCT 


AGCACGCTCA 


CGACGATTTC 


CATCATAGTC 


6180 


TTGAACTTCC 


ACACTATCAG 


CCTTGACGAT 


TACCAACTCT 


TCGTCATGGA 


TTTCCATGTA 


6240 


TTGGTTAGTT 


TCACGAATCA 


TAGCCATGGC 


GTCTGAGCAG 


ACCATCTTAT 


AGCCTTCTCC 


6300 


AACACCAATC 


AAAAGTGGTG 


ATTTATTTTT 


AGCTACGTAG 


ATGACTTCAG 


GATCTTGTGA 


6360 


GTCAACCAAG 


GCAAAGGCAT 


AAGAACCACG 


GATGATGTGA 


AGGGCTTTTT 


TGAAGGCTTC 


6420 
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AACAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATCAACGG CTATTTCAGT 6480 

ATCTCTCTGC CCCTTGAAGT GCTGACCTGC AAGCTATTCT TCCTTGATTT CAAGATAGTT 6540 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TCTCCGATAC CA G TT G TT C C 6660 

CTCAACACCA GCTCTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 6720 

GTTATCAGCA CCATCTACGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 6780 

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA TATACGACAC ACGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAC CAAAAACACT ATATACTTCT TTCTTTCACT TCTCAACAGT 6960 

AAAAATTGGT ATAGTTCAAA TTAACCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 7020 

AATCAGTCAC AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080 

TGATTTCTTC AGACGACCTT CTTCCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 7200 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAACA TCACCTTTTA CAAAGTCAAT 7260 

ACGGTAATCA TCTTGAATCA TTCCA7CTTC ACCCAATTTT TCTTCCCCTT CAACACCCAT 7320 

ACCATTCTCA CTCAACATCC ACTCAATATT CCCATAATTT TCCTTGATAT TTTGGGCGAT 7380 

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGCTGA GAATTCATTT TACGTCCAGG 7440 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC ATAACACGCA AAGGTTCATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7560 

ATCACGAATG AGTTCCAACT CTTCCTCTGT ACCATCAGCT AAAACACCCT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGCC 7680 

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCACCA GGATGCTGGC TACGTGGATA 7740 

AGCCGCTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7800 

CTTAACAGCC CGGCTGCTGG CCAATTGTCT ATGATAGGCT ACCTTAACAC CTGCCTCTGC 7860 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7920 

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7980 

ATAGTCTTCA TAGGCTGAGA CTCTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 8040 

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CCAATTCCTT TAGCCTTAAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCGAAAGGCT GTGTGACCAG TCTCTAACAA 8220 
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AAGCTCAATA 


TCCCGCTCCC 


AATTTTCATA 


AAAACTCGAT 


GTCTTATCTG 


AACCAATCCC 


8280 


ATTATAGTAA 


CGATTTGGCT 


CCACTTGGAA CCAGTAATCC 


C AC AGATTGT 


CTCCCTTACC 


8340 


GTCACCAGCT 


ACACGTCCTT 


CTGTCTGCGG 


TCCAGAAGTA 


GAGGATCCCC 


AGACAAAATC 


8400 


CTTTGGAAAT 


CTTAGCATAC 


» ttt Afv ' TP T 

All A /\v— v» 


TTATCTACTC 


ATTTCTCCCA 


TT AT ACAGAA 


8460 


AAAACAAGGT 


AAAAACTACT 


Ta^aI lilt* 


CCTTGTTTTT 


CTTCTGATTA 


TAGTTTTTAT 


6520 


TTCTTGCTTA 


GGATTTCAAG 


CCT 1 l LnnuU 


ACjCTTATCTG 

AWNJ A A t* A W * ^* 


CATCAACCTC 


AATGGTGTCA 


8580 


CCACTTGCCT 


TGATCTTAAC 


TTtTAUnA 1 \f 


CCATCGGCCG 


LI 1 i 1 1 i /*WW 


AACAGTGATA 


8640 


CGGATTGGAA 


GACCAATCAA 


CTCACTATLu 


w A AAA w\n 


CACCCACACG 


TTCGTTACGG 


8*700 


TCATCTGTCA 


AGACTTCATA 


ACCAGCTtt. 


ATPAACCTTG 

A 1 LIWJV. A * VJ 


/*T*rpAAGTTT 

W> A A vnll\) AAA 


TTCTGTCAAG 


87 60 


GCTTGCGCTT 


CTTCATCCTT 


GACATTGACa 


V* 1 AA 1 w/wVA 1 


WvnLn *vwA*\« 


TGGTCCCAAT 

A \**» A V#W Wnn * 


8820 


TCTTTAGGGA 


AATTGATTCC 


CCAAGCGTAA 


w>U** 1 A 1 1 W/\l— 


1.1 1 i 1 WAiVI I 


TTTGTTAACA 

AAA W A A J M V W #» 


8880 


AAGAGGCCAG 


CGTGTTGCTC 


CATCACTGCT 


AAAuAnu AV. 




CATACCGTAA 


8940 


CATCCCATGA 


TCATTGCCAC 


AGCACGACuA 


til 1Wa1<>v>/\ 




AWWWnAY#W 1 A 


9000 


GCTGAATAGC CAGTTCCCAC 


I 1 J uAAnn I A 


tcaccgatct 


w AA • v,nv,« 


rcCAAACTTA 


9060 


AGGACACCTT 


GTCCATCTGG 


UOAAA I I lv.n 


CCCTCACGAA 

W W A W 




ATC C AC AT AT 


9120 


TCTCCAGTAA 


AATCACGGCC 


TGGGTTCAi.n 




ftrtT A£t*TC ATC 
V^w A A * w 


TTCGTTAGCA 


9180 


CCGACAACTG 


CATTGCGAAC 


A I\- 1 llil Ml— w 


TTACGATCTG 

A A #"»W V» * w » w 


A A T* A ATTTT 




9240 


GGCAAACCAA 


CTGGTCCAAG 


T*"! A APf* AAAT 


CCTGCTTGAA 


CAACATTCCC 


CACTTCTTCT 


9300 


TCGCTACCAA 


CGTCAAACAA 


A I w t w*. A WWW 


AAGTGATTTT 


TCAACTTGAC 


TTCGTTGAGT 


9360 


TGGTCATTTC 


CAACTAGAAG 


GGCTGCAACA 


ACCTCACLAT 


CTGCAATGTA 


GAAGAGGGTT 


9420 


TTAATCGTTT 


GTTCTTCTGG 


AACATTGAGG 


AAGGCTGCAA 


CTTCATCAAT 


TGATTTAACA 


9480 


TCTGGCGTTC 


CAACACGAGT 


AACTTCTTCT 


TCAGCGACAA 


CACGGTTGCT 


tggtttgtac 


9540 


TCGTTTGTTG 


CCATTTCTAA 


GTTAGCTGCA 


TAGCTAGACT 


CACTTGAGTA 


AGCAATGGTA 


9600 


TCTTCACCAG 


AGACTATCCA 


TTTGAGCAAT 


TCTGCCTTGA 


TTTCTTCTTG 


CACTTCTGCA 


9660 


GGAATTTCGT 


CAAATGAGGC 


AACTGACTTG 


TCCAAGACAA 


CCCAGCGGTC 


AAGGTCTGTA 


9720 


CGAGCAGATG 


TAATGGCCAT 


AAATTCTTGG 


CTATCCTTAC 


CACCCATGGC 


TCCACCGTCA 


9780 


CCAATAATAG 


CCTTGAAGTC 


TAAACCACTA 


CGAGTGAAAA 


TACGCTCATA 


GGCTGCTTTG 


9840 


TACTCATCAT 


AAACACTATC 


CAAACTATCA 


TAGTTAGCGT 


GG AAACT AT A 


AGC ATCCTTC 


9900 


ATGATAAACT 


CACGTGTACG 


AAGAAGTCCA 


TTACGCGGGC 


GTTTTTCATC 


ACGATACTTG 


9960 



[ 

0 
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GGCTGAATTT 


GATAAAGGTT 


GAGTGGCAAT 


TGCTTGTAAG 


ATTTAACAGA 


ATCACGGACA 


10020 


ATAGCTGTAA 


AGGTTTCTTC 


GTGAGTTGGA 


CCTAAGATAA 


AGTCTGATTT 


TTCACGGTTT 


10080 


TTTAGTTTCT 


AAAGCTCTTC 


ACCATAGGTT 


TCGTAACGAC 


CTGATTCACG 


CCACAATTCT 


10140 


GCACTAAGAA 


GGGCTGGAGC 


CAACATCTCA 


ACAGCACCAA 


TCTTTTCGAA 


TTCTTCGCGC 


10200 


ATGATGTTTT 


TAGCTTTTTC 


AATCACACCG 


TTCGCAAGTG 


GTAGATAAGA 


ATAAACACCT 


1O260 


GCTGAAACTT 


GGCCAACATA 


ACCAGCACGC 


AACATAAGAG 


CATGGCTGAT 


AACTTGAGCA 


10320 


TCGCTTGGCA 


TTTCGCGAAG 


CGTTGGGATA 


GGCATTTTAC 


TTTGTTTCAT 


AATATTCCTC 


10380 


GATTATCTAA 


AAAAGAGTCG 


CATAATGTCA 


TTCCAAGTCA 


CAGCAATCAT 


CAAGACAACC 


10440 


ATGATGACCA 


CTCCGGCCAA 


GCTGACATAG 


GTTTCAATTT 


CTTGTTTCAA 


TGGTTTGCGC 


10500 


CGGATGGCTT 


CTAGGATATT 


GAGCACAATC 


TTACCACCAT 


CCAAGGCTGG 


AATCGGAATA 


10560 


AGATTAAAAA 


TCCCAATATT 


GATCGAAATC 


ATTGCCAAGA 


AGTACAAGAT 


ATTT TCAATT 


10620 


CCATTTTTAG 


CAGCATCACT 


ACTTGCCTTA 


AAGATAGCAA 


CAGGTCCACC 


CAACTTGTTC 


10680 


AAATCTGGTT 


GGAAAATCAG 


ATTTTTCAGA 


GCTGAGAGAA 


TTCGGAGAGC 


TGAGTCAGCA 


10740 


CCAGTTGTAA 


AACCACCTAC 


AAACATGGAT 


AGAAAATCTG 


ACTTAACCCC 


CGGTTGAACA 


10800 


CCTAGAAGCT 


AACGACCTTG 


ACTATCTTTG 


GGTGTAACAG 


TCACTTGTTT 


CTCACTCCCC 


10860 


TTTTCAGAAA 


TAGTCACATC 


CAAACTCGCT 


GCCCTCTTAT 


CTTTGGTTTC 


TGTTTCCACA 


10920 


GCTTCGATCA 


AGCTTTCCCA 


GTTGCTAACC 


TCATCTGACC 


CAATCTTGGT 


AATTTGTGCC 


10980 


ATTTCTCCTA 


CTCCTACCTT 


GGCCAAGGCA 


CCTTGGGGCA 


TCATATGGAA 


CTGATTGGTA 


11040 


TCAACATCTC 


TCACACCACC 


CTGCATAAAG 


ATTAAAACCC 


AAAAAACAAC 


CACACCTAAG 


11100 


ATAAAATTGT 


TCATAGGACC 


TGCAAAATTG 


GTAATCAGTT 


TGCCCCAGAT 


AGTCGCATTT 


11160 


TCATATTGAA 


CATCTAAACG 


TGCAATCCCA 


ACCTCAGTAC 


CATCTGCTTC 


CACAACCGTT 


11220 


GCATCGTGAT 


CCACTGCAAA 


TGTTTTTTCT 


TCTTCCAGAA 


CCAATCCTTT 


GATAAAGAGC 


11280 


TTGTCTTCAA 


AATCAAACTG 


GGTCACCTGC 


ATAGGGAGGG 


CTGTTTGATC 


CAATTTTTIA 


11340 


CCTGAGAGAT 


TGATGCCTTT 


AACCTTACCA 


TCATCAGCAA 


GTGTCAAACT 


AACAGGCGTT 


l1400 


CCTCTCTTGA 


TTTCAGTTGT 


ATCATCACCC 


CAACCGGCCA 


TCCGGACATA 


GCCACCCAGA 


11460 


GGCAAGATTC 


CAATGGTATA 


GCCCGTTCCA 


TCCTTGCCAA 


TGTGAGCAAA 


AATTTTAGGT 


11520 


CCCATACCGA 


TGGCAAATTC 


ACGTACTAAA 


ATCCCTCATT 


TCTTGGCAAA 


GTAGAAGTGA 


11580 


CCG AACTCGT 


GCACCACTAC 


AATAATCCCG 


AAAACCAGAA 


TAAACGTTAA 


AATTCCGAGC 


11640 


ATAGCGTTTC 


CTCCGTCTTT 


TGATTAAAAG 


AGTCCAAATA 


AGTGCATGAT 


TGGAAATACA 


11700 


AGCAACATAC 


TATCGAAACG 


ATCCAAAACA 


CCACCATGTC 


CAGGGATAAA 


TTTCCCAGAA 


11760 
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TCCTTAACAC 


CAAAATGACG 


TTTGATCGAA 


CTTTCTAGTA 






11820 


ATGCTAAAGA 


AAATAGCAAA 


GACTGACATC 




CATATGGAAG 


AGCAACTGTA 


11880 


CTGTCAACTA 


TCATAAGGAT 


m m >»»MM>MM M ^*t* 

AAT\»GTTACT 




CTAAAATACC 


ACCCAAGGCA 


11940 


CCCTCAAGGG 


TTTTATTAGG 


CGATACCCTT 




TTCGTTTCCC 


ATAGTTCATC 


12000 


CCAACAAGAT 


AGGCACCACT 


MIMI M«B» MM M M 

GTCTCTCGCC 




ACAAGGCTAA 


GAGAGCCTTG 


12060 


TCCAAACCTG 


CAACACGACC 


% #*W*4 X MP % m & 

ATCTAGTAAA 


GC A™1 AAA 1 V_ 


CAAAGCCCAC 


GTAGAAGCTC 


12120 


ATAGCAAGAG 


GGAAAACCGC 


ATCCTCAATC 


GTATAAGACT 


TGCTAAAAAC 


GGTCGTTCCT 


12180 


AACATCATTG 


AAATCAAAAC 


ACTATAGGCA 


ACCACATTCv 


CATCAACTGG 


CAAAAAAGTC 


12240 


AGGTAATTCT 


CCAAGGGAAT 


GGTCAATGCA 


AAGGTTGCAA 


AGAGGCTCAA CAGGCCCTCC 


1 9 ion 


ATCGTCATGG 


TCTCTAGACC 


TCTCATCTTC 


AAAAGTTCAT 


GCATGGCTAG 


CATGGCTATG 


12360 


ATTCCGATTC 


CTATCTGAAG 


CAAGAGGCCC 


CCAATCATTA 


AAATTGGTAG 


GAAAATAGCC 




ACGGCAATCC 


CTGCAAACAA 


GGTTCTTTTC 


TCTAAATCCT 


GCCTCATATT 


TCCTCCTAAA 


i *5 a on 


CTCCTCCAAA 


TCGGCGATGA 


CGACGATTAT 


AGGCAAGAAT 


ACCTTCCTGC 


AAGGCCGCTT 




CGTCAAAATC 


AGGCCATAAG 


GTGTCCGTAA 


AATAAAGCTC 


ACT AT AGGCT 


CCCTGCCATC 


i *) Ann 


GAAGGAAATT 


GCTCAAACGT 


AATTCT CC AC 


TAGTACGGAT 


AATCAACTCT 


GGGTCTCGTA 


liOOU 


AGTCCTTAGG 


CAAATGCTGA 


GTAAAGAGAT 


AGTTACCAAT 


CAATTCCTCT 


CTGATCTCAC 


12720 


CTCGGTTGAT 


TTTGGCATCT 


AAAACATCCT 


GGGAAATCAA 


CTTAAGCGCC 


TGTCTAATCT 




CAGCACGTCC 


ACCATAGTTA 


AGAGCAAAAT 


M » » M K * Ik Jk 

TAAGAATCAA 


TCCTGTGTTG 


TTCTTAGTCA 


12840 


ATTCCTCAGC 


CTTGGTTAAA 


GCTTCAAAGG 


iFumwi "1"!' n r~f^ 


CAGGCCGTCT 


CTCTCCCCAA 


12900 


TCATTTGAAT 


CTTAACATTA 


TTCGCATGTA 




ATAATTATCA 


TAAAACTCTA 


12960 


CTGGCAAGTT 


CATGATAAAC 


TTGACTTCCT 


kj/\ J k_ I ViuAUVj 


CCTCCAGTTT 


TCCGTAGAAA 


13020 


AAGCATAGAC 


CGTAATAACC 


TTGACGCCCA 


GTTTGTTGGC 


TGCCTTGGTC 


ACCGTTTGCA 


13060 


ATGCTTCCAT 


GCCCGCCTTA 


TGTCCAAAAA 


CTCGCGGTTG 


CATACGTTTT 


TTAGCCCAAC 


13140 


GGCCATTGCC 


ATCCATGATG 


ATGCCGATAT 


GAGCAGGAAC 


CTGTGTCGGA 


ACCTCTACTT 


13200 


CCACAGCCTT 


ATCTTTCTTA 


AAAAATCCAA 


ACATGATCTT 


ATTCCTATTC 


AAAAATCTAT 


13260 


CGTTTCATTA 


TACCATATTT 


CCCCATTTTC 


TTCTATCACT 


AAGCTATTTA 


TTCTCAGGCA 


13320 


CCAAGCCCAT 


TTTTCAAAAA 


AATAAGCCGC 


CTGATTGGGC 


GACTTTATTT 


TTATAGGGAG 


13380 


ATTATTATGA 


AAAACTTTTA 


GGAGTTTAAG 


TTAACGTCTT 


CTTAACTTAT 


GAACTTAGTG 


13440 


TACACTCCCT 


AGCTTAAAGT 


TTCCTTAAGT 


ATTTTTAAAA 


ATCAAATTTT 


TCCATTTCTC 


13500 
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CTGCCAATTT 


TTCTTGGATA 


. AACGTGTTTG 


ATAGAGTTCC 


ATTCGGTCTT 


CATTTTCTAA 


13560 


GAAATGAGGA 


GTTGGACGAA 


CTTGAAAATT 


CAAAATATCC 


TCCAAACCAT 


AAGGTACATA 


13620 


GAGTTCAAAA 


TCTAATTCTT 


CATTCAAGCG 


CACTCCAACT 


GCCGTACACC 


GTTCTGGATA 


13680 


CTTACTCATA 


GCATCACGAG 


AACTGGTATA 


GGAAGCAGTG 


TGAGCACTGT 


GCTGATGCAT 


13740 


ATAGACCTGA 


TTTTTCAATT 


CCCACTGGTA 


CTGAGGAAAA 


TCCTCTCTCA 


CCTTTTTCTC 


13800 


CAGTAATAAG 


GTTTCCTCAT 


AAGAAAAATC 


TGGATCAAAG 


AAAATCACAT 


CTATATCTGT 


13860 


TTCATGATCA 


AAAGGGCATT 


TGTCTGACAA 


AAGATTCCAG 


ATGAAATTTC 


TGACAGAACC 


13920 


TGCTGCCAAC 


CACGAGTCTT 


TCAAACCAAG 


GTCTCGGATG 


ATCCTCAGAA 


TGGCCATCAT 


13980 


ATCTGGACTT 


TCTCTAAAAC 


CCTCTAAGAT 


TTCTTGCTTA 


TTTTTCACTG 


TATTCATAAC 


14040 


CTAAGTGCTC 


ATATGCCTTA 


CCAGTCGCCA 


CCCCTCCAGA 


CCGTCTCCCC 


ATGATAAAAC 


14100 


CTTTTTGAAT 


CAAGTAAGGC 


TCATACATGT 


CTTCAACTGT 


CTCACGCTCT 


TCGGCGATAT 


14160 


TCACAGAAAG 


AGTTCCTAGA 


CCAACAGGTC 


CTCCACTGTA 


CATCTCAATC 


ATGGTGCGAA 


14220 


GGATTTTTTC 


ATCCACATAG 


TCCAAACCTT 


CATGGTCAAC 


ATCCAGCATA 


GTCAAAGCCT 


14280 


TATCCCTAAT 


AACATCATCG 


ATAACCCCAT 


TCCCCATTAT 


CTGGGCAAAA 


TCGCCCACGC 


14340 


GCTTGAGCAC 


ACGATTGGCA 


ATACGAGGGG 


TTCCACGACT 


ACGTAGGGCC 


AACTCAGATG 


14400 


CTGCCTCATG 


GGTGATTTCC 


ATCTCAAAAA 


TATCTCCCGT 


CCGCTCGACA 


ATTTCTGTCA 


14460 


ACTCAGCATG 


AGCATAATAC 


TCCATATGAC 


CTCTAATCCC 


AAAACCTGCC 


CGTAGTGGAT 


14520 


TTGAGAGCAT 


ACCAGCCCGA 


GTCGTCGCAC 


CAATCAAGGT 


AAAAGGAGGC 


AACTCCAAAT 


14580 


GAACACTCCG 


ACTCCCTTCA 


CCAGCCCCAA 


TCATAATATC 


GATGTAGAAC 


TCCTCCATGG 


14640 


CACTATAAAG 


CACTTCTTCC 


ACTCACATGG 


GTAAGCGATG 


AATCTCCTCA 


ATAAAGAGGA 


14700 


CATCTCCAGG 


CTCTAAATCA 


TTCAAAATCG 


CTACCAAATC 


ACCCGCTTTT 


TCGATAACAG 


14760 


GACCAGACGT 


TTGCTTGAGA 


TTGACTCCCA 


GTTCATTGGC 


AATGACAAAA 


GCCATGGTTG 


14820 


TTTTCCCAAG 


CCCTGGAGGG 


CCAAATAAGA 


GCACATCATC 


CAGCGCTTCA 


TCCCGCATTT 


14880 


TAGCGGCTTC 


GATAAAGATC 


TGAAGTTGAT 


CCTTAACCTT 


ATCCTGACCA 


ATATATTCAC 


14940 


GTAAATACTG 


AGGACGGAGC 




CTAACTCCTC 


ATCACCCATC 


ATCTCATTAT 


15000 


CTAAAATTCT 


ACTCATGGCT 


CTATTATATC 


AAAAAAAACA 


ACCCACAAAC 


AAAAAAGCCA 


15060 


CCTGATTGGG 


TGACTCCTAA 


GTTTAGCACT 


TATGTCCTAT 


AATATTATAC 


GGCACTTCTA 


15120 


CACCGCCTAC 


GAAAGGAGGT 


GAGATAGCCC 


ATGATGGAAT 


TACTACTCAA 


AACTATTATC 


15180 


GGACCAATTG 


TGGTCGCTGT 


CGTTCTTCGT 


ATACTCGATA 


AATGGCTAAA 


CAAGGACAAA 


15240 


TAGTGTCAAA 


AAAGACCTCA 


AGCTTATTTG 


GTCGTGAGCT 


TGGGGTCTTT 


TCTAGCCTAT 


15300 
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GATATAGAAC 


TACTACTCAA 


TTCCTTTTTA 


TTATCCCATA 


GTTCACGAAT 


TTTGTCAAAA 


13JOU 


CTTTACATTT 


TCTTCAACCG 


CTGTACGACA 


AGACGCTTAA 


GATTAAGAGA 


ACU I l AwvAm 


15420 


TTCTATCAAT 


TTCATAGAAA 


TTTTGATTTC 


GTAAACGAAG 


AGACAATCTT 


ACATGTCACT 




TCTCATTTAA 


TACGCCACTA 


CTAGACAAGC 


AAAATCATTA 


TTACAGTACT 


TCCAGTCCTT 




CAATTAACAG 


TCACTTACAA tcaaattcag 


TTTCAACTAG 


CTGAAGCGAC 


CACAGACCTA 


i ccnn 
130UU 


TTTCTTAGTC 


ATATTCGCTA 


AAAAAATCCC 


CGCCAAAATC 


TCAAAAAGTC 


CCCGCCAAi l 


1 3O0U 


CCCCCACCAA 


AATCCGAAAA 


ATACCGAAAA 


ATATCGAAAA 


ATTATTTTTA 


GAATAGTCCC 


15720 


AAAAATCCTG 


AAATAGAGCT 


AAAAAACTCC 


ACCTGATTCG 


GTGGACTTAA 


GGGAGATTAT 


15780 


TATGAAAAAG 


AAAAGTTTAG 


GATTTTATTA 


AATAAAGTTA 


GGAGGTCTTT 


ATTTAATAAC 


i c a ji a 
1504U 


TACATGATAC 


AAGACGAAAC 


TTAAAACTAG 


CTTAACTTTT 


CTAAAATTTT 


ACTATTTTGC 


15900 


AAAAAATTTC 


TATCACCAGC 


ACCTCACCAA 


TCGACTAGGG 


GATAATCTCT 


AGCCCCTCTC 


15960 


ACACCACCCT 


ACGTGCCGTT 


TGGCATACGG 


CGGTTCAACT 


AACTTTTAAC 


GCATGTCGTT 


16020 


CAAGGTAATA 


ATCCAAACAC 


GAAACCAGTC 


CACGTTTTTC 


CACGACTCGT 


TTTGATATAG 


16060 


CACGTTTAAG 


TACCGACTTC 


TGAGCTACTA 


ATTGATAATG 


GTCGCCCCAG 


CCACATACCT 




T ATCTGCT AT 


CCATTTAGCA 


ACTCCTAACT 


TAAGCAATCC 


CCATAATCGT 


CTCGATTTCT 


16200 


TCTTCCATTG 


CTTCCACATA 


ATCACTCGTA 


GGCGAGTACG 


CAAGCCCTCA 


TCTATGCTGC 


16260 


CGACTATACT 


TTTCATATTT 


CCCAATGAGC 


AATACTTTAT 


CCATCCTCGA 


ATAGACAAAT 


16320 


TCAGTTGCTC 


AATACGTCTT 


GTTAGGTCTA 


TACTCCATTT 


CCTCTCTCTT 


AGTTTCTTCA 


16360 


ATTTAAACTT 


AAATCTCCGA 


ACACTATCTT 


GATGTGGACG 


GCTTTTCCAA 


CCATCTGATA 


16440 


ATTTCCAGAA 


CCCAAAACCT 


AGATATTTCA 


ACTCTCTTGG 


TCATGTTTAC 


TTTCAAACCT 


16500 


AGCCCTTTCT 


CAATAAACGA 


CTGACTCAAT 


ACATC 






16535 


(2) INFORMATION FOR SEQ ID NO: 75: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCAGAGCGTT GCCTCCCAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 
GATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 120 
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AGGTCCTTAC CGTCGTATGG TTGCGGCTAT GGCTGATTTA CGATTTTCAG GAACTATGAA 180 

GGCTATCTGG GATGACCTCT TTGCCCATCG TAGTTTTCCC CAGTGGATTT ATTTGCTGGT 240 

TTTAGGAAGT TTTCCTCTCT GGCTGGAGTT GGTTTACGAA CATCGTATTG TTGACTGGAT 300 

TGGGATGATT TGTAGCTTGA CAGGGATTAT CTGTGTAATC TTTGTATCGG AAGGTCGAGC 360 

• AAGTAATTAT CTTTTTGGCT TGATTAACTC TGTTATTTAC CTTATTTTGG CCCTACAGAA 420 

AGC CTTTT AT GGTGAGGTGC TGACGACACT TTACTTCACA GTCATCCAGC CAATTGGACT 480 

TCTAGTTTGG ATTTATCAGG CACAGTTTAA GAAGGAAAAG CACCA G TTTC TCGCCCGTAA 540 

ACTGGACGGC AAGGGCTGGA CAAAGTATCT TTCCATTAGT GTGCTTTGGT GGTTCGCCTT 600 

TCGCTTCATT TATCAGTCTA TTGGTGCCAA TCGTCCCTAT CGTGATTCAA TCACAGATCC 660 

AACCAATGGG GTAGGGCAAA TCCTCATGAC AGCTGTTTAC CGTGAACAGT GGATATTCTG 720 

CGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGCA GAAACCCTGC AAATTCAAGG 780 

GAAATATCTA ATTTATCTCA TTAACACTCT AGTTGGTTGC TATCAATGGA GCAAGGCAGC 840 

TAAGCAGAAT ACTGATTTAC TTAACTAGGA AAAGATGTTT GAAAGTGCTG TTTTGAGATT 900 

TCGATTAAAA C AG AT AT ACT TGATAATCAA GGATTTATAC TATGAAAAAG AGCATCGGCG 960 

GGTCCTCTTT TGTTGTTGAA AACATAAAAA ACTCAGTAAC CTAGAAATAA GACAACTGAA 1020 

GCTTTACTCT AT ATT C AATT TTTAGGAATC ACAACCTCTA GATAAAATTG GACAACTTCC 1080 

TGCTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAACTTGGAC 1140 

ATGACCAAGT GTTGGAGGTA AGAAGTAGGC AGATTAGGGT GGGCTTCTTT TAAATTATCA 1200 

GCTAGCACGG AATAGACATG CTGTTCTAGC TCTTTATGGA GTTGACGGAG GAAGTAGTCA 1260 

TTTTTGGAAA ATAGCAGACT GGTGATATGG TCTTGGTTTT TATGAAAATG GAGAAAGAGG 1320 

TGGGCGAGGT AGTCCTCGGT TGAAATGGCT TGCTCTCTTT CAAAAAGATG ATGGAAGAGG 1380 

TAGCGGCAGA CCTCCTCCAC AAGAAGCTCC TTACTCTCAT AGTGACACTA AAAGGTGGAT 1440 

CGTCCCACAT CTGCGAGATC AATGATATCC TGAACAGTAG TGGCCTCGTA GCCCTTAGCA 1500 

TTCAAAAGTT GTATAAAAGC TTGAT AG A TG CCTTTTTTGG TTTTGCTGAT ACGGCGGTCA 1560 

ATGTTACTCA TATGGACACT TAAGGCAAAT TGTTCAGAAC TGAATAAACC TGACGTTTTG 1620 

CTTCTATCCT TTCTTTGAGT TTTAGTGGAT AATGATAATG AACAAGGTCT TCATAAATCT 1680 

ATTATAACAA AGGAATGAGA AATATGAAGG CAAAATATGC TGTTTGGGTG GCTTTTTTCT 1740 

TAAATTTGAC TTATCCCATT GTTGAGTTTA TTGCAGGTGG AGTATTTGGT TCTAGCGCTC 1800 

TTCTTGCTGA CTCTGTGCAT GACTTCGGAG ATGCGATTGC AATTGGAATA TCAGCTTTTC 1860 

TAGAAACAAT CTCCAATCGT GAAGAAGACA ATCAGTACAC CTTGGGCTAT AAGCGGTTTA 1920 
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GCCTGCTAGG 
TGGAAAATGT 
TAGGAATTAT 
CAAAGAATGA 
TTATCCTGAT 
CCCTTCTCAT 
AGATTTTCTT 
AGCGATTGGA 
AAAAAAATGC 
AGTCTATTCG 
CTGACCTAGA 
ACCATCAACA 
ATTTCTTTAT 
GATATATCAT 
CAACTCAAAG 
TGAGAGGAGG 
ATACGATTCG 
GGATTCGTGA 
CGGCGGGTCT 
AAACGAGAGA 
TGTCTCAGCT 
GAAAATTTTA 
TTGAACGTCC 
TTTGTGGTTC 
GTGGACACGA 
CAGGTGATTT 
CTGGATTTGA 
CAGAATATAT 
CTGACTATAC 



AGCCTTGGTA 
CACGAAGATT 
TGCGATTACT 
GTCTATTCTG 
GGCGATTGTT 
TTCTTTCTTT 
GGATGCTCTG 
CAATGTGGCC 
CATTGTCCAT 
AATTTTCCTA 
AACTCACCAA 
TTAGAAAAAA 
TATTTAAATA 
TGTTAATGAT 
TTATATAATA 
TTAGCGTCTG 
GTATTATGAA 
TTTTCAAGAT 
CTCTCTAGAT 
GGAGAGCCTT 
ACACACAGCT 
AATGAAATCA 
GCAAATAATA 
AGATTTATGG 
AGCGATTGGG 
TCTGATTGTC 
CGGTTCTTGC 
TCGCTTCCAC 
AGAAGGGATG 



ACAGCTGTGA 
TTGCATCCCC 
ATCAATCTGT 
AGTCTGCATT 
CTTCGATTTA 
ATTCTTTCAA 
CCAGAAGGTC 
AGCCTTAATC 
GTTTGTCTAA 
AAAGATTGTG 
ACCCATAAGC 
GTGAAAAATA 
TTTCAAAAAT 
AAAAATTTTT 
AGATAAGTGA 
AATATTAAAT 
CGGGTTGGTC 
CAGGATATCG 
AGTTTACTTG 
GCTATTTTAG 
TTAAATCGTT 
CCAGTATATA 
GAAGCGGATC 
AGGTACCGTA 
ATTGTTGAAG 
CCTTTTACAC 
GACAATCATA 
TATGCAAACT 
CTCAAGTCCC 



619 

TTCTCGTAAC 
AACCAGTCAA 
TAGCGAGTCT 
TTCTGGAAGA 
CGCACTGGTA 
AAGCCCTTCC 
TTCATATCAA 
AGCTTAATCT 
AAGAAATGGA 
CTTTTCAAAA 
GAAAGCTCTG 
CTTGGGTACT 
TGGTAAGAGA 
CGATTAGATA 
GTTAGAATAG 
CTGCCAGTGA 
TTCTGCCACC 
AAGCGCTGGA 
ACTATATGTC 
AAGAGGAAAA 
TAAATCTCAA 
CAAAGGCAGG 
ATGTGATTAT 
ATCCACAAAC 
AAGCTGGGGA 
ATGCATGTGG 
TTGGCAATAA 
GGGCGCTGCT 
TTTTGACTCT 



GGGCTCTGTT 
TGATGAGGGG 
GGTGGTTGCT 
TACGCTAGCG 
TATCCTAGAT 
ACGTTTTTGG 
CCAAGTAAAG 
CTGGACTATG 
ACATATGGAA 
TATTACCATT 
TCACTTCGAA 
ATCTTATTTG 
AGAGCATTCT 
CAAAATGCTT 
CGTGAATTCA 
TTTGTTGGGA 
GATTACTCCT 
ATTTATTAAG 
GCTCTACCAA 
GCAAAAATTA 
AATTAAACTT 
TCAGGTTGGA 
TCCTGTCCTT 
GAAAGCTGGA 
AGCCATTACG 
TGAGTGTGAT 
TTTGGGGGCT 
TAAAATCCCT 
TGCAGATGTC 



CTAGTCATTT 
ATTCTCTGCT 
AAGGGAAAGA 
TGGGTAGCTG 
CCTCTTTTGT 
TCTACACTCA 
AGTGGCCTGG 
GATGCTTTGG 
ACTTGTAAAG 
GAAATTGATG 
CGGAGTTATG 
GAATAGAGTA 
ATAAACTCCA 
GACTTCGAGT 
GTGAATGAAA 
ATTTCAGCGG 
ACTCCTACTC 
TGTTTTCGTT 
AAGGGAGATG 
GAGGAGCCCT 
TATAAGGAAG 
CTTCCTAGCA 
CGTGCGTGCG 
CACAAAAATA 
ACGGTGAAAG 
GCCTCTCTTG 
GATTTTCAGG 
CGTCAACCTT 
ATGCCGACAG 



1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
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GCTATCATGC 


GGCGCGTGTT 


GCAAATGTTC 


AAAAAGGGGA 


CAAGGTTGTT 


GTTATCGGTG 


,3720 


ATGGGGCTGT 


TGGTCAATGT 


CCTGTCATCG 


CGGCTAAGAT 


GCGTGGAGCA 


TCACAAATTA 


3780 


TCCTTATGAG 


CCGTCATGAA 


GACCGTCAAA 


AGATGGCTAT 


GGAGTCAGGT 


GCGACAgcTG 


3640 


TTGTTGCAGA 


ACGTGGTCAA 


GAAGGAATTA 


CCAAGGTGCG 


TGAAATCCTC 


GGTGGAGGAG 


3900 


CAGATCCAGC 


ACTTGAATGT 


CTTGGTACGG 


AGGCTGCTAT 


AGAACAGGCG 


CTAGGTGTTC 


3960 


TTCATAATGG 


AGGGCGTATG 


GGCTTTGTAG 


GAGTCCCACA 


CTATAATAAT 


CGTGCTCTTG 


4020 


GTTCGACATT 


TATGCAAAAT 


ATCTCTGTAG 


CAGCTGGGGC 


AGCTTCTGCT 


ACAACATACG 


4080 


ATAACCAATT 


TTTACTAAAA 


GCCGTCCTTG 


ATGGTGATAT 


CAATCCAGGT 


CGCCTCTTTA 


4140 


CTTCAAGTTA 


TAAACTGGAA 


GATATCGACC 


AAGCCTATAA 


AGATATGGAT 


GAACGTAAGA 


4200 


CAATTAAGTC 


TATGATTGTA 


ATCGAATAAA 


AAACGAATAG 


GAGTTTTAGA 


ACTCTATTCG 


4260 


TTTTTTATGT 


TATCCTATTC 


TTGATTTAGG 


CTACTTTCTC 


TTAATGTCAG 


TCTGGTTCCC 


4320 


AGCATGGTCA 


GGCTAGGGAT 


TTTCCGACCG 


TGGAGGACTT 


CCTTGTTAAG 


AATATCCATA 


4380 


CCTGCTCGGC 


CCATTTCTTC 


ACT AT AAA CT 


GTAATACTAG 


AGAGGGGAGG 


ATAGACCTGT 


4440 


TTGGTCAGAC 


TAGTGTCGTT 


AAAGGAAATG 


AGGCTGACGC 


GATCTGGCAG 


GCTGATTCCA 


4500 


GCTTCTTGGA 


GGGCACGGAG 


GGCACCCATA 


GCTAAACTAT 


CGCTGGCTGC 


CAAAAATCCT 


4560 


GGCGGAAGTT 


CCTCTCCCAA 


GCTCTGAATG 


CCCTCCTTCA 


TTAAGTCATA 


GCCACACTGG 


4620 



GCACTAAATC TTCCTTGAAA GACCAGTTCA TCATGATAGA TTCCCCTCGC TTGACTATAG 4 680 

TTTTTGAAGT TTTCTAGACG CTTGTCCTGA ATGATTTCTT CTTCCTCTGT TGTTTCTTCA 4740 

AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA CGAAATAATC GACAACCTGT 4800 

TTCATAGCAG TGTAAAAATC CGTGATAATA CAGGTATCTC CCAGGGAAAG TCTATCGCTG 4860 

TCTAGAAATA CAAGAGGCTT TTGCTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4920 

TTTCCGATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAACGGTG GTCATTAAAA 4980 

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 5040 

TAGTAGTAGA CCTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGGATAAT GGC AATCT TT 5100 

TGCTTGGCTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGCTTAAAA TACGGTGTCT G G TTTCTTCT GTAACAGATA GGCTCTGGTC CCGGTTGAGG 5220 

ACCCGGGATA CGGTCGCGAT AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 5280 

ATAAATCCTC CTTGATTAGG TTAGTATATC ATGTTTTTCT TCTTTTTACT GATATTTTAC 5340 

TAAAATTTTA GTAAAAAGGA TTGACCTTGG AAAATTCCTT GGATATAATA GAAAGAAAAC 5400 

GATTACACGT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 5460 
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1 J\\m I AV» luAA 


GCCX I 1T-GCA 


AAGAC4~l~l U 1 










c ri"\.i"i"riCA 


CCAGGCCGCA 


TTAATTTGAT 


TGl* 1\»AACAC 


ACAGACTACA 


ni>uu ivKivLA 


33HU 


CGTTTTTCCT 


GCTGCTATTT 


CCTTGGGAAC 


ttalgctgca 


CtTVwTAACC 


UTUAluACCA 


3040 


AGTCTYGCGT 


TTCTACTCAG 


CTAACTTTGA 




ATTATlUAAv 


TGCCTCTCGC 




TGACCTCAAG 


TTTGAAAAAG 


ft 0*0* ft ft ft 0*m0+ 

AGCACAACTG 


GACCAA I i AT 


C.CAAAAGGTG 


TCCTTCATTT 


5700 


CTTGCAAGAA 


GCTGGGCACG 


TGATTGACAA 


AGGTTTTGAT 


ft: ivft^«*wvMVl 

TTTTATGTTT 


ft Hft^^ ft ft ft #m ft fft 

ATGGAAATAT 


5820 


TCCAAATGGT 


GCTGGCTTGT 


CTTCTTCTGC 


ft M J 11 * J n mUB LJ ni J' ■ ft ft. 

ATCCTTGGAA 


CTCTTGACAG 


GAGTCGTGCC 


S880 


TGAGCATCTC 


TTTGATTTAA 


AATTAGAGCG 


TCTCGATTTG 


GTTAAAATCG 


GCAAACAAAC 


5940 


AGAAAACAAC 


TTTATCGGAG 


TAAACTCTGG 


CATTATGGAC 


CAGTTTGCTA 


TTCGTATGGG 


6000 


GGCAGACCAA 


CGTGCTATTT 


ACCTAGATAC 


TAATACTTTA 


GAATACGACT 


TGGTGCCACT 


6060 


TGATTTGAAG 


GACAATGTCG 


TTGTTATCAT 


GAACACCAAC 


AAACGCCGTG 


AATTGGCGGA 


6120 


CTCTAAATAC 


AATGAACGTC 


GTGCTGAGTG 


TGAAAAAGCA 


GTGGAAGAAT 


TGCAAGTTTC 


6180 


CTTGGATATT 


CAGACTCTGG 


GTGAATTGGA 


CCACTGCCCC 


GTTCACCAAT 


ATAGCTATCT 


6240 


GATTAAAGAT 


CAAAATCCTT 


TGAAACGTGC 


TCGCCATGCT 


GTGCTTGAAA 


ACCAACGTAC 


6300 


CCTCAAACCT 


CAAGTACCAC 


TCCAAGCAGG 


AGATTTGGAA 


ACATTTGGAC 


GCTTGATGAA 


6360 


TGCGTCACAC 


CTTTCTCTGG 


ACCATGATTA 


TGAACTAACT 


CCTTTGGAAT 


TCGATACCCT 


6420 


TGTTCACACA 


GCTTGGGCAC 


AAGAAGCAGT 


TCTCGGTCCT 


CGTATGACAG 


GGGCTGGTTT 


6460 


TGGTGGCTGT 


GCcATTGCCT 


TGCTTCAAAA 


AGATACTGTT 


CAGGCCTTTA 


AGGAAGCTGT 


6540 


AGGCAAACAC 


TACGAGGAAG 


TAGTTGGATA 


CGCTCCAAGC 


TTCTATATCG 


CTGAAGTTGC 


6600 


ft ^nt>/^^»^ ft ^ftftt 

AGGTGGCACT 


CGCGTCCT'TG 


ACTAGTCAAA 


AGGAGGCTCT 


ATACTGACCT 


ft ft. ft «■» m ft 

TAGTAAATAA 


6660 


ATTTGTAACA 


CATGTCATTT 


MHM ft ft ft 0* 0+91*0+ 

CTGAAAGCTC 


ft MMI/« ft 0*0* ft ft 

ATTTGAGGAA 


ft. 99*0+0+ ft *W*f +*4 n * ft ft 

ATGGATCGAA 


ft Ift^fffl^ ft 0+ 

T CT ATCTG AC 


6720 


CAATCGTGTT 


TTGGCACGAG 


TGGGAGAAGG 


TGTTTTGGAA 


GTTGAGACCA 


ATCTGGATAA 


6780 


ATTGATTGAC 


CTCAAGCACC 


AGCTGGTTGA 


AGAAGCCCTT 


CGATTAGACA 


CGATTGAGGA 


6840 


TAGTCAGACT 


GCGCCTGAAA 


TCCTTGGTGC 


TGAACTGATG 


GATTTGGTGA 


CTCCTTGTCC 


6900 


AAGTCAGGTC 


AATCGTGATT 


TTTGGGCAAC 


CTACGCCCAC 


TCTCCAGAAC 


AAGCGATAGA 


6960 


CCATTTTTAC 


CAACTCAGTC 


AGAAAAATGA 


CTACATCAAA 


CTCAAGCCCA 


TTGCTAGAAA 


7020 


TATCGCTTAT 


CGTGTTCCAT 


CTGACTACGG 


AGAACTTGAA 


ATTACCATCA 


ATCTCTCTAA 


7080 


GCCTGAAAAA 


GATCCCAAAG 


AGATTGTGGC 


AGCCAACTTC 


GTGCAAGCTA 


GTAATTATCC 


7140 


TCACTGTCAG 


CTTTGTCTAG 


AGAATGAGGG 


CTACCATGGT CGAGTTAACC 


ACCCAGCTCG 


7200 
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TACCAATCAC CGTATTATCC GTTTTGAAAT GGTTCCTCAG 

CCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT 

CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCCTA 

TGCTGGATCT AATGCCGACC TGCCCATTGT GGGGGGCTCT 

TCACGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAACTGG 

GACTTCGGAT TCCAAAGAGC ATTTGATCAA TTTGGCTGAT 

CCAGTATTCA GATCCTGCAC TGCAGATTTT GGCAGAGACA 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG 

CAATCACACT TCAGCAGAGT ATCCTGATGG TATCTATCAT 

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC 

TCTGAAAGAA GAACTCGAGC AACTCGCTAG CTATCTTGTA 

CCATTATCAT CAGGAGTGGC CAGACCAACT CAAATCCCAA 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGGGTGCTA 

GATGCAGCAG TCTACAAGCA GACAGAACAA GGGCAGACAG 

CAGGTCGGAA TTTTACTACA CTAGGAGCTT TCTCGG 

(2> INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 10011 base pairs 
<B) TYPE: nucleic acid 
(C) ST RAND EDN ESS : double 
(O) TOPOLOGY: linear 



GAATGGGGTT 
GGCCAGCATC 
GACCAGTTTC 
ATTCTAACTC 
TTGCAAAAGG 
CCCATGTCTG 
AAGATTTTGC 
GACAGGACAC 
TTCGACTTGG 
CCCCACAAGG 
TTGGCAATCT 
GGAGAAGCTG 
CATCCACACT 
TCTTTGCGCG 
CCTTTATGCG 



TCCAGTATTC 
CTCCCATGGC 
CAGGATATTT 
ATGATCATTA 
CCTTCCGATT 
TCCTACGTTT 
AGGAATGGCG 
CGCATCACAC 
TCTTCCGAGA 
ATCTCCAACA 
TGCCACCACG 
TTACAGTTCC 
AACGGATAAA 
TGTACTTGAC 
CTTTGTGGAA 



7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7B00 
7660 
7920 
7980 
8040 
8100 
8136 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CCCATACTGA AGACTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT 
TTTTGCGTGT TTAACTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT 
TCCCCTCAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA 
CATTCATAGA ACTACTATAC CATGTTTTGT TTCCCAGGAA GTCTAATATT 
GGAACGCTCA TTCCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCG AT AAC 
GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GCTGTAAAGA TATCGTAACC 
TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC 
AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC 



TTCGTCCACC 
TCAAAAGTCG 
CTTGCTTCAT 
CTCAAATACT 
TGGGATACCT 
TTTCATAAGG 
ACGGTTTGAA 
ACCGCAGGCA 



60 
120 
180 
240 
300 
360 
420 
480 
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CCAAGAATTT TAATCATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAACC 540 

GCTTGCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 660 

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 720 

CATATTATGG AAAGTCACCC GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780 

ATGAACAATA TCTCTCTCAG GAATCATTAC ATTTGCAACT CCTTTCCTAG AAATTCCATA 840 

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AGTTCGAGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAACTTATTC TTGATTATCC 960 

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AACAAATAAT CTAATACCAT AAGGTTTTCC 1020 

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 1080 

TTCTATTCTT TTTTCTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT GTTATATAAA 1140 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200 

AGACCAAACT AGGAACCTAG CCCCAGTTGT TCAAAACACA CTTTTGAGGT TGTAGATGAA 1260 

ACTGACGAAC TCACTCAAAA CATGGTTTTG AGGTTGTAGA TGAAACTGAC CAACCAACAg 1320 

CCATACATAC GGTAAGGCGA CCCTGACGTG CTTTGAACAG ATTTTCCAAG ACT AT AAAAA 1380 

CTAAAAAAGC ACACCATCTA AGCCTGCTTT ACTATTGATT CTTATATAAA TTTCCTGTGA 1440 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGCAAGGC 1500 

TTCTTGCGGA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTTACCAC CTTTTTGTTT 1560 

TTCAACGACT TTACCCTTAC GACAAACGTC ACCACCATAA CATTTAGCAA CTACGTTCTT 1620 

ACGAAGGGCC TTGATATCAG T ACGAGC G AC AATCTTGTCT CCAATAGCCG CTTGGATTGG 1680 

AACTTCAAAT TGTTGGCGAC GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACC 1740 

TTCGTAGGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 1800 

AAGAATATCC ATTTTCACCA GCTTAGATGG CCGATATTCT CACAATTCCT AGTCAAAGCT 1860 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCACCAAG 1920 

AGGAATTTGA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAG TCACAAAGTC 1980 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT 2040 

TTCCGCCTTG ACATAAGGCT CTTCAATGCT CGCAATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TCGCTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTGTAAA TAACAGACCG 2160 

AGCTGTCATG ATCAGGTCAA TATTGAACTC ACGCTCTAAA CCTTCCTGGA TAACATCCAT 2220 
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ATGGAGAAGT 


CCAAGAAATC 


CACAACGGAA 


ACCAAATCCA 


AGTCCCTCAG 


ATCTTTCTGG 


2280 


TTCAAACTCA 


ACACTAGCAT 


CATTCAGTTG 


CAATTTTTCA 


AGcGCTTCAC 


CCAGGTCATT 


2340 


GTACTTGTTT 


GATTCGATTG 


GGTAGAGACC 


CGCAAAGACC 


ATAGGATTCA 


TCTGCTTATA 


2400 


ACCATGTAAT 


GGTTCTGCCG 


CAGGATTGGT 


TGCCAAGCTA 


ACGCTATCAC 


CCACACGAGT 


2460 


ATCCTGAACC 


GTCTTGATAC 


ACCCCGCAAT 


GTAACCAACA 


TCACCAGTCG 


CAAGGAAATC 


2520 


ACGACCAACC 


CCTTTTGGTG 


TAAAAATACC 


GACTTCGGCC 


ACATCAAAGC 


TCTTACTATT 


2S80 


CCTCATGACC 


TGAATCTTAT 


CACCAGGTTT 


GACCACTCCG 


TCCATGACAC 


GCACTTGGAC 


2640 


GATAACCCCA 


CGGTAAGCAT 


CGTAAACAGA 


GTCGAAAATC 


AAGGCCTTAA 


GTGGCGCCGT 


2700 


CACATCACCC 


GTTGGTGCTG 


GTACTTTTTC 


TACAATTTGC 


TCGAGGATTT 


CTTCAATCCC 


2760 


AATACCAGCC 


TTCGCAGAAG 


CCAAAACTGC 


TTCACTGGCA 


TCCAAACCAA 


TCACATCTTC 


2820 


AATCTCTGTA 


CGCACGCGCT 


CCCGATCTGC 


AGCCGGCAGG 


TCAATTTTAT 


TAATGATAGG 


2880 


CATGATTTCC 


AAATCATTAT 


CCAAAGCCAG 


ATAAACGTTG 


CCAAGAGTTT 


GAGCCTCAAT 


2940 


TCCTTGACCC 


GCATCCACCA 


CCAAAATAGC 


ACCCTCACAG 


GCAGCTAGCG 


AACGTGAAAC 


3000 


TTCATAGGTA 


AAGTCAACGT 


GCCCTGGTGT 


GTCAATCAAG 


TGGAAAATAT 


AAGTTTCCCC 


3060 


ATCTTTTCCA 


GTGTAATTCA 


ACTCGATGGC 


ATTCAACTTA 


ATAGTAATTC 


CACCTTCCCG 


3120 


CTCTAGCTCC 


ATGCTATCCA 


AAAGCTGGGC 


CTGCATTTCA 


CGACTTGAAA 


CCGTCTCTGT 


3180 


TTTTTCCAAA 


ATCCGGTCTG 


CTAGAGTTGA 


TTTTCCGTGG 


TCAATATGGG 


CGATAATAGA 


3240 


GAACTTACGG 


ATCTTCTCCT 


GTCGTTTTTT 


CAATTCTTCT 


AAGTTCATGA 


TTCTCTTCCT 


3300 


TTCAGGGTAT 


CTATTTATTA 


TAAATTGTTT 


TTGATATTTT 


GACAAGACCA 


T AC CCTGCT A 


3360 


GGAGTACTAA 


TCTTCAGCGA 


CAAAGCCGTC 


ATTTTCGATA 


AAGTGGTGTT 


CTGTCATTCC 


3420 


TTGGTCTGTA 


AACACAATCC 


CGTCAAGGAC 


ACCACCATAA 


ACAGCTCCTC 


CATCCATTCC 


3480 


AATCTTGCCA 


TCTTCTCTAG 


TCCAAAGCTC 


AGATCTACCG 


CGTTCTTGCT 


GTAACAAACC 


3S40 


ATAGACCGGT 


GTATGACCGA 


AGACAATGGT 


TTTTCCAGTA 


TGATTTTCAG 


CTCCGTGGAA 


3600 


TGGTTTTCTA 


AGCCATACTT 


TTTTATAATC 


TGTTGTTTCA 


TGCCAGTCGT 


CCAAGGTCAA 


3660 


ATCAATACCT 


GCGTGAACAA 


AGATATACTT 


GTCTGTCTCT 


ACTACAAATG 


GCATTTGACG 


3720 


AATCAATTCG 


ACCAAGTCTG 


CCGCTTCAGC 


GgCAACCCGC 


TTGGCATCTT 


CTACTCCATC 


3780 


AACTGGTCCA 


TCCAAGGGAC 


GACCTAGGAT 


AGAGTTAATG 


GTTCTATCTC 


CACCATTGCG 


3840 


ACTATAATGG 


TCATAACTTT 


CTTCTGGGTC 


ATCTAGCCAA 


GTCAAAAACA 


TATACTCGTC 


3900 


GTTTCCGCAC 


AAACAGATAG 


CCCCTTGATT 


GTCCACCAAG 


TCCTTGACCA 


TTTCAAGAAC 


3960 


ACGGTGACTA 


TCCTCACCTC 


TGTCAATCAA 


ATCACCTAGA 


AAGAGCAACT 


GGGGCTGACC 


4020 
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ATCCCAGGTT TTGACAAGCT CTTCCAGCAT CCCAGCTTTT CCGTGAACAT CTCCAATTAC 
ATAATAATCT GTCATCTTAT TTCTCCCTGT TTCTCAACAA TTCTCTTGCT TCCGTCAGGG 

CTCCrrCTCT cacatcatca cctgccaaca tcttggcaac ttcctccact cgctcttcga 

CCGTCAAGAG ACGAACAGTC GAAACCGTTG AATGGTCATT ACTAATCTTC TCAATAAAGA 
ATTGATAATC TGCAATCCCA ATTACTTGTG GCAAATGGGA GATAGCCAAA ACCTGACCAT 
GCTGACCAAT TTTATGAATT TTCTGAGCAA TAGCTTGAGC AACACGACCT CAAACTCCCG 
TATCCACCTC ATCAAAGACA ATGCTAGTCT TGCCTTCTTT ACGTGAAAAG CCAGACTTAA 
TGGCTAACAT GACACGAGAT AATTCCCCTC CAGAAGCAAC CTTAACCAAG GCTTTAAACT 
CTTCTCCAGG GTTGGTTGAA ATATAAAACT CAACCATTTT ATTTCCCTCA CGACTGAATT 
TTCCCTTACT AAAACGAACC TGAAACTGGC CTTTTTCCAT ATAAAGATCT TGCAGTTCTT 
GTTTAATCTC AGCTTCGAGT TGCTGAGCCA AATTATGACG AGCACAAGCA AGTTGACCTG 
CCAAATTCAC AAGATTGACT TCCAACTTCT TAACCTCTGC TTCCATGTCC TCAGACGAAA 
GATTATTGCC TGTCAAGAGA TTGTATTCTT CCCTAATCTT CGCAAAATAA AGCAAAACAT 
CATC AAC ACT CCCACCATAC TTACGACTAA TAGTATGAAG G AGGTCC AAA CGATTCTCAA 
CCTCCATCAG GCGATTGCCA TCAAAATCAA GGTCCTCAAT GATAGCTTCC AAACCTTTGC 
TAATGTCTTC TAAAACATAG TAGGTCTCAG ACACATACCT TGAAATTTCA CGGTATTCAG 
GATCATACTC TTCCACACTT TCCATGTCAT TCATACCTGA ACGAACATTG CCCAGACTTC 
AAAAATCTTC ATTGTCCAAC ATACTGTAGG CATTCCTCAC TGTATCCGCA ATATTTTTCT 
GGTTGAGGAG TTTATCTCGC TCTTGATTGA GAGCCAAGTC TTCTCCAGCC TGCAAGTTTG 
CTGCCTCAAT CTCTGCCATT TGAAATTCCA ACATTTCGAT ACGTGCCTTC TCTTCCTGTT 

cxrrrrrrcrr gacttccaga acctccttgc gcattttccg atacgcatca aaactcgttt 

GATAGGTTTC TTTCAAGTCC CAAAAACCGC CATCACCAAA TTCATCCAAC ATCTCGATAT 

gcacttcggg ACGCATTAAC tcctcatggt catgctgacc atgaatatct acaacatctt 

GCCCAATAGC TCGCAAAACA GACACATTAA CCATCTGACC ATTTACACGG CTGATACTAC 

gaccattttg caagatttcc ccacggatga taatttcatc acctaattct aaaccttcct 

CATCAAAAAT TTCCTGTAAA AGACGACTAT TCTCAACTCA GAAAAGCCCC TCAATCTCTG 
CCTTTGGTGC ACCATGACGA ATAACATCTG TCGTCGCACG AGCTCCCAAC ATCATATTCA 
TGGCATCAAT GATAATCGAC TTCCCTGCAC CCCTTTCACC AGTCAGGACA CTCATCCCCT 
TTTCAAAATT GAGGGAAATA GCCTCAATAA TGGCAAAGTT TTTTATCGAA ATTTCAAGTA 



4080 

4140 

42O0 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

S340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 
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ACATATAGAC CTACCAATTT TTTACTTGTT CAAAGATTTC CTCTGCTAGA CTTCCACTTC 5820 

TGGCAATGAC TAAAATCGAG CTATCATCAC TCAAACAGCT AAAAATCTTG TCTGCAAAAC 5880 

TCTCGATTAA CTGAGCTTTT ACAAAACCCC TATTTCCTGG AATAACTTGG AGATTGATCA 5940 

TCTTATCCAT CAATTCAGCC CATTCGATAT TGTCTTCAGC CAGTTGCAGA CTTTTTACGA 6000 

TTGATTTTGG CAATTCGTAG ACATAGCTGT TCTCTCTCAA AGGAATTTTG ACAATACCTA 6060 

ACTCTTTGAT ATCTCCGGAT ACCGTCGCCT GAGTGGCAGT GATACCTGCT TCTTTCAAAT 6120 

GTTCTACAAT TTCTTCTTGC GTGCCGATTT GATAATCTGT CACCAATCTT CTAATTTTTT 6X80 

CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATCCG CCCTCTCTAC TGCTTCTTTA 6240 

ATCTCACCAA GAATCTGATT GCTTGCTGAC T TTT CTT TTT TCAAATACGC TAAAAATTCA 6300 

ATATTTCCAT GTCCACCTTG GATGGGAGAA AAGTCCAACC CAACCACTGA AAAACCTACC 6360 

TCTACTGCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT CAACCTTACC ATCTCGAATA 6420 

ATTCCATTTT TCCCAATCTG CTCACCTCCT GCCTCAAACT CAGCTTTGAC AAGTGCTACC 6480 

ACCTGACCTT GATCACCCAA GACACGGTGC AAGGCTCGCA AAATCACACT AAGCGAAATG 6540 

AAACT CAC AT CAATACTGGC AAAGCTCGGC TCCTGCTCGA AATCAGTCTT TTCACCATAG 6600 

CGCAAATTGA ACTCCTCCAT GCTGACAACT CGTGGGTCTT GGCGTAATTT CCAAGCCAAC 6660 

TGATTGGTAC CAACATCGAC TGCAAACACC AACTTGGCAC TATTCTCTAC CATGACATCG 6720 

GTAAAACCTC CAGTAGAGCC CCCCATATCA ATCCTAGTCG CCCCATCCAC CGACAAATCA 6780 

AAGACCTGCA AGGCCTTTTC CACTTTCAAA CCACCACGGC TGACATACTT GAGTTTCTCC 6840 

CCCTTGACTT TTAATTCGGT CTCATCTGGA ATTTTCTCTC CTGGCTTGTC AAACCGTTCT 6900 

CCATTAAGGA CTCCTACGAC TAGGCCAGCC ATCACACCTC CCTTGGCCTG CTCTCTCCTT 6960 

TCAAACAACC CCTGTTTATA AGCTAGTACA TCCACTCTTT CCTTAGCCAT TGATTCTCAA 7020 

ACTTTCTACT ACACTTACAA TCGATTCTGT TTCAAAGGGA AGCTGCTGGG CAATTTCTTC 7080 

TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG GCAATGGACT CTTCCAACCC 7140 

CAACAGGGCA GGATAGGTTC ATTTTTCTGC CTGCAGATCC TTTTGAGGTC TCTTGCCGAT 7200 

TTCCTCAAAA CTAGCTGTCA CATCCAGTAC ATCATCTCTG ACTTCAAAAG CAAGTCCAAT 7260 

CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT G AC AATTC AG CTATAATAGC 7320 

TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA TTGGCATGAA TAGTCTGAAG 7380 

T TCTTCC AAA GACAAGTGCT GGTGTTCGCC CTCCATATCC AAAACTTGCC CTGCTACCAT 7440 

ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATCAAG TCCACCTTAA TCTGACTTGG 7500 

CAAATCTGCC TGCGCAATCA AGGCATATGA CTCTAAGAAT AAGGCATCTC CAGCCAAAAT 7560 
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GGCCATAGCT 


TCACCGAATT 


TCTTCTCATT 


GGTTAACCGC 


CCTCTTCGAT 


AATCGTCATC 




ATCCATAGCA 


GGAAGGTCAT 


CGTGAATCAA 


GCTCCCTGTA 


TGAATCATCT 


CTAAGGCAGT 


/ OOU 


AGCTACCTGC 


GCGTGAGCAG 


GTTTGATGGT 


AACCTGCAAG 


GCTTCCAGAA 


CTTCTAACAA 


7740 


GAGAAAAGGC 


CGAATACGCT 


TGCCACCAGC 


ATGAATAGAA 


TAGAGAACAG 


ACTCCCGTAA 


7800 


ACTAGAGGCA 


AACTGCTGGT 


CTCCATAAAA 


ATCTTCCAAA 


GCCGACTCGA 


CAAGAGCTAA 


7860 


TTTTTCTTGC 


TTTTTCATTC 


AAAATCACTT 


tctuttccgt 


CTTCTTGCAT 


GACCTTGACC 


7920 


AAGGTCTTTT 


CAGCCTTGTC 


CAGCGTAGCT 


TGGAGCTCTT 


TTGACAAGAC 


CATGCCCTTT 


7980 


TCAAAGGCAG 


TAATCGCATC 


TTCCAGAGCA 


ATTTCACCAT 


TTTCCAAACT 


TTGGACAATG 


8040 


GTTTCCAGTT 


CTGCTAGATT 


TTCCTCAAAT 


TTCTTTTGTT 


TTGACATCTT 


TAACCTCTAA 


8100 


TTCTACTTGA 


CCATCTCGCA 


TCAAAAGCGT 


TACTTGGTCT 


TTTTTC TTC A 


AACTCTCAAC 


8160 


CGAATCTACA 


. ACGGACTCTT 


CTTTTTTGAC 


AATAGCATAA 


CCACGCGCCA 


CGATTCGGCT 


8220 


AGTATCCAAC 


ATGAGCAAAC 


CTTCCGAAAG 


TCGCTTGGCC 


TCAGCAACCT 


TGGCGTCATA 


8280 


AACTAACGCC 


ATTTGGCTAC 


CTAAGAGCTT 


GTCCAACTGT 


CCTAAACCCT 


CTTGATAGCG 


8340 


. TTGGATTTTC 


GTAACACGTG 


ATAATTCTAC 


TAATTCATGA 


GTTCTTGCTT 


GAACTAATTG 


8400 


TTTCTTATCA 


GAAATCCGAG 


TTCGCAAACT 


TTGTTTCAAA 


CGCACTTCCA 


GTTCCTCCAA 


6460 


GCCTTGCAAA 


TAACCGTCAT 


ACAAGCGCTC 


AGGTTGTCTA 


AAGATAACAG 


ACTGACTGCA 


8S20 


TTTTTTCAAA 


GCCTCTTGTT 


TCTTAGATAG 


AACATTTCGG 


ACTGCCGTTA 


CCATCCGTTT 


8S80 


TTCCTGATTT 


TGCAAATGAG 


CTAATACATC 


CAACTTGCTC 


ACAGGTGTTG 


CCAGTTCAGC 


8640 


CGCCGCTGTT 


GGCGTTGCAG 


CGCGTCGATC 


TGCCACAAAA 


TCTGCCAAGG 


TCACATCCCT 


8700 


CTCATGCCCC 


ACACTAGAGA 


TAACTGGCAA 


ACCAGATTCA 


AAAATAGCTC 


GTACCACAAT 


8760 


TTCTTCGTTA 


AAGGCCCAGA 


GATCCTCAAT 


AGAACCACCT 


CCACGACCAA 


TAATGAGCAA 


8820 


ATCCAAATCG 


TCCCGTTGAT 


TACCACGCCC 


AATATTTCTA 


GCAATTTCCT 


CCGCAGCCCC 


8680 


TTCACCTTGA 


ACCTTGGTCG 


GATAAAGAAG 


GATGTCAACA 


CCTGGGAATC 


CCCTCCTGAC 


8940 


GGT CGTGATA 


ATATCTCGAA 


TAACGGCTCC 


ACTACGGCTG 


GTTACTACAC 


CAATTCTCTT 


9000 


AGAAAATTGG 


GGCAGAGCTT 


GCTTGAAGCG 


TTCTTGAAAC 


AGGCCTTCTT 


CTGTCAATTT 


9060 


TTTCTTAAGT 


TGTTCAAACT 


GAATCGCAAG 


CGCCCCAACC 


CCATCAGGCT 


CAGCTTTTTC 


9120 


AATGATGATG 


GAGTAGCTAC 


CACTTGGTTC 


ATAGACCTGT 


ACACGCCCAA 


TCACATTGAT 


9180 


CTTCATTCCT 


TCTTCCAGGT 


CAAACCCTAA 


TTTCTCATAA 


ATCCCAGACC 


AGATGGTCGC 


9240 


TTGAATAACT 


GCATGGTCAT 


CCTTTAGGGA 


GAAATATTGG 


TGAGTAGGTC 


GTTTACGAAA 


9300 
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GTTGGAAACT 


TGACCAGTTA 


AATAGACCCG 


TTCCAAGTAT 


GGGTCTTTAT 


CGAATTTCAT 


9360 


TTTCAGATAC 


TTGCTCAAAG 


TTGTTACCGA 


TAAATACTTT 


TCCATCTCCA 


CCTACT ATTC 


9420 


ATTTACTTCC 


TCTTTCATGG 


GTATTATTAT 


ACCAAAAATA 


TGCCTAAAAA 


TCTCCATTTA 


9480 


TGTACCATTA 


TGAGGGAAAA 


ATAGAAAAAG 


GAGGCAAGGC 


CTCCACATGT 


GATTATTTGC 


9540 


TGTTTCCAGC 


TTCTTCCAAA 


ATCTTTGCAA 


TCTTGGTCGT 


CAACAGGTCG 


ATAGCCACGG 


9600 


TATTGCTAAC 


CCCTTCAGGA 


ATGACGATAT 


CAGCATAACG 


CTTAGTTGAC 


TCGATAAACT 


9660 


GCTGGTACAT 


TGGTTTGACC 


ACACCTAACT 


ACTGGTTAAT 


AACGCTATCA 


AGGCTACGGC 


9720 


CACGCTCCTC 


CATATCACGC 


TTGATACGAC 


GAATAATGCG 


CACATCGTCA 


TCCGTATCCA 


9780 


CAAAAATCTT 


GATATCCATC 


AAATCGCGCA 


GACGCTTGTC 


CTCCAAGACC 


AAAATACCCT 


9840 


CAACCATAAA 


GACATCTTGA 


GGTTCCTGAC 


GATAGGTCTT 


GCTACTCCGT 


GTATGCTCTG 


9900 


TATAGTCGTA 


GGTCGGGATG 


TCCACCGGAC 


GCCCTGCCAA 


CAATTCCTTA 


ATCTGCTCGA 


9960 


TCATCAAGTC 


TGTATCAAAG 


GCAAAAGGAT 


GCTCATAGTT 


GGTTTTGACG 


G 


10011 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 





TAAAAATAGA 


AGACAAAGAA 


CAAACTCTTG 


GAGGCTTTGT 


CCTTGCAGGC 


60 


TCAGCCCAAG 


AAAAAACCAA 


AACAGCTCAA 


GTTGTGGCTA 


CTGGACAACG 


TGTTCGTACC 


120 


TTGAACGGTG 


ACTTCGTTGC 


TCCAAGTGTT 


AAAACTGGAG 


ATCGTCTCTT 


AGTTGAAGCC 


180 


CACGCAGGTC 


TTGATGTCAA 


AGATGGCGAT 


GAAAAGTACA 


TCATCGTAGG 


CGAcTAACAT 


240 


TTTGGCAATC 


ATTGACGAAT 


AGAAGGAGAA 


AGTAAGTATG 


TCAAAACAAA 


TTAAATTTTC 


300 


ATCAGATGCC 


CGTTCAGCCA 


TGGTTCGTGG 


TGTCGATATC 


CTTGCAGACA 


CTGTTAAAGT 


360 


AACCTTGGGA 


CCAAAAGGTC 


GCAATGTCGT 


TCTTGAAAAG 


TCATTCGGTT 


CACCCTTGAT 


420 


TACCAATGAC 


GGTGTGACCA 


TTGCCAAACA 


AATCGAATTG 


GAAGACCATT 


TTGAAAATAT 


480 


GGGTGCTAAG 


TTAGTATCAG 


AAGTAGCTTC 


TAAAACCAAT 


GATATCGCAG 


GTGACGGAAC 


540 


TACGACTGCA 


ACAGTCTTGA 


CCCAAGCTAT 


CGTCCGTGAA 


GGAATCAAAA 


ACGTCACAGC 


600 


AGGTGCAAAT 


CCAATCGGTA 


TTCCTCCTGG 


GATTGAAACA 


GCAGTTGCCG 


CACCAGTTGA 


660 


AGCTTTGAAA 


AACAACGCCA 


TCCCTCTTGC 


CAATAAAGAA 


GCTATCGCTC 


AAGTTGCAGC 


720 
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CGTATCTTCT CCTTCTGAAA AAGTTGCTCA GTACATCTCT 
CAAAGACCCT CTCATCACCA TCGAAGACTC ACGTGGTATG 
AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC 
AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA 
CCAAGAAATC TTGCCACTTT T GGAAAG CAT TCTCCAAAGC 
TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT 
CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC 
AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA 
AGATCCGACA ATTGAAGCTC TTGGTCAAGC ACCGAGAGTG 
GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT 
GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTCAA 
CAAATTGTCA GGTGCTGTAG CGGTTATTAA GGTTCGAGCC 
AGAAATGAAA CTCCCCATTG AAGATGCCCT CAACGCTACT 
TATTGTTGCA GGTGCTGGAA CAGCTCTTGC CAATCTCATT 
ATTGACAGGA GATCAAGCAA CAGGACGTAA TATTGTTCTC 
TCGTCAAATT CCTCACAATG CAGGATTTGA AGGATCTATC 
TGCTCAGCTT GGTATAGGAT TTAACGCAGC AACTGGCCAG 
AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA 
CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA 
TCCAGCAATC GATCCAAGCA TGATGGGCGG GATGATGTAA 
TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT 
GTAGTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT 
GTTCAAAGCG ATAAAAATCC GTTTTTTGAA CTTTTCAAAC 
GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT 
TTGAAGGGCG TTCATAATCT TTTCTTTATC TTTCAGGAAG 
AATAGGATGA ACTTGCTTAA GATTGTCCTC AAT AAGTCCG 
ATTCTGAAAG TGAAACACCA AGACTTGATA GAGCTGATAG 
ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA 
ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTCTTGT 



GAAGCAATGG 
GAAACAGAGC 
ATGGTGACAG 
GACAAGAAAA 
AATCGTCCAC 
TTGAACAAGA 
CGTCGCAAAG 
GACCTTGGTC 
ACCGTGGACA 
CACCGTGTTG 
AAATTGCAAG 
GCAACTGAAA 
CGTGCAGCTC 
CCAGCTGTTC 
CCTGCTTTGC 
GTTATCGATC 
TGGGTTAACA 
CAAAATGCAC 
CCACAACCAG 
CCTTTCTATA 
TTATAGGCTC 
TTCGTCCTTT 
TTTCGAAAAC 
TTGGCGTTAG 
GTTTTAAAGA 
AAAAATTTCT 
TGATGTTTCA 
TGCATACGAA 
ATGACCTTCC 



AAAAAGTTGG 
TTGAAGTCGT 
ATAGCGAAAA 
TTTCCAATAT 
TCTTGATTAT 
TTCGTGGAAC 
CCATGCTTGA 
TTGAGTTGAA 
AAGATAGCAC 
CCGTTATCAA 
AACGCTTGGC 
CTGAGTTGAA 
TTGAAGAAGG 
CTACCTTGCA 
AAGAACCCGT 
CTTTGAAAAA 
TGATTGATCA 
CATCTGTAGC 
TAGCCCCAGC 
GAAAACAACT 
TTTCTCAACT 
CTTTTTTGAT 
CAAAGGCATT 
AATAGTGTAG 
CAGTCTGAAA 
CCGGTTCCTT 
AGTCTTGTGA 
AAGTAGGACG 
AGTAGCGCTT 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GATACCCTTG TATTCATGCG ATTTTCGATC CAATTCGTTC ATAATTTGAA CACCCACACC 2S20 
ACTCATAGCA CGGCTAAGAT GTTGTACAAT CTCAAAGCGA TCCAACACGA TTTTAGCATT 2580 
CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTCAGCC TAGAAATTTG AAAGCGAAGC 2640 
TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 
CGAACGGCTC TATCGTAGCG AACAAACTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2760 
AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTCCCCAA TGAAACTCAT CTTTCCCTTA 2820 
GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATGCTCAAAG 2880 
TGAAAGTCAT TGAGCTTGCG AATGACAGTT CAAGTTGAAA TGGCCAGCTG ATGGGCAATA 2940 
TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 
ATTTGGTCAT TTTTCTTTAC CAGGGGAGTC TCACCAACCA TCATTTTTGA ACAGTGATAC 3060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT C C TT T CCCCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGACCAGTT TTGTGATAAA ATGTAATTGT 3300 

TCCATATGAA TCTTTCTAAT GAGTTGTTTT GTCGCTTTTC ATTATAGGTC ATATGGGACT 33 60 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAACG GATTTACCCA CTACAAATAT 34 20 

TATAGAGCCG AAAATTCACA TCT AATAT AT GCAGACTACT TTCAAATGAA ATTAAAAAAA 3480 

TTATTAAAGG ATGACACAAA A G TTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3540 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACC ATTAAATGAG 3660 

CTGATAGATT AAATAGCATT TTCTCTGTTC AGAT A TT G TT TTTAAAATAT TGTACTAAAT 3720 

GATTCATGCT ATGTGGAAAT ACAAAAAAAT G T TTTT GATA CGAACTTGAC CTGTATTTTT 3780 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3840 

GCAATAAAGA CATTAAAAAA TAACAGTATA TCTATTT G TT TTATATATTT TACGAATTCT ;'900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3960 

TGTTTACACA ATTTATTTTA TACTACCAAA AAAGGTCAGG ATTTTGTTCC TCACCTTT3A 4C2C 

CAACTTTACC CATTCTTTAG TTCTACATAG CCCTTGTACC AAATGTTTAC ATAGGCTTCT 4080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 4140 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4200 

TCCAAGACAC GTTTTTCCCC ATCTCTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4260 
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TTCAGTGCTT CTTCATCCAG ATACTAGGGG CTACCCATAT TGCAATACTA ACAAGTTCCA 4 320 

TTATCACGAA TCATGCCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 4360 

GGTTCTCGAG TGACCCAACG ACGACCATCA CTTTCGCTAA CAAGTGTATG ATCCTTGACA 4440 

CCAATAATGG CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAACTCCC CTTCTTTTGG AAGCTTCATA 4560 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT ATTGTACCAT AAAATTACCC 4 620 

AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC TTGCTATATC 4680 

CGAAAAATCG TAGCCCTTTC CTCCTAAAAC TTGAGTTAAA CGCTGCTTCA GTTCCTATCC 4740 

TT C AT ACTTT CGGGCATACT TAGTATATTG CTTATCAACT TCCTTGAAGA TGAGTTCCTC 4 800 

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GCAATTTTAG CATCAAAATA 4860 

AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT TCCACGGCAC GAGCTGGAAG 4920 

TTTTCCCTCA TATTTTTTCA ATACTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4 980 

AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5040 

AGT C ACT AC A TAAGGTCCCT TCTCTCCTGA AAGTTGATTG GCATTGATGA TACCATAACC 5100 

CTACTCGCTA TCATTAATCC ACTTCTCTTC TTTAAGATTA GCAATGACTT GAGAAACGAT 5160 

GTTTTCATTA ATATCATATT TTTT CAG AT A TTCTCTCACC TCTTTTTCAG TACGTGCTTT 5220 

AAAGGATAAG TGGTAGAGGG CCACATTCTT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 5280 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCCAT AACATAAAAC GAACAATTGT 5340 

GTCTTCCCTG ATATAGCATT TGTCG 5360 
(2) INFORMATION FOR SEQ ID NO: 78: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE : nucleic acid 

(C) ST HANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTTCCAGAAA CAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCC TCCGCCCTAG CATTTTCTAA ATCTTTAATC 120 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATCCA TGACTTGTAA AATAATCAGA 240 
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CTCATTGTAG AAAAAATCAG GGTGAAGACA CCGAAGTTGC GGATAAAATA ACTAAAGTCA 300 

TCCGCATACC ATGTTTTTTT AAGTTTACTG AACATCTTTT AAAAGATACC CAACACTACG 360 

CAAAGTTTGC AAATTCTCTG CAAAAGTGGT TCCCTTTAAT TTCTTACGGA CTTTTGAAAC 420 

ATAGACTTCG ACAACCGAAA TCGTTGTATC ACTATCAAAT CCCCATAGAC GGTCAAAAAT 4 80 

CTGCGTCTTA CGCAAAATCA CATTTTCATT TTCAAGGAAA TAAACTAGTA AATCGAACTC 540 

TTTCCCCAGC AATTCGACAG GAGTATCTTC AACTTTAACG GTATTCGTTG ATAAATTAAC 600 

CACGATATTC CCATAAGTCA AGGTGTTTTC ATTAAACTTC CCTGAACGTT TGAGAAGGGC 660 

CTGAATCCGC ATTTTAAGTT CTTCTAGCTA GAAAGCTTTG GTCAGATAAT CATCCCCTCC 720 

CAGTTCAAAT CCATGTCCCT TGTCATCCAA ACTTTCCTTG GCAGTCATAA TCAGAACTGG 780 

TGTCGTAATT CCCTTTTCAC GCAATTCTTT TAAGACTTGG AAACCATTTT TTTCTCCCAA 840 

CATC AAATCC AGCAAAATCA AGTCATAGAC ACCACTCTCA GCTTCGTAGA GACCTTCTTC 900 

TCCATCAAAT ACCTGCATAA CATCCGCAAA ATCGTCTAAA AAGTCAAATA CTGAATTTGA 960 

CAGACCTAGG TCATCCTCAA CCAATAAGAT TTTTATCATG AGAAACTCCT CCTTATTAAA 1020 

ACTATTATAC CAAATTTGCC TTAAAAAAAA CTCAACTCTC TGCATTTTAC ATGACATAGC 1080 

TGACTTTTCT TTTTATTTTA GGCTT ATTTA TGCATTTCCG TATTGAAGAA CAA CTGCTTC 114 0 

GACTGCAGCT TTTTCACGGC TAATCAAGTC AACACGCGCT GCAATTTCCT TGATTCCCAT 1200 

ACCGATGTTA CGGCTAAGAG CAAGGTCAGA AACTTGCGGT TCAAAGAACT CCTTGTATTC 1260 

CGCCAAGCGT TGCTCAGTCT TAAATACATG AGCACGAAGG ATAACAAAGC TATCAAAGCT 1320 

CATATCTCCT CCAAGGGCTG CCTTAATCCA AGCCCAGTTT TCACCCGCCC AAGACCAAGC 1380 

TGTTTTCTGA GTTGCTTGAT GAGCTAGGAA TTGGTAATAC CAAGCAGACA AGTCCTGTGG 1440 

TTTGACCACA AATTTGTCCT TCCAAGAAGT AATCAGGTTT TGGATATTAT CCGCATCTCT 1500 

ACTGTATGCA AGAGCTCCTG CCAACTGGCG TTTAAAGACA GCATCTGTTG CGTGAGTATA 1560 

AGTATCAAGA TAAAGTGCTA ACAAGTCTTT AGTCTCATGA TGTTTCATCT CATTAATCAG 1620 

AACTTGTGAG CGAATAGCTG CTGGGAGTCC TGCAAGATTC TCCTTGTGTC TTGCGAAGAT 1680 

TTGGCTAGCG ACTTGACTAG CTTCTGCATC ATTTGAGCGA ATCATCATCG AAACAGCCAG 1740 

CTGACGAACC AATTCATCCT CATCTGATTC TCCGTCTTTA GCTTCAAAAC CAACACCCTC IS 00 

ATAGTTATGA CGAGCCAATT TAGCAACCAG TCCTTTGAAG GCTGTTTCAG CATCCGTTCC 1860 

TTCATCAATA AAGCGCTCAA GGGCTCAAAT CACTTGAGAA ACAGCTCAAA CCACCAGATA 1920 

AGACTCTTCC TTAGCAAGTT TATCAAGAAC TGGAAGCAAG TCTGCATAAG AAATGTGCCC 1980 

TGCCTCAGCC AACAAACGAC GTTCTTGAAC AATTTGCAGT TTGCTTGTGT TATCAAGTGT 2040 
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CTCTAGCTCA GCAAGAACAG 


CTGCTAACAA 


GTCTCCTTGA 


l nvi 1 Www 1 m\ 




2100 


AGTATTTTCA 


CTGTTGAGAC 


GAAGAGCTCC 


TTCATTTTCA 


f*r* a. a r* a a c Aft 


W • \*W\* • ^»ww w 


2160 


AGGGATTTCG 


ATACTTTCAG 


TTTCGAGTGT 


ATCAGGCAAG 


CCTTTCCAGT 


IWwlAl &WnM 


2220 


GGGCACCACC 


CAGACACGGT 


TCTTGTCTTC 


GTTCTCACCG 


ATviAAGAATT 


111 1 1 1 IW A **r* 


2280 


AATCTTCAAG 


ACATCATTTT 


CAACTTTAAC 


AGTAAGAACT 




rrTflTTCCAA 

WW iu 1 X \.V.f" 


2340 


CCAAGAATCC 


ATGAAGGCTG 


CGACATCACG 


TCCTGACGCT 




wV\ Iv V. Lnnnu 


2400 


GTCACTACCA 


ATGGTGTTGC 


TGTATTGGTG 


TTTTTCAAAG 


TAGGCGTGCA 


AAuLl 1 InlAi 


2460 


AAAATCACCA 


TCTCCTAGCC 


AACGGCGAAG 


CATGTGCATG 


AGACGGCTTC 


tTT rwwwA 1 M 


2520 


GACGATAGCG 


CCGTCAAAGA 


GTGTATTGAT 


TTCATCTGGA 


TGTTTAACTT 


r*r* a f^nvy^Ar 
WunLw luuftL 


2580 


AGACTGAACG 


CCATCAGTAG 


CGTCACGTTC 


AAGAGCAAGA 


GGTACTCCAC 


w~l\* i l iwwnA 


2640 


ATCTTCAAAG 


ATATTCCAGC 


TTGGTTCGAT 


GGTATCCACA 


CAGACGTATT 


CCATCATATT 


2700 


AGCGAAACTT 


TCATTGAGCC 


AAAGGTCATC 


CCACCATTTC 


ATACTCACCA 




* / vU 


CCATTGGTCA GCCAATTCAT 


GGGCCACAAC 


AAGGGCAACT 


TGTTGACGGC 


»■« K^^*tk K %*^W™*^ 




AGAGTTCTCA 


TCGACAACCA 


AGTAAACTTC 


ACGGTAGCTC 


ACAAGACCCC 




2 680 


AGCACCAGCT 


GAGAAGTCAG 


G AAGGGC G AT 


GTCGAGAGAT 


TGAGGAATTG 




2940 


TCCATAGTAA 


TCTTCGTAAA 


ACTCGATAGA 


GCGAACAGCG 


ATATCCAGTG 


ft/* A A i rriifi 


3000 


ATTTGAAAGT 


GGATGTGCTT 


TGCTTGAGTA 


GACACCTACC 


AGGGTACCAT 


TTTTACTTTT 


3060 


AGCGCTCACC 


CCTTGCAAAT 


CACCAGCAAC 


AAAGGCCAAC 


AAGTAAGAAG 


Awn 1\jLw\uO 


3120 


TGTTGTCTCA 


AACTTCCAGA 


TACCTGTTTC 


CTTACGGTTT 




TTTCTGGCAT 


3180 


GTTTGACAAG 


CCCAATTCAC 


CTTCTGCTTG 


GTCAAAGCGA 


AGAGAGACCT 


CAAAAGTTGC 


3240 


TTTGGCTTCA 


GGCTCATCCA 


CACATGGGAA 


AGCTTCGCGC 


CCAAAATGGC 


TCTCGAACTG 


3300 


AGTAGACAAG 


ACCTCCTTCT 


TGACTCCATC 


AACTCTATAA 


TAAGAAGGCT 


AAATCCCTGT 


3360 


CATGTTGTCT 


GTAATTTTAC 


CAGAAAAGGC 


AAGAACCAAT 


TCAACTTGAC 


CACCCTCAGC 


3420 


CAATTCGATA 


TGAAGGCCTT 


CATTGTCATG 


GTCAACTGTA 


AATGCACGAG 


CTTGACCTGC 


3480 


AACTTCTACA 


GAGGTGATTT 


CCAAATCTTT 


TTGGTGGAGG 


GAGATGCGGT 


CACTCTGTGC 


3540 


TTGACCAGTG 


ATGGTCACTT 


TCCCAGAAAA 


AGTCTTGGTC 


TCACGACTCA 


AATCTAAAAA 


3600 


TAAATCATAA 


TGTTCAGGAA 


CAAATTGCTT 


AATGGG 






3636 



(2) INFORMATION FOR SEQ ID NO: 79: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5066 base pairs 



I: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



ATACCGTGTA 


ATAATCGATT 


TTAGAGGTAC 


CATAAGCCAC 


CTCCTACAAA TAGAAACCGA 


60 


TATAAATCAA 


TGCCTTCCAC 


CCTTAGACTT 


CCCTAGTTCC TGTCTCAAGC GAAACATTTC 


120 


TTTCAAACAG 


GAATAAGTTA 


ACCAATTCAT 


ACCAATAGCT 


AGCAGAATAA AAAGAAACCA 


180 


AATCCCCCAT 


AACTTCATAT 


CTCTCACATT 


TCTCAAGACG 


GTATTGAAAA ACAGAACTCA 


240 


AACAACTGTC 


CAAGCAAGGC 


TAAAAAGAGA 


ATAGAAGGGG 


ATGTAAAACC AGTAAAAATA 


300 


ATAAAAAATT 


GGAAAAAACT 


TACTATTTCT 


CTTGGCCTTT 


TCAATCCAGT TATCAAAATA 


360 


AAACTACGGT 


GCTAAAAGTA 


AGAATTTAAA 


CAAATGTTCC 


ATCACCGACA TCCCCCCTTC 


420 


TTTTGATAGC 


GTTTTCTATT 


ATTTTATTAT 


ATCAAAAAAA TCCCGAACTG TCATTCCAGA 


480 


TTCTACTTTT 


TTATTTGCGT 


TTTCTTGCGA 


TGAGATGAAT 


CGGTGTTCCC TCAAAAACAA 


540 


AGGCCTTGCG 


GATTTGATTT 


TCCAAGAAAC 


GCAGGTAACA 


AAACTGCATG AGTTCTTCTT 


600 


CATTGACAAA 


GATGACAAAG 


GTTGGTGGTT 


TGGTTGCCAC 


TTGGCTCGCA TAGAAAATCT 


660 


TCAGACGTTT 


TCCTTTGTCT 


GTCGGTGTTG 


GGTTGATGGC 


AATCCCATCC ATGATGACAT 


720 


CGTTCAAGAC 


AGCTGATGGA 


ATACGTGTAT 


TTTGACTTTC 


GCTGATTTGC TTAATCATCT 


780 


CAGGAAGTTT 


CTGGAGACGT 


TGCTTGCTTA 


AAGCTGATAC 


AAAGATAATC GCTGCCTAAC 


840 


GCAGGTATTG 


GAACTGCTCA 


CGGATATCTT 


CTTCCCAGTT 


TTT CAT AGTG TGGTTATCTT 


900 


TTTCAACCCT 


ATCCCACTTG 


TTGACCACGA 


TAATCATCCC 


TTTACCAGCT TCATGGGCAA 


960 


ATCCTGCGAT 


ACGCTTGTCG 


TACTCACGAA 


TGCCTTCTTC 


CGCATTGATG ACCATCAAGA 


1020 


CCACATCTGA 


ACGGTCAATA 


GCACGCATGG 


CACGCATAAC 


AGAGTATTTC TCAGTATTTT 


1080 


CATAAACCTT 


ACCAGACTTA 


CGCATACCAG 


CCGTATCAAT 


CATGGTAAAC TCTTCACCAT 


1140 


CTGTATCTCT 


AAAGTGGGTA 


TCAATGGCAT 


CACGAGTTCT 


TCCAGCAACA GGACTAGCAA 


1200 


TAACACGGTC 


TTCTCCCAAG 


ATAGCATTGA 


TCAAGCTTGA 


TTTTCCAACG TTAGGACCAC 


1260 


CAATCAACCT 


AAACTTAATG 


ACATCTGGAT 


TTTCTTCCTC 


ATATTCATTT GGAAGATTTT 


1320 


CTACGATCGC 


ATCTAGCACA 


TCCCCTGTAC 


CGATTCCATG 


GACAGATGAG ATAGGCAATG 


1380 


GTTCACCCAA 


ACCGAGAGCA 


TAGAAATCAT 


ATATATCATT 


TCTCATCTCA GGGTTGTCCA 


1440 


CCTTGTTGAC 


TGCGAGGATA 


ACTGGTTTGT 


GGGTCTTATA 


AAGCTTACGA GCTACGTATT 


1500 


CGTCTGCATC 


AGTAATTCCT 


TCCTTACCAG 


ACACGACAAA 


AACGATAACA TCTGCTTCTT 


1560 
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CCATCGCAAT 


TTCTGCCTGG 


TGCTTGATTT 


w iTk * \+t\f\ 


ACCACCATCG 


ACATCATCAA 


1620 


TTCCTCCTGT 


ATCAATCATG 


Li AAAauAAL 




CTCACCCGTT 

^ A ^A\>^LU * A 


GCATAAATAC 


1660 


GGTCACGTGT 


CACTCCTTCC 


ACATCTTCTA 


CAATGCAwA^ 


I LuV. A A. ALU* 


i\»Uwn a 


1740 


TAAATACGGT 


TGATTTCCCA 


ACATTGGGAC 


MW«/<HI ft ^ ft ft 






1600 


TAATTTCTCA 


CTTTCTACAA 


TAA"i"ri"V'ri 


TGTTCAAGAT 


TTTTTCTAGT 


A VJAVJU A A WW A 


1660 


TCAGCTTGAC 


CAAACTGTTC 


TGCTAGGCGC 


TGACTCCACC 


TTGTGGTCGC 






CCATAGTCAG 


CCTGAACACG 


GTCATAAGCT 


TGGATTGCCT 


CAGTTGACTG 


TTCTTGGTAT 


x you 


TCTTCCTCAA 


AGACAACATT 


CTCTAGTGGC 


AGTCTCGGTT 


TCATATCATG 


ATGTTGATTT 




GGCACACCCA 


GTGCCATCCC 


AAAGACAGAA 


TAGGTGTAGT 


CAGGTAGGTT 


ft ft * n ^i">^t^nt 

AAAGACCiVl 


2 1UU 


GCCACTTCTT 


CAGACTTGTA 


TCGAACCAAA 


CCGATAATCA 


CACCACCATA 


GCCCAAGCTT 


Ol CA 

21bu 


TCAGCTGCCA 


ACAAGGCCTT 


TTGTCCAGCA 


AGAGCTGCAT 


CGACCGAACT 


& <TV» A ft ^ m ft ft 

AATCAAwAGA 


222Q 


CCTTCCACAC 


CTTGGGGTTG 


GAAGGTGTCG 


GTATGAAGTC GGGCTCCCTT 


TTCTGCTCGG 


o i an 
2280 


TTCAAATCTC 


CGACAAAGAG 


AAGCAAAACA 


GCAGACTGGC 


GAATGGCTTC 


TTGAGGTACC 


2340 


AATTCATACA 


AGGCATCTTT 


CTTCTCTTGA 


CTTCGTACCA 


CAATCACAGA 


GTAGGATTGG 


2 4 00 


AAATTCTTCC 


AAGATGATCC 


CATCTGGGCT 


CCTGTCAAAA 


TCTCATTTAA 


GTCTACTTGG 




GGAATTTCTT 


GCTCTTTAAA 


CCTGCGCACT 


GAACTATGAC 


CCTTCATCAA 


TTTAATGGTT 


4 3 2U 


TCTGTCATCG 


ACGGTTTACT 


CCTTCTAAAC 


GAGTCTCCTC 


AGCCAAATAA 


CGwATCCwTT 




CCATCACCCG 


TCTCGCTTCC 


CAGGTTTCCT 


CATTTCCATG 


TTTCACTTTC 


bLAAAATULT 




TCTCCAAATC 


TTCAAAGTTG 


AAGTTGGATG 


TGAAAAAGGT 


CGGTAAATTT 


TCCTCCATCC 


* ' uu 


GATATTGGAG 


AATGACCTCC 


AGGATTTCGT 


CACGCACCCA 


AACCGTTGAT 


luL 1V.UUV.UV. 


«. r OV 


CAATATCATC 


TAAAATCAGC 


ACCTCAGACA 


GCTTAATCTC 


ATCCACCAAG 


vi I V. 1 i nAV.n 1 


2620 


TGC CATC ACT 


GATAGCATTT 


TTGACATCAA 


TGAGAAAGCT 


AGGATAGTGG 


AGCAGAGTTG 


2860 


ATGAAACACC 


ACGTTTTTCT 


GATAAATCAT 


GAGCTAAGGC 


CCCCACCATG 


AAACTTTTAC 


2940 


CCACACCAAA 


GTCTCCATAT 


AAGTAAAGAC 


CTTTTCGAAT 


AGCTGGATAT 


TGCTCCACGA 


3000 


AGGCTAGTAG 


CTTTTCAAAA 


ACTGGTAAGC 


GCCCCAAATC 


ATCCAAGTCA 


ACTTGAGCCA 


3060 


AACTAGCTTT 


CTTGAGACTG 


GCTGGTAGAT 


TGATTAACTT 


GAGACGGTTC 


TTAATAGCCG 


3120 


CTTCTTTTTC 


AGCCGCGATT 


AGCTCAGGAG 


TTTCTTCATA 


TGAAACATCT 


GCATAACCAT 


3180 


GATTCTTAAC 


CAAAATCGGC 


TTGTAGCCTT 


TGGCAATATA 


ATCCGTATCC 


CCACGGAGAA 


3240 


ACTTGTCACG 


CTCGGTGATG 


TACTGATTAA 


ACTTGGAGAT 


ACTGCGATTT 


AATTCCTTTG 


3300 
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GAGTTAAGGA 


TTCTTGCTGG 


ATAAAGGCCG 


CAACATCAGG 


GTCCTTCATG 


ATTTTCTGGA 


3360 


CCAAATCTTG 


ATAATAAAAA 


CGGCTGGGTT 


GACGTTTGAG 


TACGTCTCCG 


ACACTTTCCA 


3420 


TCTAATCTCC 


TCCTTTTTCT 


AATCGAGCTA 


ATAGrrCTTG 


CTTCTTACGT 


TCTAGTTCCA 


3480 


GACGAGTTTC 


CTCGCTGGTT 


TCATTCTTAT 


ATTCAGGATT 


ACTCCATTTA 


GGAACATTGG 


3540 


TTTTTTCTGC 


GGCAGTCTGA 


TTCTGTTTTT 


GTGTTTTTGC 


TTTCTGCCCT 


CCATCACGAA 


3600 


TTCGTAAAAC 


GGCCTCTTCT 


GCCGAATGAA 


TCTTTTGATA 


GGCATAGTCA 


TTGGCTACCT 


3 660 


TCATCGCATA 


TTTCTCATTG 


ATATTTGCCG 


AATCCACCTT 


ATTAAAGGTC 


AATAAGAGAA 


3720 


TAATATTGAT 


GACTTCGTCC 


AGTAAGCCCA 


AGCCAGCCAT 


CTGTTGCAAG 


AGTTCTCTTT 


3780 


CTGTTTGGGT 


AATGCTTCCC 


TTGCGTGTTT 


GCTTGATTTC 


TGCTAAGAAC 


TGCACGGCAG 


3840 


TTTTACTTTT 


AGCTTCTTTG 


ATAATGGTCG 


CTTCCTTAAG 


ACTAAAGTCA 


GAGGAAACTG 


3900 


GTTTTTGAGC 


AATTTTTTCA 


CGCATGCGTT 


TGGTTGAAAT 


AACCTGGGAA 


ACAGCTGTTG 


3960 


ACTTGGCCAA 


TTGATAGCTT 


TCAAACCAAG 


TCCATTTCTT 


CTCCTCGGCA 


ATAGCAAAGA 


4020 


GGTTTAACAC 


ATCGGACTGC 


TCATCCGCAA 


AACGAAGTCC 


ATCTCGAGCC 


ATCAGCTGGC 


4080 


GAAAATGTTC 


CAAGTCAAAA 


TCATTGCCCA 


CTTTCTTCTT 


GAGACCAAGG 


TCTTCTTGAC 


4140 


TGCCTAGTTC 


TGCCAATTCT 


GGAAAGACTT 


GATTGACTGA 


G AC AGGT ATT 


TCTTCACCAT 


4200 


CAGCACTTTC 


AACTTTCAAA 


TCCTCCACAC 


CTACATCGCC 


AATCTTTTTC 


TCTAAGAGTC 


4260 


TCCGATAAAC 


AGGATGCCCC 


AAGAAGTCTT 


GACTAGATAG 


AGGAGCATGG 


AGGGCTAGCT 


4320 


GATAAACATC 


ACCCTTTTGA 


TAGAGGGTCA 


AGAGATTAAA 


AGCAGATAAC 


ATTTTCAATG 


4380 


ATTTTATCAG 


TCTATCCATC 


CCAAAGTTGA 


GATGGTTGAG 


AATGCTTGAA 


AAAAGATATT 


4440 


CCTTTCTACC 


ATTATCCCAA 


AAACTGATTG 


TATAAAGATA 


AAGGCTCAGT 


GCCTCCTGAC 


4500 


CGATAATCGG 


GAGGTAGCAC 


TGTACCAGAG 


ATGAGGTATC 


TTGCGACACC 


CGATTATTCT 


4560 


TTAGATAAGA 


AAAACGGTCA 


ATTGGCTTCA 


TTTATCTTTC 


CTTTTTCTTT 


TTAGAGGACT 


4620 


GGGTGATTTG 


TTGGAGCAAG 


CTCTCTAACT 


CACTGACATC 


CTTAAAACTA 


CCATAGACAC 


4680 


TAGCAAAACG 


TACATAGGTA 


ATCTCGTCCA 


ATTCACCCAA 


CTCCTCCATG 


ACGAGTGAAC 


4740 


CAATGTCCTC 


ACTTTGAATT 


TCATTTTCAT 


TTCCACCACG 


GAGTTTCTGT 


TCGATACGAT 


4800 


TGACTACCAT 


GTTGATTTCA 


TCACTTGACA 


CAGGACGTTT 


CTGGGCTGAG 


CGGATAATCC 


4860 


CATTAAAGAT 


TTTATCTCTG 


GAGAATTGTT 


CCCGTGTGCC 


ATCTTTTTTA 


ACAACCACTA 


4920 


AGGTTCTTTC 


TTCTACTCGT 


TCGTAGGTTG 


TAAAACGGTG 


TTGGCATTCG 


TCGCACTCAC 


4960 


GTCTTCTACG 


AATGGTGTTC 


CCTTCTTCTG 


CTTGGCGACT 


ATCGATAACA 


CTTGACTTGG 


5040 


TAGCCCCACA 


TTTTGGACAG 


GGTACC 








5066 
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12) INFORMATION FOR SEQ ID NO: 80: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9607, base pairs 
IB) TYPE: nucleic acid 
)) <C) STRAND EDN ESS : double 

<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



CACTTGAAGT 


ATTTGAAACA 


GCTATGGAAA 


ACATCATGCC 


TGTACTTGAA 


GTACGTGCAC 




GTCCTGTTGG 


TGGTTCTAAC 


TACCAAGTCC 


CACTTGAAGT 


TCGTCCAGAA 


CGTCGTACAA 




CACTTGGACT 


TCGTTGGTTG 


GTAACAATCG 


CTCGTCTTCG 


TGGTGAACAC 


ACAATGCAAG 


i an 


ACCGTCTTGC 


AAAAGAAATC 


TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 




GTGAAGATAC 


TCACCGTATG 


GCTGAAGCTA 


ACCCTGCATT 


CGCACACTTC CCTTGGTAAG 




ATAGGATGCG 


AAAGCGTTAA 


GAAACTCCCA 


GAGAAAATAG 


GGAATCGAAG 


CAGCTTGCGG 




TTGCAACCAA 


TGAGATTCAT 


CTTTTTCTCC 


AGACTTTTAG 


CTTGAGCTCA 


ACTAAATCAT 




CATGCTAGCA 


ACGGTAAGGA 


TGCAAGGTAA 


AAATAGGAAA 


CTGACGCAGT 


ATTCGACGAA 


4 80 


TACAAGGAGT 


TTTATCTTTT 


TCACGCAGCA 


TCCCGTTCCA 


CCTCACATCC 


GCT AACT AAC 


540 


TTTAGCCCGG 


GTTCAAATTA 


GCTAAATCGA 


TTACTATTAG 


CTATAACTCA 


GCTTACCATC 


600 


TCGTAAGTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGAGAA 


CCGCTACGTC 


CTGCCTATCC 


660 


CTTTTTATTA 


AAATCGTGTT 


AT AAT AG AAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATCGCAC 


GCGAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTGAGCGT 


ATTCTTTACT 


ACACTGCTAA 


AATCCACAAA 


840 


ATCGGTGAAA 


CTCACGAAGG 


TGCGTCACAA 


ATGGACTGGA 


TGGAGCAAGA 


GCAAGAACGT 


900 


GGTATCACGA 


TCACATCTGC 


TGCGACGACA 


CCTCAATCGA 


ACAACCACCG 


CCTAAACATC 


960 


ATCGACACAC 


CAGGACACCT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCGTGTATTG 


1020 


GATGCTCCCG 


TTACCGTTCT 


TGACTCACAA 


TCAGCTGTTG 


AGCCTCAAAC 


TGAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATCCACAAA 


1140 


ATCGGTGCTG 


ACTTCCTTTA 


CTCTCTAAGC 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


1200 


CACCCAATCC 


AATTGCCAAT 


CGGTTCTGAA 


GATGACTTCC 


GTGGTATCAT 


TGACTTGATC 


1260 


AAGATGAAAG 


CTGAAATCTA 


TACTAACGAC 


CTTGGTACGG 


ATATCCTTGA 


AGAAGACATC 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTGCT 


1380 
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CAAACTGACG AAGAATTCAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 
TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCACT ATTGTGTGCT 1500 
TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 
AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TCAAGAAATT 1620 
CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 
TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740 
GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATCCACGCT 1800 
AACAGCCGTC AAGAAATCGA CACTGTTTAC TCACCTGATA TCGCTCCTGC CGTTGGTTTC I860 
AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 
ATCAACGTTC CAGAACCAGT TATCCAATTC ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 
GACAAGATGG GTATCGCCCT TCAAAAATTG CCTGAAGAAG ATCCAACATT CCCCGTTCAA 2040 
ACAAACCTTG AAACTGGTGA AACAGTTATC TCAGCTATGG CTGAACTTCA CCTTGACGTC 2100 
CTTCTTGATC GTATGCGTCG TCAGTTCAAA GTTGAACCCA ACGTAGGTGC TCCTCAAGTA 2160 
TCTTACCCTG AAACATTCCG CGCTTCTACT CAACCACGTG GATTCTTCAA ACCTCAGTCT 2220 
GGTGGTAAAG GTCAATTCGG TGATCTATGG ATTGAATTTA CTCCAAACGA AGAACGTAAA 2280 

GGATTCGAAT TCGAAAACGC AATCGTCGGT GCTGTGGTTC CTCGTGAATT TATCCCAGCC 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2400 

CACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTCCC 2460 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCACCTATC 2520 

CTTCAACCAA TGATGCTTGT AACAATCACT CTTCCACAAC AAAACCTTGG TGATGTTATG 2S80 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA CATGGTATGG AAGCACACGG TAACAGCCAA 2640 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATCTTCGGTT ACGCAACACT TCTTCGTTCT 2? 00 

CCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2760 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAaTTT TGATGAATCA 2940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACACC 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAACAA TTTTCAAATC ACCTCTTCCA 3180 
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CTCCTACTAT 


AGGGAGGCTT 


TCTCTGAAGG 


TCCATGTGCT 


AoTCaAWj I 1 




3240 


TTATCAAAAT 


TTCCAAGTTC 


GTGCTGAGGG 


TAATCAGL. 1A 


I i 1A1 1 wWJ 




3300 


GATTGATAAG 


GAACATTTTA 


AGAAGAATCA 


TCTTCCTAAT 


TTAGCCAAAU 


AALi (unAAA 


3360 


GTTTGGTTTT 


CCAACTTTTA 


ACTGTCAAGT 


CGAGAAGAAT 


GATGTCCTGA 




3420 


CCAAGAGGCC 


TTTCATGCTG 


AAAATGAGCA 


GATTGTTCAA 


GCTCCCAATu 




3480 


CCGTGCTATG 


GAACAACTGG 


AGCAGATGGC 


ACCTCCTCCA 


GCGGAAGAGA 


AACCAGCUi l 




TGATTTTCAA 


GCGAAAAAAG 


CTGCAGCTAA 


ACCCAAGCTG 


GATAAGGCGG 


AGATTACTCC 


1 £ AA 


TATGATCGAA 


GTGACGACAG 


AGGAAAATCG 


TCTGGTATTT 


GAAGGGGTTG 


TTTTTGATGT 


JOOU 


GCAGCAAAAA 


GTGACTAGAA 


CAGGTCGTGT 


TTTAATCAAC 


TTTAAAATGA 


CGGACTATAC 




TTCAAGTTTP 


TCTATGCAAA 


ACTGGGTTAA 


AAACGAGGAA 


CAGGCCCAGA 


AGTTTGACCT 


3780 


CATCAAGAAG 


AATTCTTGGC 


TCCGAGTTCG 


AGGGAATGTG 


GAGATGAATA 


ACTTCACACG 


1 O VI A 


CGATTTCACT 


ATGAACGTAC 


AGGATCTGCA 


GGAAGTTGTT 


CACTATGAGC 


GGAAGGATTT 


3900 


GATGCCAGAA 


GGTGAGCGTC 


GGGTTGAGTT 


TCATCCTCAT 


ACTAACATGT 


CGACTATGGA 


3960 


TGCTTTGCCA 


GACGTCGAAG 


AGATTGTTGC 


AACAGCTGCT 


AAGTGGGGAC 


ACAAGGCGGT 


4020 


TGCTATCACG 


GACCATGGGA 


ATGTCCAGTC 


CTTTCCACAT 


GGCTATAAGG 


CGGCTAAGAA 


4080 


AGCGGGAATC 


CAGCTGATCT 


ATGCGATGGA 


AGCCAATATC 


GTGGAGGACC 


GTGTCCCTAT 


4140 


CCTCTATAAC 


GAAGTGGAGA 


TGGACTTGTC 


ACAACCAACC 


TACGTGGTCT 


TTGACGTGGA 


A 1 Art 


AACGACGGGA 


CTTTCAGCTA 


TCTATAATGA 


CTTGATTCAG 


CTTGCGGCTT 


CTAAGnTGTA 


A 


CAAGGGGAAT 


GTTATTCCTG 


AATTTGATGA 


ATTTATCAAT 


CCTGGCCATC 


CCil^* li-AG C 


4JiU 


CTTTACTACA 


GAGTTAACTG 


GAATTACAGA 


TGATCATGTC 


AAAAATGCCA 




d 1 AO 


ACAAGTTTTG 


CAAGAATTCC 


AAGAATTTTG 


CAAGGATACG 


GTCCTAGTTo 






TACCTTTGAC 


CTTGGCTTTA 


TGAATGCTAA 


TTATGAGCGG 


CATGATCTTC 


CAAAGATTAG 


4500 


TCAGCCAGTT 


ATTGATACCC 


TGGAGTTTGC 


TAGAAACCTC 


TATCCTGAGT 


ATAAACGCCA 


4S60 


TGGTTTCGGG 


CCTTTGACCA 


AGCGTTTTCG 


TGTGGCCTTG 


GAACATCACC 


ACATGGCCAA 


4620 


CTACGATGCG 


GAAGCGACTG 


GTCGTCTGCT 


TTTCATCTTT 


ATCAAAGAGG 


TAGCAGAAAA 


4680 


ACATGCTCTG 


ACCGATTTAG 


CTAGACTCAA 


CATTGATCTA 


ATCAGTCCAG 


ATTCTTACAA 


4740 


AAAAGCTCGG 


ATCAAGCATG 


CGACCATCTA 


TGTCAAGAAT 


CAGGTAGGTC 


TAAAAAATAT 


4800 


CTTTAAGCTG 


GTTTCCTTGT 


CTAATACCAA 


GTATTTTGAA 


GGAGTGCCAC 


CCATTCCGAG 


4860 


AACGGTTCTA 


GATGCCCATC 


GAGAGGGCTT 


GATTTTAGGT 


TCAGCCTGTT 


CAGAGGGTGA 


4920 
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AGTTTTTGAC CTGGTCCTTT CTCAAGGTGT GGATGCGGCG GTTGAGCTGG CCAACTATTA 4980 

TCATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGGTTG GAGACCGCCT S100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA G TTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCCT AAACTGGTTA TTGAAAACAC 5340 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCCTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 5460 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATCCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 5640 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC ACTGTCACTA 5700 

CAGTGAGTTT ATCACAGATG GTTCCTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGCAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TCGCAAGTTT TATCGTCATG CAGAACTACA ACGCCTCGCT CAAGGACCGG CGGGTGTCAA 6060 

GCGGACAACA GCCCAACACC CGGGGGGAAT C G TT G TTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATCTACTGG GACATGATGA 6240 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6300 

GCATGACCAA GGCCTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6360 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 6420 

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 64 00 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTCCCAGA 6720 
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GTGGTATATC 


GAATCCTGTG 


UiAAnAl Inn 


Vj i AI.A Iwi IV. 


CCTAAGGCCC 


ATGCGGCAGC 


6760 


CTACGTTATG 


ATGGCCTTGC 


GTGTAGV.HA 


t l lVMuu I * 




TTT ATT ACTA 


6840 


CTGTGCTTAC 


TTCTCCATTC 


Ui^LTAAGGv. 


i l l A IV 




CTGCGGGf 1*1* 


6900 


GGAGCTCATC 


AAGCGCAGAA 


TGGAAGAAAT 


CTCTGnAAAn 


WwUflHi *^f%ft*^% 


ATGAAGCCTC 


6960 


TAATGTGGAA 


ATCGATCTCT 


ATACAACTCT 


ft M ft AH^p^^^Pf^ 


A ATYt Aft 


tW.AACGAGG- 


7020 


TTTCAAGTTT 


GGTAAATTAG 


ATCTCTACTG 


TAGTCAGGCG 


AUtunu rTVA. 


IVn l v»unLuo 


7080 


GGATACCCTT 


ATCCCACCAT 


TTGTAGCAAT 


CGATGGTCTG 


fV» • f» »/■» » A Of* 
uununviAAVVi 


1 1 v^wnnuWt 


7140 


ACTGGTGCGC 


CCGCGTGAAG 


AGGGAGAATT 


CCTCTCTAAA 


ft a ft ft. n f+tm ft ^ 

ACAGAACTAC 




7200 


TGGACTCTCA 


TCAACCTTGG 


TTGAAAAGAT 


GGATGAGATG 


GGTATTCTTG 




(<QU 


AGAGGATAAC 


CAGTTGAGTT 


TGTTTGATGA 


GTTGTTTTAA 


AAAATTXiCTT 


AA I AA IV. 1 A J 


*7T5ft 


TAAAAGAGGC 


TAACGTATAT 


CCAATAGATT 


TACATTAGCT 


TTCTTTTTTC 


Mwn ft ft ft tntj 

TTAAAATAGT 


/JoU 


CTATGGAAAG 


AGGGTGAGAG 


TATGTCAAAG 


ATGAGTATAA 


GCATCCGTCT 


J— /■■» ft ft J Kft^H ft ^ 

GGATAGTGAKj 


/ *QU 


GTTAAGGAGC 


AGGCCCAACA 


GGTGTTTAGT 


AATCTGGGAA 


TGGATATGAC 


AACAGCTATT 




AATATTTTCC 


TTCGTCAGGC 


AATTCAATAT 


CAGGGATTAC 


CTTTTGATGT 


7AGACTAGAC 


•)C£A 

/ 3 O U 


GAAAATCCGA 


AGTTGCTCCA 


AGCGTTAACG 


GATTTAGACC 


AAAATCGTAA 


T ATwAvjt. Unu 


7 620 


TCTTTTGAAT 


CAGTCTCAGA 


TTTGATGGAG 


GACTTACGTG 


MM ft ft ^ ft M**!*^ 

CTTAAGATTC 


V* 1 1 A l\JMnn 




ACAGTTTAAA 


AAAGATTTTA 


ACTTGGCTAT 


GAAGCGTGGT 


TTGAAGGCAv* 


AA I i All AVjM 


7740 


AGAACTTTTG 


AATTTTCTGG 


TTCAAGAAAA 


AGAACATCCT 


GCCAGAAATU. 


V* * 1 wn t ii> 


7800 


ATTGACGGCA 


TCCAAGCATT 


TTCAAGGAGT 


TCGTGAATGC 


CATACCCAwU 




7860 


TTTGGTTTAT 


AAAGTAGACA 


AGTCGGAATT 


GATTTTAAA 1 


T*rr*r**mAnfiA 


gAGGCAGTCA 


7920 


CAGTGATTTA 


TTTTAATCTA 


III i AAGGGG 


Ij I iVI V- A i V»A 


A Af^rJftAflAAT 
AAW * n/\«nn * 


ATTTGCGGAA 


7980 


GATAAGCCGG 


CTAAGAACGT 


ATTTGAATAT 


CAATTAGAAC 


TTGCTGATCG 


TACAATTCTT 


8040 


CTATCGACAG 


CACTCTTGTC 


AGGTGCTATT 


GCTTTACCAG 


GAATCTTTTC 


TCCTTTGAAA 


8100 


GAAAAATAAA 


AATAGAAAAG 


AGAAAACAGA 


ATGGTTTTAC 


CAAATTTTAA 


AGAAAATCTA 


8160 


GAAAAATATG 


CGAAATTGTT 


GGTTGCGAAC 


GGAATTAACG 


TGCAACCTGG 


TCACACTTTC 


8220 


GCTCTCTCTA 


TTGATCTGGA 


GCAACGTGAA 


TTGGCACATC 


TAATCGTGAA 


AGAAGCTTAT 


8280 


GCCTTGGGTG 


CGCATGAGGT 


CATCGTTCAC 


TGGACAGATG 


ATGTGATTAA 


CCGTGAGAAA 


8340 


TTCCTCCATG 


CCCCGATGGA 


GCCTTTGGAC 


AATGTGCCAG 


AATACAAGAT 


TGCTGAGATG 


8400 


AACTATCTCT 


TGGAGAATAA 


GGCTAGCCGT 


CTTGGAGTTC 


GTTCATCTGA 


TCCACCTGCC 


8460 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATCGG ACTTGCCATG 8S20 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTCCCA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAACAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 8760 

TCACCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 * 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTCCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTCCAG ATGGTTATGT CACTTCTACA 8940 

AAACCCCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CCCGTGCCTT GCGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCCTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240 

GAAGCTCCAG GGCTTAACCG TTCAGATGTT CACCTAGACT TTATGATTGG TTCTAACCAA 9300 

ATCGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCC TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGCA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AAT G TTTCTC 9480 

GGTTGGATCG GAGCCTTTCT AGCTCACTTG CTCTTTGGAA CTTGGGCCCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTACGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 120 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 240 
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ACTATGCGTT 


TTATTGTGGG 


AAGGTTTACT 


CCATTTTCTC 


CTGAAATTGA 


GTTTTTGTCC 


300 


AGCCTCTGTT 


TTTAGGGTTG 


CTAAGAAAAT 


AATGTCATGT 


CCTGAATATT 


TGTAAATCAG 


360 


TCACCACACA 


GAACGATACT 


CTTCGAAAAT 


CTCTTCACAT 


CATGTCAGCT 


TCGTCTTTCC 


420 


GTATATATGT 


GACTGACTTC 


ATCAGTTCTA 


TCTACAACCT 


CAAAACAGTG 


TTTCGAGCTG 


480 


ACTTGATCAA 


TTTTCAAATC 


TCTACTTTGA 


GCAAGCTGAG 


ACTAGCTTCC 


TATTTGATTT 


S40 


TCATTGAATA 


TCAGAAACCC 


ATTCTCCATC 


AAATAATTCG 


ACTGCGTCTA 


ATAATTTTTG 


600 


ATCTGGCACG 


GTGTCTGAAA 


TAAAGCTTGT GTATTTGGAG AGGGGATTAA 


TTTTAAAAAA 


660 


TCCACTCTTG 


TAAAATTTAG 


AACTATCAAT 


CAGTAAGATG 


/"■*P*F*I*f* A*WW? 


Va & A 4 V* * * 


720 


AATATTCTTT 


TTTCAAATAG 


CTTGGCTGAG 


AGAAGCTTCA 


I /WW— 1 #% A 1 


GGTCATCAAT 


780 


ACCTCTTGCT 


GAACAAAATG 


CTAAATCGAT 


ATTAAAATGA 




AA & A A A «aW A * 


840 


ATCATAGTTG 


ACCACGGAAC 


AGGATTGATG 


TTTGACCTCG 




TAAAGATTTT 


900 


GGAGCTATCT 


TTAACAGTTT 


CAGATAGGGT 


TTGTGCAGTA 






960 


AATCAAATTA 


TCAAGTTCAG 


AAAGATAGGG 


ACAGAGTTCG 




*p & f"*r l. fi a a*t^ 


1020 


TAGATAGATA 


C AC AT A C C AG 


ACC G AAT AAA 


VjTl. I I 1 


AfiACTACCGA 




1080 


TTGCCTAGTA 


CTTTCTCCTT 


CACGTATTTG 


ATGAGAAACT 


TCAATTGTGT 


TCATACACCA 


1140 


CAGGGTCACG 


TATCCGTGCT 


TTCTTTTGAT 


AAGACCTTGA 


TTTTCTAAGA 


AAATTAAATC 


1200 


ACGACGTAAG 


CTACTTGTGC 


TGGAGAAAGT 


GATTTCTGCC 


AGCTCTTTTA 


CCGCAATTCT 


1260 


TTTTTTCTTT 


TTGATAATTT 


CAATCAATTC 


AAGTACACGT 


TCATCTTTTA 


TCATAAGCTC 


1320 


CTCCTAATTT 


ATCATTTCAA 


CTATATTATA 


GCACAAATTG 


GAGGAATTTG 


AAXTATTTTT 


1380 


ATGAATATTC 


GGTTAACATT 


TGAACATTAT 


TCAAGTAACC 


GTTCACATAT 


TGAAAAAATA 


1440 


AAACGTGGGG 


ATTATAATAA 


AGTTAATCmA 


GGACGAAGAG 


AGAAGAAAAA 


TGGAAGCGGT 


1500 


TTTAGCAATA 


GATTTAGGTG 


CGACTTCTGG 


AAGAGCAATC 


GTTGGTTACC 


TTTCTGAAAA 


1S60 


TAAACTAGTA 


ATGGAAGAAA 


TAAATCGCTT 


TTCTAATCTA 


CCTATTAGAG 


TAAAAGGGCA 


1620 


TTTATCTTGG 


GATATTGACT 


TTCTACTAGC 


TAAAATTCTT 


CAAAGTATCC 


GCTTGGCTAA 


1680 


TACTAGTTAC 


AAGATTTTAT 


CTATCGGTAT 


TGACACATGG 


GCACTTCATT 


TTGGACTGAT 


1740 


TGATAATCAA 


GGTAAGCTGT 


TATTACAACC 


TGTTCATTAT 


CGTGATGAAA 


GAACAAAGGG 


1800 


AGTGTTAAAG 


GAAATATCTG 


AAATGACTGA 


ATTAGAAAAA 


CTGTATTCAG 


AGACACGAAA 


1860 


TCAGATTATG 


GAGATAAATA 


CCTTCTTTCA 


ACTCTTTAAG 


GCACGTCAAC 


AATCTC CTG A 


1920 


CTCTTTCTAT 


AAGACCAATA 


AGATTCTTTT 


AATGCCAGAT 


TTGTTTAATT 


ATCTCTTGAC 


1980 
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AGCTAAGTTT 


GCTACACAAA 


AAAGCATTGC 


TTCAACAACT 


CAATTATTTG 


ATCCTAGGAG 


2040 


TCAAAATTCG 


AATCAGAATA 


TCTTAAAACT 


ATTTGAATTG 


GATTCATCTT 


TACTTCCTGA 


2100 


AATTGTTTCA 


GAGGGAAATG 


TTCTTGGAAG 


GATAAAAGAG 


GACTATGGTT 


TAGGCGATAT 


2160 


TCCTGTTGTG 


AATGTTTGTA 


GTCATGATAC 


AGCAAGCGCG 


ATTGTCTCAG 


TACCTAAGAC 


2220 


AGAAGGTAGT 


TTATTTATTT 


CATCAGGTAC 


TTGGTCTTTG 


GTTGGACTCG 


AACTTACTTC 


2280 


ACCGATTCTT 


ACTACCGAAT 


CCTTCAGTTA 


TGCATTTACA 


AATGAACTCG 


GTAAAGATGC 


2340 


AGTGATTACA 


TTTCTGAAGA 


ATTGTACAGG 


GTTGTGGATC 


ATAGAGGAAC 


TAAGACGTTC 


2400 


ATTTGAACGA 


AGAGGGAAAG 


CCTATTCTTT 


TGATGATATT 


AGGACAATGG 


TGGAGAAAGA 


2460 


AAAAGAAAAT 


CTTCCTCTGA 


TTGATACTGA 


ATCAACTGAA 


TTTGCAACAG 


AATCTGATAT 


2520 


GCACAAGACT 


TTGACAGAAT 


ATCTAGCTTA 


TCATCATGAA 


ACTAGAGAGT 


GGACAGATGG 


2580 


ACAACTATTT 


AAGATTGTTT 


ATGAAAGCCT 


AGCTGAAACG 


TATAGGAAAG 


CCATAGAGTT 


2640 


ACTAGAACAA 


CTAACTCATA 


AGCTTTATAA 


GAGGATATAT 


GTGATTGGAG 


GAGGTGCTAC 


2700 


ACCCACTTAC 


TTTAACCAAA 


TGATTGCTGA 


TAGAACTGGT 


AAAGAGGTTC 


TTACAGGTTT 


2760 


GACTGAGGCT 


ACAGCTGTGG 


GGAATATTGT 


TGTGCAGCTC 


ATAGCTATGG 


G AC AATT AAA 


2820 


AGGGATGGAA 


GAGGCTCACC 


ATGTTATTGA 


GGAGTTTCTA 


CAATTAGAGA 


GTTATTACTC 


2880 


CCAAAAGAAT 


TAAAAAGATT 


GAGAGTTTGT 


AAATTTGCCT 


CCCTCCCCCT 


TCTTAGCTTT 


2940 


TGTGCAGGAA 


GGGGGGATAA 


TTGGTGAATT 


CAAAAATATT 


TAGTGTTTTG 


ATATGAGGAG 


3000 


GACAAGGATG 


TCAGATGTAA 


AACAAGAATT 


AATTAAATAT 


GGTAACAACC 


TAGTAGAAAC 


3060 


AGATTTGACG 


AAAGGAACAG 


GTGGGAATCT 


CAGCGTTTTC 


GATCGTGAAA 


AACAATTGAT 


3120 


GCCAATTACC 


CCGTCGGGTA 


TTGATTTCTT 


TGAAATCAAA 


GAATCCGATA 


TTGTAGTGAT 


3180 


GGATATTAAT 


GGAAATGTTG 


TAGAGGGAGA 


ACCCTTGCCA 


TCTAGCGAAT 


GGTATATGCA 


3240 


TTTGATTCAA 


TATCAAACTC 


GTCATGATAT 


CGATGCAATT 


ATCCATGCTC 


ATACAACTTA 


3300 


TGCAACAGTA 


TTAGCTTGTC 


TCAGAGAACC 


ACTTCCAGCG 


AGTCATTATA 


TGATTGCAGT 


3360 


GGCAGGGAAA 


GATGTTCGCG 


TAGCTGAGTA 


TGCAACATAT 


GGCACGAAAG 


AATTGGCTGT 


3420 


GAATGCAGCT 


AAAGCAATGG 


AAGGTCGTAG 


ACCAGTTTTA 


CTACCGAATC 


ATGGAATTTT 


3480 


AGCAGGTGCA 


CAAAATTTAT 


TGAATGCATT 


TAATATTGTT 


GAAGAACTTG 


AATATTGTGC 


3540 


AAAAATTTAT 


TGTTTAGCTA 


AGAATTTTGG 


AGAGCCAGTA 


GTTCTTCCTG 


ATGAGGAGAT 


3600 


GGAATTCATC 


GCAGAAAAAT 


TTAAAACATA 


CGGTCAGAGA 


AAATAGGGAG 


GATATTAATG 


3660 


TTAAAACATA 


TACCGAAAAA 


TATTTCTCCA 


GATTTATTGA 


ACACTTTAAT 


GGAAATGGGA 


3720 


CATGGAGATG 


AAATAGTATT 


AGCTGACGCG 


AATTATCCTT 


CTGCCTCATG 


TGCAAATAAG 


3780 
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CTAATTCCTT 


GTGATGGTGT 


AAATATTCCA 


GAATTATTAG 


ATTCCATTCT 


CTATTTAATG 


3840 


CCATTAGATA 


GTTACCTCGA 


TACTTCAATT 


CAGTTTATGA 


ACGTTCrTTC 


GGGTGATGAT 


3900 


ATTCCTAAGA 


TATGGGGTAC 


CTATAGACAG 


ATGATTGAAG 


CTCATGGTAC 


AGATCTTAAA 


3960 


ACGATTACTT 


ATCTTAGAAG 


AGAAGACTTT 


TATGAACGTA 


GTAAGAAAGC 


TTATGCTATT 


4020 


GTTGCTACAG 


GAGAAACTTC 


ACTTTATGCT 


AATATTATCC 


TTAAGAAAGG 


AGTACTTGTT 


4080 


GAAAGAGAAA 


ATGTTCAATA 


GAGGAATTTT 


AGTTCCCAGT 


CATGGTAATT 


TTGCTAGCGG 


4140 


AGCTCTCATG 


ACCCCAGAAA 


TGTTTGTTGC 


TGAGACAACA 


AATGATAGAG 


TTAGGACATT 


4200 


AGGTTTGATG 


CCTGGAGAGA 


ATATTGTAGA 


GTTTGAGCAT 


TATTTTAAAA 


ATCAAGTGGA 


4260 


TGAACTGTTA 


GACTCAAATC 


AAGAGGTTAT 


CGTTTTGACT 


GACTTGATTG 


GAGGAAGTCC 


4320 


TAATAATGTG 


GCTTTGTCAC 


GGTTTTTAAA 


TTTGGATTCA 


GTTGATATTG 


TAACAGGCTT 


4380 


TAATATCCCT 


CTCCTACTGG 


AATTAATATC 


AAGTTATGAT 


TCAAAAATCA 


ATTTAGAAGA 


4440 


AATTGTTCAC 


AATGCTCAAA 


ATAGTTTCTT 


TAATGTTAAA 


CAACAACTTA 


ACGTAGAGGA 


4S00 


GGAAGAAGAT 


TTATGTCTAT 


AGAGTTTGTT 


CGTATTGATG 


ACCCTCTGCT 


ACATGGTCAA 


4560 


GTTCTCACTA 


CGTGGCTAAA 


AAAGTATGAT 


ATTGAGCAAC 


TTATCATTGT 


TAATGATCGC 


4620 


ATCTCAGAAG 


ATAAAACACG 


ACAATCTATT 


TTAAAGATTT 


CTGCACCGGT 


AGGTTTAAAA 


4680 


ATTGTTTTCT 


TT AG TGT AAA 


ACGCTTTCTC 


GAAGTTTTAA 


ACTCTGTCCC 


AATAAAAAAG 


4740 


AGAACAATGC 


TGATATATAC 


AAATCCAAAA 


GATCTGTATG 


ATTCTATTGA 


AGGAAATTTA 


4800 


AAATTGGAGT 


ACCTCAATGT 


AGGACAGATG 


ACTAAAACGG 


AGGAAAATCA 


AAAGGTAACG 


4860 


GGACGTGTAG 


CTCTAGGTGA 


AGAAGACAAA 


TATTATTTTA 


AGAAAATAGT 


TGATAAGGGA 


4920 


ACGAGAGTTG 


AAATTCAAAT 


CCTTCCTAAT 


GATAAAGTTA 


CAATGTTGGA 


AAAATTTTTA 


4980 


TAAAAATAAT 


TTAAGGAGGT 


ACAGTATATC 


CTATTCACAC 


AAGCATTACT 


GGTGACATTA 


S040 


CTTGGGATTA 


TTGCCACTAT 


TGACTATAAT 


GGACCGTTAT 


TTATGATTCA 


CCGTCCGTTA 


5100 


GTTACAAGTG 


CAATGGTTGC 


CTT ACT ATT A 


CCAGATTTCA 


CCCAAGCTGT 


TCTTATTGGT 


5160 


TCACCTCTTC 


AATTAACTTG 


GCTCGGTCTA 


ACAGGTATTG 


GAGGTTATAC 


TCCACCAGAT 


5220 


ACTATTTCAG 


GTGCCATTAT 


TGGTACTGCA 


TTTGGTATTT 


TATCTGGTCA 


AGGAGAAACT 


5280 


GCTGGTATCG 


CTATAGCAGT 


TCCAATTGCA 


GTTGCTACCC 


AACACTTGGA 


TGTTCTTGCA 


5340 


AAAACTTTAG 


ATGTTTATTT 


TGTGAAAAAA 


GCTGATAATG 


ATGCTAAAAA 


CGGAGATTAT 


5400 


TCAAAGATCG 


GTTTTTATCA 


TTATTCAAGT 


TTCGTTTTAA 


TCACGTTATT 


TAAAATTGTA 


5460 


CCAATTTTCC 


TAGCTATTAT 


GCTTGCAGGG 


GAATATGTGG 


CAGACTTGTT 


TGCTAAGGTT 


5520 
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CCACCAATCG 


TTATGCAGGG 


ACTTAACTCT 


GCAGGTGCTT 


TACTACCTTC 


AATTGGTTTT 


5580 


GGTATCCTTT 


TAAATATGAT 


GCTCAAGAAA 


AATATGTGGG 


TATTCTTGTT 


GATTGGATTC 


5640 


ATTTGTTCTC 


TGTATGGAGG 


AATGTCAACC 


ATTGGCATCT 


CACTACTTGG 


TATTGCGGTA 


5700 


GCATACTTCT 


ACGATATGAT 


TGGAAGCAAA 


CCACAAGAAA 


CAACTTCAAG 


TAGTGATGTT 


5760 


GAGGACGATC 


TTGATCTATG 


ATGAATAATA 


AAGTAACTAA 


AGTTGAACTT 


AAAAAAGTTT 


5820 


TCAAACGAAC 


TTTTATCTAT 


GGTTCTTCAT 


GGAACTATGA 


GAGAATGCAG 


AACCTAGGTT 


5880 


TTCTATATAC 


AATTCTTCCA 


GTATTGAAAA 


AACTATACCC 


AGACAAAGAT 


TCAGCTTCTC 


5940 


CTCCAATGAA 


ACGTCACCTT 


GAGTTTTTCA 


ATACTCATCA 


AACAGCGGCA 


CCATTTATTC 


6000 


TTGGAGTTAC 


TTCCGCTATG 


GAAGAACAAG 


AAGGAAATGA 


AGGTGCAGCT 


TCAATTACTG 


6060 


GTATTAAAGT 


TGGCTTGATG 


GGGCCACTGG 


CTGGTCTAGG 


AGATAGTTTG 


TTCTGGCTGA 


6120 


CACTAGTTCC 


TATCTCTTTT 


AGTATTGGTG 


CGTCTTATTC 


TAAAGACGGC 


GGTGCTTTAG 


6180 


GTATCTTTAT 


CGCCTTAATA 


TTGTTTAATA 


TTATTAATAT 


TCCTGTTAAA 


TATTTCGGTT 


6240 


TCAAATATGG 


GTATACTAAG 


GGTTCTAGTC 


TTATCCAAGA 


AAATAATACA 


AAAGGAACAT 


6300 


TGAATCGCGT 


TACGAGTATG 


GCGACAGCAT 


TAGGGCTAGT 


ACTAGTGCCT 


GGTTTGATTC 


6360 


CATCAATGGT 


TGGTATTAAT 


TTTGGATrAG 


AATTTAAGCA 


GGGGGAACTT 


GTTATTTCTG 


6420 


TTCAACAAAT 


GATTACAAAA 


TTAATTCCAG 


GATTTATCCC 


TATGGCTTTG 


ACTTTATTAA 


6480 


TGTGTAAATT 


AATTAGAAAA 


GGAAAGAATC 


CGCTTCTACT 


AATCTTTAGT 


GTTATGCCTA 


6540 


TTCGAGTTAT 


TCTAGTTGTT 


TTAGGAATTT 


TGAAGTAGTA 


GAAAGTGTGG 


AGGTCCTATT 


6600 


TGGGATATCA 


CCTCCATTTT 


CGAAGAGAGG 


TAAAGAGTGA 


AATTATGGTA 


TAAGAAACCT 


6660 


GCCGCAAATT 


GGAATCAAGC 


CTTGCCGATT 


GGGAACGGTC 


ATTTAGGTGG 


TATGATTTAT 


6720 


GGTTCAGCTA 


CAAAAGAATG 


TATTCAACTA 


AACGATGAGA 


CTATTTGGTA 


TAGAGGAAAG 


6780 


TCAGATAGAA 


ATAATCCAGA 


CTCACTATTG 


CATCTTAAAA 


AAATTCGGGA 


ATATCTTTTA 


6840 


CATGGAGAAA 


TTCAGAAAGC 


CGAAGAATTG 


ATAAACTTAA 


CAGTGTTTGC 


TACCCCAAGA 


6900 


GATCAAAGCC 


ACTATGAATT 


ACTTGGGGAA 


CTTTACATTC 


AGCATATAGA 


TATTCAGTCT 


6960 


TGTGCTCTTT 


CATTCTATGA 


AAGAGAGCTA 


GATTTAGATA 


CAGCTATTTC 


TAATGTTGTG 


7020 


TTTGAGCCTA 


ATAGTTGTAA 


TTTACAAATA 


AAAAGAGAAT 


ATTTTACGAC 


TTTTAATAAG 


7080 


AATATTTTAT 


GTTGCCGTAT 


AGTGTCATCA 


GTTCAAAACA 


CATTAAATTT 


AAACATTAAT 


7140 


TTGGGTAGAA 


ATAAACGCTT 


TAATGACGAA 


GTATCTAAAC 


TGGATTCAAG 


TACAATTTTA 


7200 


ATGTCGGCCT 


CTGCTGGAGG 


TAGAAAAGGT 


GTTCAGTTTA 


AAGTAGTATG 


TCATTCTAAG 


7260 


GTTACGGATG 


GTGAAGTAAG 


TGTATTGGGA 


GAGACAATAG 


TTATTCGGAA 


TGCTACAGAG 


7320 
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CTATTTCTTT ATCTCAAATC AATGACGGAT TATTGCGGAA ATATAGATAT TTCTTCTCTT 7380 
CAGGGAGAAT TTAGTACTAT TCATTACTTT ACAGAAAAAG ATGAACATGT AAAAAAATAT 7440 
CAGGAGCAAT TTAATAGAGT TGATTTTAAA CTAGACTATA GTAAA GG TTG TCTTAGCATT 7500 

CCAACGAATC TACTTCTTGA AAACACTAAA AAGTATACTA ACTACTTGAC TAA C T TG T T A 7560 

TTTCATTATG GAAGATATCT GTTAATATCG TCTAGTCAAC CGAATGGTTT ACCTGCCAAT 7620 

CTTCAAGGAA TATGGTGTGA TGAATTAAAT CCAATTTGGG GTTCTAAATA TACGATTAAT 7680 

ATTAATACTC AAATGAATTA TTGGATGGTA GGTCCATGTG ATTTACCAGA ACTAGAATAT 7740 

CCATTATTTG ATATGCTCGA AAGAATGAGA GAACCGGGAA GACTAACCGC TAAGAAAATG 7800 

TATGGAGCTA CAC G TTTTAC AGCACATCAT AATACGGATC G TTTT G GCGA TACGGCTCCC 7860 

CAATCTCATG CCATGGGGGC TGCAATTTGG GTATTAACTA TTCCATGGTT ATGTACTCAT 7920 

ATTTGGGAAC ACTATTTATA TTTCCAAGAT GAGCGTATTC TTACGGAACA TTTTGAAATG 7980 

ATAAAAGAAG CATTTCTTTT CTTTGAAGAT TATTTATTTG AGCTGCATGG CTACTTGATC 8040 

ACAGGTCCAA GTGTCTCACC GGAAAATAAA TATCGCTTAA AAAATGGTAT TGAAGGAAAT 8100 

GCTTGTCTAT CATCTACAAT TGATAATCAA ATTCTAAGAT ATTTTTGTGA TTCATGCATT 8160 

GGCATTGCAA AACAATTAGG AGACAATTCG GATTTTATTA CTCCTGTGAA GGAGTTAAAA 8220 

AAGAAACTAC CTAAAACAAA AATAGGTACT AATGGGCAAA TCCAAGAATG GTTAGAAGAT 8280 

TATGAAGAAG TAGAGCCTGG GCATAGACAC ATTTCACCTC TATTTGGGCT TTATCCTTAT 8 340 

AATGAGATTG ATATTCATAA AACTCCGGAA TTAGCAGAAG CAGCTAAAAT CACTATCAAT 8400 

AGGAGATTAT CAAACGCTAA TTTTTTATCT TCACAGGAGA GGGAGCAAGC GATTAATAAT 8460 

TCGTTAGTAA GTGGTTTGCA TGCTAGTACA CAAACAGGTT GGAGTGCTGC ATGGCTGATT 8520 

CATTTTTTTG CGAGACTATA TCAAGGTGAA CCTGCTTATA ACCAGATTAA TGGTTTGTTA 8580 

AATAATGCGA CTCTTGGCAA TTTATTTCTT GACCATCCAC CATTTCAAAT TGATCCTAAT 8640 

TTAGGTTTGG TGAGTGGAAT TTGTGAATTA TTACTACAGA GCCATCATAA TTGGTTATCA 8700 

CTAATTCCAG CTTTACCTTC TGCTTGGTCA GAAGGAGAAG TGAAAGGTTT CAGAGTAAGA 8760 

GGAGGATATA ACGTATCGTT TGCTTGGAAA AATGGGGATA TAACATTCCT AAAATTGGAA 8820 

GGAGGAAACA AAGATCAAAA AGTAAGAGTA AGAATATATG GCAAAAATAC TGATGTACAA 6880 

AATATTGAAT TGGTATTTAA TTCAGAAAAA ATTATTGAGT TAAATTTTTA GGTATAAGTC 8940 

ATGAATAAAG AAAAAATAAA AAGAAAATTA ATCACAATAT TGTTTGTATG TATTGGGATG 9000 

TTATGTTTTG GATT G TTAGC AGGAGTTAAG GCTGATAATC GTGTTCAAAT GAGAACGACG 9060 
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ATTAATAATG AATCCCCATT GTTGCTTTCT CCCTTGTATC GCAATGATAA TGGTAACGGA 9120 

TTATGGTGGG CGAACACATT GAAGGGAGCA TGGGAAGCTA TTCCTCAAGA TGTAAAGCCA 9180 

TATGCAGCGA TTGAACTTCA TCCTGCAAAA GTCTGTAAAC CAACAACTTG TATTCCACGA 9240 

GATACGAAAG AATTGAGACA ATGGTATGTC AAGATGTTGG AGGAAGCTCA AAGTCTAAAC 9300 

ATTCCAGTTT TCTTGGTTAT TATGTCGGCT GGAGAGCGTA ATKCAGTTCC TCCAGAGTGG 9360 

TTAGATGAAC AATTCCAAAA GTATAGTGTG TTAAAAGGTG TTTTAAATAT TGAGAATTAT 9420 

TGGATTTACA ATAACCAGTT AGCTCCGCAT AGTGCTAAAT ATTTGGAACT TTGTGCCAAA 9480 

TATGGAGCGC ATTTTATCTG GCATGATCAT GAAAAATGGT TCTGGCAAAC TATTATGAAT 9540 

GATCCGACAT TCTTTGAAGC GAGTCAAAAA TATCATAAAA ATTTGGTGTT GGCAACTAAA 9600 

AATACGCCAA TAAGAGATGA TGCGGGTACA GATTCTATCG TTAGTCGATT TTGGTTGAGT 9660 

GGCTTATGTG ATAACTGGGG CTCATCAACA GATACATGGA AATGGTGCGA AAAACATTAT 9720 

ACAAACACAT TTGAAACTGG AAGAGCTAGG GATATGAGAT CCTATGCATC GGAACCAGAA 9780 

TCAATGATTG CTATGGAAAT GATGAATGTA TATACTGGGG GAGGCACAGT TTATAATTTC 9840 

GAATGTGCCG CGTATACATT TATGACAAAT GATGTACCAA CTCCAGCATT TACTAAAGGT 9900 

ATTATTCCTT TCTTTAGACA TGCTATACAA AATCCAGCTC CAACTAAGGA AGAAGTTGTA 9960 

AATAGAACAA AAGCTCTATT TTGGAATGGA GAAGGTAGGA TTAGTTCATT AAACGGATTT 10020 

TATCAAGGAC TTTATTCGAA TGATGAAACA ATGCCTTTAT ATAATAATGG CAGATATCAT 10080 

ATTCTTCCTG TAATACATGA GAAAATTGAT AAGGAAAAGA TTTCATCTAT ATTCCCTAAT 10140 

GCAAAAATTT TGACTAAAAA TAGTGAGGAA TTGTCTAGTA AAGTCAACTA TTTAAACTCG 10200 

CTTTATCCAA AACTTTATGA AGGAGATGGG TATGCTCAGC GTGTAGGTAA TTCCTGGTAT 10260 

ATTTATAATA GTAATGCTAA TATCAATAAA AATCAGCAAG TAATGTTGCC TATGTATACT 10320 

AATAATACAA AGTCGTTATC GTTAGATTTG ACGCC ACATA CTTACGCTGT TGTTAAAGAA 10380 

AATCCAAATA ATTTACATAT TTTATTCAAT AATTACAGGA CAGATAAGAC AGCTATGTGG 10440 

GCATTATCAG GAAATTTTGA TGCATCAAAA AGTTGGAAGA AAGAAGAATT AGAGTTAGCG 10500 

AACTGGATAA GCAAAAATTA TTCCATCAAT CCTGTAGATA ATGACTTTAG GACAACAACA 10560 

CTTACATTAA AAGGGCATAC TGGTCATAAA CCTCAGATAA ATATAAGTGG CGATAAAAAT 10620 

CATTATACTT ATACAGAAAA TTGGGATGAG AATACCCATG TTTATACCAT TACGGTTAAT 10680 

CATAATGGAA TG GT A GAG AT GTCTATAAAT ACTGAGGGGA CAGGTCCAGT CTCTTTCCCA 10 740 

ACACCAGATA AATTTAATGA TGGTAATTTG AATATAGCAT ATGCAAAACC AACAACACAA 10 800 

AGTTCTGTAG ATTACAATGG AGACCCTAAT AGAGCTGTGG ATGGTAACAG AAATGGTAAT 10860 
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TTTAACTCTG 


GTTCGGTAAC 


ACACACTAGG 


GCAGATAATC 


CCTCTTGCTG 


GGAAGTCGAT 




TTGAAAAAAA 


TGGATAAAGT 


TGGGCTTGTT 


AAAATTTATA 


ATCGCACAGA 


TGCTGAGACT 


107OV 


CAACCTCTAT 


CTAATTTTGA 


TGTGATTCTA 


TATCACAATA 


ATAGAAACGA 


ACTTGCTAAG 


11040 


AAACATGTTA 


ATAATTTGTC 


GGCTGAATCT 


GTTAGTCTAG 


ATTTCAAAGA 


AAAAGGAGCA 


11100 


AGGTATATTA 


AAGTTAAATT 


ACTAACCAGT 


GGAGTGCCTT 


TGAGTTTAGC 


AGAAGTAGAG 


11160 


GTTTTTAGAG 


AATCAGATGG 


TAAGCAATCT 


GAAGAGGATA 


TAGATAAAAT 


AACAGAAGAT 


11220 


AAAGTAGTCT 


CTACAAATAA 


GGTAGCTACT 


CAAAGTTCAA 


CCAATTATGA 


GGGTGTAGCT 


11280 


GCTTTAGCAG 


TTGATGGTAA 


TAAAGATGGA 


GATTACGGAC 


ATCATTCGGT 


GACTCATACT 


11340 


AAGGC AG ATT 


CTAACGCTTG 


GTGGCAGGTC 


GATCTGGGAG 


AAGAGTTTAC 


GCTTTCTAAA 


11400 


GTTGATATTT 


ATAATAGAAC 


AGATGCCGAA 


CCTCAGCGTT 


TATCTAATTT 


TGATGTT ATT 


11460 


TTTCTATCTT 


CATCAGGAGA 


AGAAGTTTTT 


AGAAGACATT 


TTGATAAAGT 


AGTTGATGGT 


11520 


TTGTTATCTT 


TAAAAGTACC 


TTCTGTAGGG 


GCTAAGCTAG 


TCAAAATAGA 


ATTAAAATCA 


11580 


GCAGCTATTC 


CGTTAAGTTT 


AGCGGAAGTT 


GAACTCTATG 


GTTCAAAGAG 


AACTCCGAAG 


11640 


AAACTTTCTA 


ATATTGCATT 


AACAAAACAA 


ACTCGACAGA 


CTTCAACGGA 


TT AC AATGCT 


11700 


TTTTCTCGTC 


TAGCAGTTGA 


TGGAAATAAA 


AACGGAGATT 


ATGGTCATCA 


TTCAGTGACT 


11760 


CATACCAAAG 


AAGATTCTCC 


TTCATGGTGG 


GAGATAGATT 


TAGCACAAAC 


CGAAGAATTA 


11820 


GAAAAGTTAA 


TTATTTATAA 


TAGAACAGAT 


GCTGAAATTC 


AGAGATTATC 


AAATTTTGAT 


11880 


ATTATTATAT 


ATGATTCAAA 


TGATTATGAA 


GTTTTTACAC 


AACATATTGA 


CAGTTTAGAA 


11940 


AGCAATAATC 


TATCCATAGA 


CTTAAAAGGA 


CTGAAGGGAA 


AAAAGGTTAG 


AATTTCTTTG 


12000 


AGAAGCGCAG 


GAATTCCTTT 


AAGTTTAGCA 


GAGGTAGAGG 


TTTATACTTA 


TAACTAATTT 


12060 


TAAAAATTAT 


CACCCAGGCT 


ACCGTAAATA 


TAATCGAGAT 


GGTAGTATGA 


AAGAAACAGA 


12120 


AAAATAAGAG 


GAAAATAGTA 


TGATTCAACA 


TCCACGTATT 


CGGATTCGTC 


CCACTATTGA 


12180 


TCGTCGTCGT 


CAAGGTGTAC 


GCGAATCACT 


TGAAGTCCAA 


ACAATGAACA 


TCGCTAAAAC 


12240 


TGTGGCAGAT 


TTGATTTCAA 


GCACATTGAA 


ATATCCAGAT 


GGGGAACCTG 


TGGAATGCGT 


12300 


GATTTCTCCA 


TCTACTATTG 


CCCCTCTACC 


AGAGGCTGCA 


GCTTCCCATC 


AGTTGTTTAA 


12360 


AAAATCAAAT 


GTTTGCGCAA 


CAATTACAGT 


TACACCATGC 


TGGTGTTATG 


GTAGTGAAAC 


12420 


TATGGATATG 


TCTCCAGATA 


TTCCTCATGC 


TATTTGGGGA 


TTTAATGGGA 


CAGAACGCCC 


12480 


AGGAGCTGTC 


TATCTTGCAG 


CTGTACTAGC 


TTCACATGCT 


CAAAAAGGGA 


TTCCAGCCTT 


12540 


TGGGATTTAT 


GGAAGAGATG 


TTCAGGAAGC 


TAGTGACACA 


GATATTCCAG 


AAGATCTCAA 


12600 
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AGAAAAACTT TTACCCTATG CGCCTCCAGC TCTTGCAACT GCCTTCATCA CAGACACTGC 12660 

TTACCTATCA ATGGGTAGTG TTTCGATGGG GATTGGTGGT TCTATTGTAA ATCCGGATTT 12720 

CTTCCAAGAA TACTTAGGAA TGCGAAATGA ATCGGTAGAT ATGACGGACT TCACCCCCCG 12780 

TATGGACCCT GGTATTTACG ACCCTGAAGA GTTCGAACGT GCGCTCAAAT GGGTGAAAGA 12840 

AAACGTAAAA GAAGGATTCG ACCATAACCG TGAAGACCTT CTTTTAAGCC GTGAAGAAAA 12900 

AGATAGACAA TGGGAATTTG TTATTAAGAT GTTCATGATT GGACGTCACT TAAT G GT T GG 12960 

TAACCCAAGA CTTGCTGAAC TTCGTTTTGA GGAAGAAGCG GTTGGTCACC ATGCTTTAGT 13020 

AGCTGGTTTC CAAGGTCAAC GTCAGTGGAC AGACCATTTT CCAAATGGGG ACTTTATGGA 13080 

AACTTTCCTC AATACTCACT TTGACTGGAA TGGTATTCGA AAACCATTTG TATTTGCGAC 13140 

AGAGAATGAT TCACTAAATG GTGTGTCTAT GCTCTTTAAT TATCTATTAA CAAATACTCC 13200 

ACAAATCTTT GCTGATGTGC GTACTTATTG GAGCCCAGAG CCTGTTAAAC GTGTAACGGG 13260 

ACATACTTTA GAGGGTCGTG CTGCAGCTGG CTTCTTACAT CTAATCAACT CTG G TTCTTG 13320 

TACATTGGAT CGTACAGGTC AAGCTACTCG AGATGGCAAA CCTATTATGA AACCATTCTG 13380 

GGAGTTGGAA GAAAGTGAAG TGCAGGCTAT GCTTGAAAAT ACAGACTTCC CACCAGCAAA 13440 

CCCCGAATAC TTCCGTGGAG GAGGATTCTC AACTCCTTTC TTGACCAAGG CGCATATGCC 13 500 

AGTAACAATG CTACGTCTCA ATCTTCTAAA AGGGGTTGGT CCAGTGCTAC AAATTGCAGA 13560 

ACGTTACACA CTTGAACTTC CTGAAGATGT TCACCATACT TTAGATAATC CTACAGATCC 13620 

AGGATGGCCA ACTACTTGGT TTGCTCCACG TTTGACAGGA AAAGGTGCTT TCAAGTCTGT 13680 

CTATGACGTC ATGAATAATT GGGCAGCTAA TCACGGAGCC ATAACATATG CACACATTGG 13740 

AGCAGACTTG ATTACCTTGG CTTCTATGTT GAGAATTCCT GTCAATATGC ATAATGTACC 13800 

TGAGGAAGAT ATCTTTAGAC CTAAAAATTC GTCCTTATTT GGAACAGAAG ATCTAGAATC 13860 

AGCAGACTAT CGTGCATGTC AGTTGTTGGG GCCACTACAT AAATAAAACT TGTTTATATA 13920 

GGAGCTGAAC TTACGTCCCT CCTATCCTTT TAAAAAGATT TGTTAAACAA TTC AC AAAT A 13980 

ATTGAAAACG AATACAAAAA GTAATATAAT GATGTTAAAT AGATACCCCC GAGGCGCAGG .1040 

AGGAAAATTA TATGGCTATA TTTTATGTTC CGGCAGTCAA CCTTATTGGA AAAGGTGTTG 14100 

TAAATGAAC7 GGGTCCTTAT ATCAACGAAC TTGGCTATAA AAAGGCACTT TTGGTGACA3 14160 

ATAAGTACAT CGAAGGCAGT GATATTTTAC CTAAGACTTT AAAACCACTG GATACAGAAG 14220 

GAATCGAATA T 14231 
(2) INFORMATION FOR SEQ ID NO; 82; 
<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH : 16995 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA ACTT'l 1 TTAG 


GATGCCATTC 


TCCGCTCTCA 


ft/IT Af*f£* ATT 


TTCTGCToAA 


60 


GACGTTCTAA TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTGG 




ATTTTAGGTA 


120 


CTCTCCCTCT TGTTTTCTCA 


ACAATAGTAT 


ACCCGTTTTT 


CCTGTATTGT 




180 


TAAGAAGTAT CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AALTL 1 


TT ACTCCAGC 


240 


CTTCATGTCA GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAU 


w» * irfSATTTT 
1 flnLWH i i * * 


300 


TTCCTTTTTT GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TALL 1 A/% i #* 1 


360 


TAGAATTGCT TATCCCAAAT 


TTATTTGAAA GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAALA 


ff*CC AA.AAGT 


480 


TAGATTTTTT CTGTCTAACT 


TTTGGGGTGT 


AGTTCATGTA 


CACCTGATAT 


gatgcgtttt 


540 


ATAATTTTTA AGCCTTTTTG 


CCCAGCCTCG 


TCAAAAGTAA 


TCniTGALA 


<"*AAAATC*TCT 


600 


GACAAAACTT TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTA*j"i"l i i«« 


nAAAAGGAGG 


660 


ATGATCTAAT GGAAGAAAAA 


GTATCATTGA 


AAGTCAGGGT 


TCAAAAAL i #* 


CCCACATCGC 


720 


TTTCAAATAT GGTTATGCCC 


AATATTGGAG 


CATTTATTGC 




TTGACTGCCC 


780 


TCTTTATCGC TGATGGCTAT 


CTGCCAAATG 


AACAGTTAGC 


TACTGTTGTT 


LjVj I LL Inlul 


840 


TAACGTATTT ATTGCCAATC 


CTGATTCGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT TGTAGGAGCT 


ATTGCTACTG 


TTGGTGCAAT 


CACAGCTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATC 


GACTATCAAG 


AAATTTGATG 


1020 


AGAACTTCCA GGAAAAAATT 


CGTCCCGGAT 


TTGAAATGTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG TTTTGCATTA 


TTGCTTTTGG 


CTTTCTACGC 


AATCGGTCCA 


GTCGTATCGA 


1140 


CTCTTACTGG AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT . TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATG 


1260 


GCATTTTTAC TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC TAATCCTGCA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTTAGCT 


GCTATGGCAG 


1500 
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GAGGTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACCC TGGTCTTAAA TCTCCAGCTT 1560 

CACCACGTTC TATTATTGCC ATTATAGCTA CGGCGCCAAA AGGT C TTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTCTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCCC TCGAACCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TCATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAACCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCACTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCCTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTC 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTCC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCACT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA CTAACAACAC 24 00 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TACGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG A C AG AT AG C A TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGCAAATA 2640 

CTATATTTTG ACTGGACAGT TGGATGATTT CCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA CTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TCATCTCAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTCATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATCGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTT AT AG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCCCTC TTTACGGAGA AATTTGATGG 3300 
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TGAATTTTTC 


TACAATATTT 


CAAATCTGAT 


TGATACGGTT 


TCCATGTATA 




1 1 fin 

J J O \3 


CTITHTAAG 


GACAAGGTTT 


TATTCAATTT 


TCTTTTCCAT 


CATATTCGGw 






CCTCCCTATC 


CTTfTTCAGG 


CTGAAAATTT 


GCCAGAATCT 


ATCCAGATTT 


TAGTTGAAAG 




GAATAAATTT 


CTTTATACAG 


TCATCAGTCT 


TTTAGTGAAT 


GATATTTTTC 


CGAAATATCT 


J34U 


TCATACAGAG 


TATGAGTATG 


GCATGATTGC 


CCTACATTTT 


ATCTCTAGCT 


TAGGCCGTAG 


JOUU 


TCCAGAGATT 


TATCCAGTCC 


GTGTTTTGCT 


TTTAACGGAT 


GAACGTCGGG 


TCACTAGAGA 


3660 


TTTATTAGTC 


AGTAAAATTA 


AGAGTGTTGC 


TCCTTTTCTA 


GAGTTGATAG 


ATATTCAATC 


3720 


TCTAGTAGAT 


TACCACAGTA 


TTGATCTCAC 


TCAGTATGAT 


TATATTTTAT 


CTACCAAGCC 


3780 


GCTGACTAAT 


CAGGAAATCG 


ATGTAATTTC 


TAGTTTTCCA 


ACCGTCAAAG 


AATTGCTTGA 


3840 


ATTACAGGAA 


C G ACT TC ACT 


ATGTACAGGC 


ACATCGTACA 


ATTGTCGCGC 


GTGATGCTAT 


3900 


CGCTCCAGAG 


AAAAGTTATG 


ACTTGCAAGA 


TTATTTAATA 


TCTACTACTC 


AGCTTTTGAG 


3960 


TCAATTCGAG 


TTGGTTCAAT 


TGGAGAATAA 


TCAATCATTT 


GAGCACACGC 


TAGAACAAAT 


4020 


CATCCAATAT 


CAGAAGAATG 


TGAGTGACAG 


AGCTTACCTA 


ACAAGAAAAT 


TGTTATCTCA 


4080 


CTTCCAGAAT 


AGTCCTATGG 


CTATTCCTAA 


TACTGGTCTG 


GTGCTTTTAC 


ATAGTCAGTC 


4140 


TAGCAAAGTA 


ACAACAAATA 


GTTTT ACT AT 


GTTTGAACTC 


AAACTACCTA 


TCTCCGCATT 


4200 


GTCAATGAAA 


CGAGAGGAAC 


AAGAGGTCAA 


AACGTGTCTG 


CTAATGCTAA 


TGTCTAAAGA 


4260 


AGCTAGCGAG 


GAAGCTAGAG 


ATTTAATGAC 


AGCTATTAGT 


CAGTCGATTA 


TTGAAAATCA 


4320 


TCTTTATACA 


GAGATTTACA 


AGACGGGAAA 


TCAATCCATT 


ATTTATCAGA 


TGCTAAATAC 


4380 


TATTTTTAAC 


GAAAAAATTA 


AGAAATTGGA 


GAACTAATAT 


GAAACTTGAA 


AAACATTTGA 


4440 


TTAAGCTTAA 


TAAACAATTT 


TCTAACAAGG 


AGGAAGCTAT 


TTGTTATTGT 


GGGCAAOl it 


a con 


TTTATGAGGC 


TCGATATGTT 


AATGAAGACT 


ATATTGAAGC 


CATGATTGAG 


CGAGATAAAG 


4 SOU 


AGCTATCTGT 


TTACATGGGT 


AACTTTATCG 


CCATACCGCA 


TGGAACACAT 


GCAGCAAAAA 


4620 


ATGATGTCCT 


CAAGTCTCGT 


ATTACAGTCG 


TTCAAGTCCC 


TAGAGGGGTT 


GATTTTGGGA 


4680 


ATGTATCTAA 


CCCTCAAGTG 


GCAACGGTTC 


TTTTTGGTAT 


TGCTGGTATT 


GGTAATGAAC 


4740 


ACTTAGAAAT 


TATTCAGAAA 


ATTTCTATCT 


TCTGTGCAGA 


TGTAGATAAT 


CTTCTTAAAC 


4800 


TAGCAGATGC 


TCAGTCAAAA 


GAGGAAGTAT 


TGCGCTTATT 


TGATGCTGTT 


GAATAATTGA 


4860 


ATTTAGTCAT 


TTGTCATCTA 


GTATATATGT 


CCCTCAAATA 


GGAAAAGGAG 


AAATTGAATG 


4920 


AAACATTCTG 


TTCATTTTGG 


TGCCGCTAAT 


ATCGGTCGTG 


GTTTT AT AGG 


TGAAATTCTA 


4980 


TTTAAAAATG 


GTTTCCATAT 


TGATTTTGTG 


GATGTCAATA 


ATCAGATAAT 


TCATGCTCTG 


5040 
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AATGAAAAGG 


GCAAGTATGA AATTGAAATT GCACAGAAAG 


GACAGTCTCG 


TATAGAAGTA 


5100 


ACTAATGTGG 


CTGGCATTAA TAGCAAAGAA CATCCTGAGC 


AAGTCATTGA 


AGCGATTCAA 


5160 


AAGACGGATA 


TTATTACTAC 


TGCAATCGGA 


CCTAATATAC 


TCCCTTTTAT 


CGCCGAACTT 


5220 


CTAGCCAAAG 


GAATCGAAGC 


TCGCCGAGTT 


GCAGGAAATA 


CACAGGCATT 


GGATGTTATG 


5280 


GCCTGTGAAA 


ATATGATTGG 


CGGGTCTCAA 


TTTCTTTATC 


AAGAACTCAA GAAATATTTA 


5340 


AGTCCGGAAG 


GTTTGACATT 


TGCTGATAAC 


TACATAGGTT 


TTCCAAATGC 


TGCACTAGAC 


5400 


AGGATTGTTC 


CAGCACAAAG 


TCACGAAGAT 


TCCCTTTTTG 


TTGTGGTCGA 


GCCCTTTAAT 


5460 


GAATGGGTCG 


TGGAAACCAA 


GCGTCTTAAA 


AATCCAGATT 


TACCTCTAAA 


AGATGTGCAT 


5520 


TATGAAGAAG 


ATTTAGAACC 


CTTTATTGAG 


CGAAAACTTT 


TTTCAGTCAA 


TTCTGGACAT 


5560 


GCAACTTCAG 


CTTACATTGG 


TGCGCATTAT 


GGTGCCAAGA 


CAATTTTGGA 


AGCTCTTCAA 


5640 


AATCCTAATA 


TTAAATCTCG 


GATTGAATCT 


GTATTAGCTG 


AAATTCGGAG 


TCTCTTGATT 


5700 


CCCAAATGGA 


ACTTTGATAA AAAAGAATTG GAGAATTATC 


ACAAAGTCAT 


TATAGAACGA 


5760 


CTTGAAAACC 


CTTTCATAGT 


GGACGAGGTT 


AGTCGCGTAC 


CTCGTACTCC 


AATCCGAAAA 


5820 


TTAGGCTATA 


ATGAACGATT 


CATCCGGCCG 


ATACGTGAAT 


TGAAAGAACT 


CAGTTTGTCA 


5880 


TATAAAAACC 


TACTTAAAAC 


AGTTGGCTAT 


GTCTTTGACT 


ATCGCGATGT 


AAATGATGAA 


5940 


GAAAGTATTC 


GATTAGGTGA 


ATTGTTGGCT 


AAACAATCAG 


TCAAAGATGT 


TCTTATACAA 


6000 


GTTACAGGTT 


TAGACGACCA 


AGAATTGATT 


GAGCAAATTG 


TAGAGTATAT 


TTAATCTTTT 


6060 


TCGAAAATCT 


CTTCAAATCA 


CGTTACCATC 


GCTTTGTCTT 


AGGCATATCT 


TGTTCTATCT 


6120 


ACAACCTCAA 


AGCAGTGCTT 


TGAGCTGACT 


CCGTCAGTCT 


TATCTGCAAT 


CTCAAAACAC 


6180 


TGTTTGAGTT 


ATCTGCGGTA 


ATCTTTCTAG 


CTTGTCTTTG 


ATTTTTGTTC 


TTATTTATAA 


6240 


GGTAAAAGAA 


GCTGGACAAA 


AAGTCTTCAA 


AATCGGGAAA 


AGGCAGCCTA 


TCGGGTGTTC 


6300 


AAAAATCTTG 


ATAGGATGTC 


CTTTATTATG 


GAAAGCCTTA 


TTGGATTTTC 


TCCTCAGATT 


6360 


GAGTTTTTGA 


TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGTAGCCCAT 


CATGTTAT1T 


6420 


TTATGGACAG 


TGGGAAAATT 


CTTCAAAAAA 


ATAATGCCCA 


TCAATTCTTT 


AGTCGTCCAA 


6480 


GAGAACAACG 


AACCAAGCAA 


TTTTGGAACG 


AATTCTTTCG 


AATGCGATCT 


ATATAGTAAA 


6540 


ATGAAACAAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


AAATGTTTTA 


6600 


GAACTAGAGG 


TGTACTATTC 


TAGTTTCAAT 


CTACTATATA 


ACTGAAAAAT 


TAGATAAATT 


6660 


AGTTTTGGAA 


AATGACTAAC 


CAAAAGATAT 


CCAAAGTAGT 


CTAAAATTGT 


CTATACTTTA 


6720 


TGAGTGTTTT 


AGTTAGGAAA 


AAGGCTTGTT 


GTCTATAATT 


GTCTGCATTA 


GTCTAGATTT 


6780 


TATTTATAGA 


AAATGTTATA 


ATAGACTGTA 


TTTAAAAAAT 


TTTAAGGAGA 


AATGACAGAA 


6840 
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TGTCTGTATC 


ATTTGAAAAC 


AAAGAAACAA 


ACCGTGGTGT 


CTTGACTTTC 


ACTATCTCTC 


6900 


AAGACCAAAT 


CAAACCAGAA 


TTGGACCGTG 


TCTTCAAGTC 


AGTGAAGAAA 


TCTCTTAATG 


6960 


TTCCAGGTTT 


CCGTAAAGGT 


CACCTTCCAC 


GCCCTATCTT 


CGACCAAAAA 


TTTGGTGAAG 


7020 


AAGCTCTTTA 


TCAAGATGCA 


ATGAACGCAC 


TTTTGCCAAA 


CGCTTATGAA 


GCAGCTGTAA 


7080 


AAGAAGCTGG 


TCTTGAAGTG 


GTTGCCCAAC 


CAAAAATTCA 


CGTAACTTCA 


ATGGAAAAAG 


7140 


CTCAAGACTC 


GGTTATCACT 


GCTGAAGTCG 


TTACAAAACC 


TGAAGTAAAA 


TTGGGTGACT 


7200 


ACAAAAACCT 


TGAAGTATCA 


GTTGATGTAG 


AAAAAGAAGT 


AACTGACGCT 


CATCTCGAAG 


7260 


AGCGTATCGA 


ACGCGAACGC 


AACAACCTGG 


CTGAATTGGT 


TATCAAGGAA 


GCTGCTGCTG 


7320 


AAAACCGCGA 


CACTGTTCTG 


ATCGACTTCG 


TTGGTTCTAT 


CGACGGTGTT 


GAATTTGACG 


7380 


GTGGAAAAGG 


TGAAAACTTC 


TCACTTGGAC TTGGTTCAGG 


TCAATTCATC 


CCTGGTTTCC 


7440 


AAGACCAATT 


GGTAGGTCAC 


TCAGCTGGCG 


AAACCGTTGA 


TGTTATCCTA 


ACATTCCCAG 


7500 


AAGACTACCA 


AGCAGAAGAC 


CTTGCAGGTA 


AAGAAGCTAA 


ATTCGTG AC A 


ACTATCCACG 


7560 


AAGTAAAAGC 


TAAAGAAGTT 


CCGGCTCTTG 


ACGATGAACT 


TGCAAAAGAC 


ATTGATGAAG 


7620 


AAGTTGAAAC 


ACTTG CTGAC 


TTGAAAGAAA 


AATACAGCAA 


AGAATTGGCT 


GCTGCTAAAG 


7680 


AAGAAGCTTA 


CAAAGATGCA 


GTTGAAGGTG 


CAGCAATTGA 


TACAGCTGTA 


GAAAATGCTG 


7740 


AAATCGTAGA 


ACTTCCAGAA 


GAAATGATCC 


ATGAAGAAGT 


TCACCGTTCA 


CTAAATGAAT 


7800 


TCCTTGGGAA 


TTTGCAACGT 


CAAGGGATCA 


ACCCTGACAT 


CTACTTCCAA 


ATCACTGGAA 


7860 


CTACTCAAGA 


AGACCTTCAC 


AACCAATACC 


AAGCAGAAGC 


TGAGTCACGT 


ACTAAGACTA 


7920 


ACCTTGTTAT 


CGAAGCACTT 


CCCAAAGCTG 


AAGGATTTGA 


TGCTTCAGAA 


GAAGAAATCC 


7980 


AAAAAGAAGT 


TGAGCAATTG 


GCAGCAGACT 


ACAACATGGA 


AGTTGCACAA 


GTTCAAAACT 


8040 


TGCTTTCAGC 


TGACATGTTG 


AAACATCATA 


TCACTATCAA 


AAAAGCTGTT 


GAATTGATCA 


8100 


CAACCACAGC 


AACAGTAAAA 


TAATCTTAAT 


AAACAGAAAA 


CCCACCTGAA 


TTGGTGGGTT 


8160 


TTCTGATGCA 


CTATTTTCCA 


AAAATCTCTT 


TGAGGTCTGT 


GTCTGTAATC 


CCAATCATGG 


8220 


CTGGGATGCG 


GTCCCAGTTT 


TCTTCGGTTA 


GGATGTAGGA 


TTGTTCAGAG 


GCACTTGATG 


8280 


TGACTGTTTC 


AGAGACAGCT 




CTTCAACATT 


CTCCAGTAGA 


TCACTGAAGC 


8340 


CTTCAATCAG 


ATAGGTTTTT 


CGGGCAGTTC 


CGATGTGTTG 


GGTAGCATAG 


TCGAAGGCTT 


8400 


GTAATTCGCC 


TAGTAAGATG 


AGTTTGCTTT 


TGGCACGTGT 


AATGGCTGTG 


TAGATGAGAT 


8460 


TTCGCTCCAG 


CATACCTCGG 


CTAGCACTAG 


TAATCGGTAG 


GATGACAACT 


GGGAACTCAC 


8520 


TTCCCTGAGA 


CTTATGAATA 


CTCATGGCAT 


AGGCCAAGCG 


AATCTTGTAC 


CATTCGTTAC 


8580 
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GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATCACAAT CTCGTCTTGT TTCGATTCGC 
TCTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC CCCATTAAAG ACATTGATTT 
CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG ATACTGACAC TGAGGAGCTT 
CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTCCATGAGC TGATTGATAG 
CATCAATCCC TGCCGTCCCT CGGTACATAC GAGCCAGAAC TTGGATATCA CGCGCGGGAA 
TACCATTTCT CAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 
CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGCTGAA ATCAGCTGGC AAGATGCCCT 
GTCGAATCTG ACTAGCTAGG GTGACGATGG TTGATTCTTT GCTTTGTCGA TAAATTTTTT 
CCAAGCCAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 
TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGACATAT 
TGGAGAAGAC TTGATTGGCC AGCC AAGT AT CTACCATAGA GAATTCATCC ACGATGATAA 
AGTCAGCATC TAGGTAATCT TCCAGATGAC TCGTATCATC GTCACCTCTC ATTCCCAAGT 
GGCGATCTAT GGTCCCCCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 
CAGTTGGAGC AGCAAGAAGA ATGCGCAGAT TGCTTTTCTT CCTGAACTCA AGTCCTTCTA 
AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT CTTACCAGTA CCAGGCCCAC 
CTGTCAGGAT AAAGACCTTA TTCTGGATAC CATCACAGAT AGCCTGTTTT TGAATGTTAT 
CATACTCAAT TCCCAGTTCT TGCTCGACAG TAGTGATATG TTTTTCAATG GTTTCTAAAT 
CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATCCCTTCCT 
CAGCGAAAAA GAGGCTGTTG TCAAAGATCT TGCTATCAAT CTGCTGAACC TTGTCTTCTT 
CGATCAGGTA GGAGAGCTCT TGCGCAACTT GGCTGGGGTC TAGTTCCACC GGACGGGAAG 
ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 
TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCCGAAGCCT TCAGGAGCCT 
GACTTTCGAT GCCTAGTTCC TCACCTAGTT GCTCAGCAAT GCTAAAGCCC AAACCCTTGA 
TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 
AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 
TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 
AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 
CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 
TTGAAAAGTA CTTGACCAAG CCCTTACTAC TTGGTTTTGC GCGATCATAA CGACTGATTT 
GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 
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CCTCAATTAC 


ATCAGCCATG 


O 1 I^-U 1U4 


4 Vr/\ 4 4 4 V_ 


AAAATCATCA 

rWV> * ^« * %»*ft 


AAATCCTCTG 


10440 


CCTCCGTATC 


GTCGATTTCT 


Auv»Av»CjAv»OA 




ATTGCTGGGA 


TTTTCAAAAA 


10500 


TAATCCGTTC 


AATA<STTCCT 


GAAAAATAAA 




f\ A 4 * * * ft»W 


ATGAATAGCT 


10560 


CAGACTTGGG 


ATFUTTTTTA 


TTTTATACTC 








10620 


CCATCTGCAA 


CCTCAAAACA 


GTAl I i l\*AG 




rAf.TTCTATC 


CACAACCTCA 


10680 


AAACACTCTT 


TTAAGCAGCC 


TACGGCTAGC 


TTCCTAGTTT 


W I I «~ I 4 lUA * 


TTTrATT(5AG 
i 1 i 1 1 


10740 


TATTTGTAAA 


TAAACAATCA 


CTTCTCACGA 


TAGAAGAM»ft 


OVH. luAun 1 1 




10800 


TGCCTCTTAG 


GTTTCTTAAA 


ATGTTCCGAT 


ft ju*M|*|W* ft 1 

ACGGGTCATT 






10860 


TTCCCCTGTG 


ATATCTTTTG 


CTTTGAAGGT 


ACCTACGTGG 




•Pr^AAA/TAA 
ILuAAAUtAA 


10920 


CCGCTCATCT 


CCGAGGAGAA 


GGTATTCTCC 


fL ft A ft 

TTCTGGAACA 


/■!«■! ft ft &^^*^Bl A 

CTAAACCTAA 


AGTTGGTGTfT 


10980 


CTACTTGACA 


TCAACTGTGA 


ACGCTTGAGC 


TTTTTGAGCG 


ft IN ft d Mfm«W— H ft ft 

ATACTTCTAA 


ftr*» ft ft ft /"^*T*TVV* 

AlaAAALfl iTW^ 




TTTATTTCCT 


TCAAAGCCCP 


TGCCTCAGTA 


AGTGCTTTGG 


AGTTTGTCAT 


MIWIl^ ft ft ^t^fm 

CCTTviAAGCvi 


til OA 


TTTGATATAG 


TCTCCTAGAT 


AAGGCTCGTC 


CGTTTCTTTG 


TCATTGATGT 


AGAGTTTA*PC 


111 Urt 


ATTTTCGTAA 


CGAATGGTGT 


CGCCAGGCAT 


TCCAATCACG 


CGCTTGAtTuA 


I \j a L-l_ i 1A1 I 


1 1220 


CCCATCTTCC 


TCATGGGCCA 


CCACGATATC 


AAAACGGTCA 


ATAGGAAGOl 


CTTTTACAAC 




GAAGAGAATT 


TCGCCATCCG 


CTAGGGTCGG 


ATCCATGGAA 


TGTCCTTCTA 


WuUiAALA H 




GCTCCAAAAA 


AAGATACGAC 


TTAAAGCTAG 


TAATGACAGA 


Al IAGGAGmA 


AUU\ A WWUVi 


11400 


CTCTTTTAAG 


AAATTTTTAA 


ATGAATTCAT 


AACTTACCTT 


mr^vt ft ft v"*P^* 

TCTAAGCwl i 


TTTTCGCTTT 


1 1 460 


TTCAGTGTTT 


TTAAAGTGCA 


ATTTGGCGCA 


GAAGCTGAGT 




rATACCCTTC 


11520 


CAAAATCTGG 


CTAGCCACCT 


TGTCAGAAGC 


WiTTLCAW.T 




CCTGATAACC 


11580 


CAGTTCTCGT 


CCCAAATTTT 


CAAGATTTTC 


CAGAAAv>AvjA 




TGACACAAGA 


11640 


AACTGCGACA 


CACAAGTATT 


TGCCCTCAGC 


CTTTTCTTCT 


AAGCTGATAC 


CATTGCTGAA 


11700 


ACGATTGGCC 


TCTTGTGCCA 


AGTACTTCTC 


ATAATTTTTA 


GCACTGGTAA 


AGGCATCAAT 


11760 


CACAATTTTC 


TCAGGCTGAA 


CACCTTTTTG 


AAGGAGGAGA 


TAGATAGCCT 


GATTATGGAG 


11820 


GGCAACCTTA 


ACCGAAACAG 


CGTTGTAGCG 


CTCTCCGATG 


ACCTCCTTGT 


ACTTGCTGGG 


11880 


TGAGAGAAGG 


AGTGCCTGCT 


GCTCAATTTT 


TTCCTTGAGA 


ATAGGAGTAA 


TCTGACGGAT 


11940 


CTTTTGGTCG 


GTCAGAGTCT 


TAGAATCCCC 


CACACCGAGT 


TTTCGTAAAA 


AGTCGTGCTG 


12000 


GTCAGGTCTG 


ACAAAGGCAG 


CCACAACTGC 


AAGCCCACCA 


AAGTAGGAAC 


CATTTCCCAC 


12060 


CTCATCTCTC 


CCAATTAAAG 


GAAGATTTTG 


TCCGCTGGTT 


TGCTCTACAG 


CTTGATAGCC 


12120 



pwcnnnn- ^wn 
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AAACAAACTG 


GCCTATTTTT 


CACCCCCTTC 


ACCCTGAAGC 


AAGATTTTTC 


CAGAAGTATA 


12180 


GATAGAAACC 


GTTGCTTGAG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTCCTAGG 


12240 


AGCCAGACTG 


GTTTGATAGT 


CTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCGC 


TTGGTGTGAG 


12300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTACCG 


AATTTTGGTA 


TAATATCQTC 


AGGTGAATTT 


TATGGCAAAT 


CTAAATCGAT 


12420 


TCAAATTTAC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCGACAGAAA 


AATACCAAGC 


AATTAAAGAA 


CAAATGCCTA 


12S40 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTTATCA 


ACTCAGCTCA 


12600 


GCCGTGAGAT 


TGAATTTGAC 


GATAAGGAGC 


AAGAGCTAGA 


AGAACTCCGT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTTTGGGGAT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGG 


12840 


AGAGCAATTC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAAGGTCAGG 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCAGGTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


ATAGCATTGG 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTCAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GGTGACCTTA 


TTTCTCTTCC 


AAATGGCCTT 


GACAATCTTG 


GCGACCATCC 


CCATGGCAGT 


13140 


TATACAAAAT 


CCTCTTGAAA AGAGTATCGT CGCAAAACAC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCAGT 


TGGCTCAAAC 


AAATCTGGGT 


GACAAATTTA 


ATCGGATAAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATCTA 


AAAAGCTACC 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


GAAAATATTA 


GAAACATTAG 


13380 


ACTTCCATAA 


GGTCAAGGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAG 


GGCTTGCAGC 


13440 


AATTGAGACA 


ACTGGCTCCG 


ACTGCCAAAG 


CAGATAAAAT 


CAAACAGGCT 


TTTGCTGAGA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCGCATTT 


TACTATTCTC 


TCAACTAAGG 


13560 


AAATTGCAGG 


AGTCTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGAGTTCC 


13620 


TACTCTTGAA 


ACGCGTGCTi* 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATGTCAG 


CTTGGAAGAA 


TTAGCCCTTT 


GGTTTCAGAA 


ATT ACAT GAT 


TTTCCGCAAT 


13740 


TACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CGGGTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAATACATG 


ATAGCGAGAG 


TCAGGTACGC 


GATGTTTTAC 


13860 


AAGACTTGCT 


CAAGCAAAAA 


CCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGCAGAAATG 


13920 
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CCCGTCAGGT 


TTTACCAGTC 


AAAAACACCT 


ACCGCAATAA 


GATTGCAGGT 


GTCGTTCATG 


1 noQA 
IJ'OU 


ATATTTCTGC 


TAGTGGAAAC 


ACCGTCTATA 


TCGAACCCCG 


TGAGGTAGTC 


AAACTGAGCG 


14040 


AACAAATTGC 


TAGTCTGCCA 


GCAGATGAGC 


GCTATGAAAT 


CCTTCGCATT 


CTCCAAGAAA 


14X00 


TTTCTCAGCG 


TGTCCGCCCT 


CATGCGGCTG 


AGATTGCTAA 


TGACGCTTGG 


ATTATCGGTC 


14160 


ATCTGGACTT 


GATTCGTGCC 


AAGGTTCGAT 


TTATCCAAGA 


AAGACAAGCA 


GTCGTGCCTC 


14220 


AGCTGTCAGA 


AAATCAAGAG 


ATTCAACTGC 


TCCATGTCTG 


CCATCCTTTG 


GTCAAAAATG 


14280 


CCGTCGCAAA 


TGATGTCTAT 


TTTGCTCAAG 


ATTTAACAGC 


TATTGTCATT 


ACAGGTCCCA 


14340 


ATACAGGTGC 


GAAGACCATC 


ATGCTCAAAA 


CTCTGGGCTT 


GACACAGCTC 


ATGGCCCAGT 


14400 


CAGGATTGCC 


GATTTTAGCA 


GACAAGGGAA 


GTCGTGTTGG 


TATTTTTGAA 


GAAATCTTTG 


14460 


CTGATATTGG 


AGATGAGCAG 


TCTATTGAGC 


AGAGCTTGTC 


TACCTTCTCT 


AGTCATATGA 


14S20 


CCAATATCGT 


GGATATTCTT 


GGCAAGGTCA 


ACCAACATTC 


ACTCTTACTT 


TTGGATGAGT 


14580 


TGGGGGCTCG 


TACTGATCCC 


CAAGAGGGAG 


CAGCCCTTGC 


CATGGCTATT 


CTGGAGGACC 


14640 


TTCGCCTGCG 


TCAAATCAAG 


ACCATGGCGA 


CGACCCACTA 


TCCAGAACTC 


AAGGCCTACG 


14700 


GTATTGAGAC 


AGCCTTTGTG 


CAAAATGCCA 


CTATGGAGTT 


TG AT ACTGC A 


ACTCTTCGCC 


14760 


CGACCTATCG 


CTTTATGCAG 


GGTGTTCCTG 


GCCGAACTAA 


TGCCTTTGAA 


ATTGCCAAAC 


14820 


GTCTAGGCCT 


ATCTCAAGTT 


ATCGTAGGAG 


ATGCCACTCA 


GCAGATCGAT 


CAGGACAATG 


14880 


ACGTCAATCG 


TATCATTGAG 


CAATTAGAAG 


AGCAGACGCT 


GGAAAGCCGC 


AAACGTTTGG 


14940 


ACAATATCCG 


TGAGGTGGAG 


CAAGAAAATC 


TCAAGATGAA 


CCCTGCGCTA 


AAAAAACTCT 


15000 


ACAACGAGCT 


TAATCGTGAA 


AAGGAAACCG 


ACCTTAACAA 


GGCGCGTGAA 


CAGGCTGCTG 


15060 


AGATTGTGGA 


TATGGCCCTA 


ACTGAAAGTG 


ACCACATTCT 


CAAAAATCTC 


CACAGTAAAT 


15120 


CCCAACTCAA 


GCCCCACGAA 


ATCATTCAAG 


CCAAGGCCAA 


GTTGAAAAAA 


TTGGCTCCTG 


15180 


AAAAAGTGGA 


CTTGTCTAAA 


AATAAGGTCC 


TTCAAAAGGC 


CAAGAAAAAA 


CGAGCTCCAA 


15240 


AGGTGGCAGA 


TGATATCGTG 


GTTCTCAGTT 


ATGGTCAGCG 


TGGTACCTPG 


ACCAGTCAAC 


15300 


TCAAGGACGG 


TCGCTGGGAA 


CCCCAACTTG 


GCTTGATTAA 


GATGACCTTG 


GAAGAGAAAG 


15360 


AGTTTGATCT 


TGTTCAAGCC 


CAGCAAGAAA 


AACCAGTCAA 


GAAGAAACAG 


GTCAATGTTG 


15420 


TGAAACGAAC 


TTCTGGGCGA 


GGACCTCAAG 


CTAGACTGGA 


TCTTCGAGGC 


AAGCGCTATG 


15480 


AAGAAGCCAT 


GAATGAGCTA 


GATACCTTCA 


TCGACCAAGC 


CTTGCTTAAC 


AATATCGCTC 


15540 


AACTTGATAT 


CATCCATGGT 


ATCGGAACAG 


GAGTCATCCG 


TGAACGAGTT 


ACCAAATACT 


15600 


TGCAAAGAAA 


CAAACATGTC 


AAGAGTTTCG 


GCTATGCCCC 


ACAAAATGCT 


GGAGGCAGTG 


15660 
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GTCCGACTAT TGTCACTTTT AAAGGATAGC AGTATTCTGG 
TTGAACTAAT TTTTACTAAT AAACACATTG ACAAAAGCCA 
ATCAATTAAA TACCAACACC GAATGAAGTT TAATAGAACT 
ATGACTGTAA ATGGACGGAA CTCTGGAGAG ACCGTAAAGG 
AACTGCTCAA ACTCTCAGGT AAAAGGACAG AGCTAGGATA 
CTAAGCATTC CAGAGTACAT GTATCTTGCA TGTGCTCTTT 
GGAGAAGGAA ATGTTAGAAT TGCTTAAATC AATCGATGCT 
CTTGATTTTA TTGGTCGGAA CAGGCATTTA CCTAACTATT 
TTTGCGTCTA CCCAAGGCCT TTCAGCTTAT TTTTATCCAG 
ATCCAGTTTT GCAGCTCTGT GTACAGCCTT GGCATCAACT 
AGGAGTTGCG ACGGCTATCA AGGTTGCTGG ACCAGCAGCT 
GGCTTTCTTT GGAATGGCTA CCAAGTATGC GGAAGCACTC 
CAAGGACGAC CATGGTGCAG TAGCGGGAGG TCCCATGCAT 
AGAAAAGTGG CGACCACTTG CTGTTTTGTT TGCAGTAGCA 
GGGAATCCGA ACCTTCACCC AACTCAACTC GATTGCAGAA 
GATTTCGCCA GCCATCACAG CTCTCGTCTT GTCTGTCTTT 
TGGACTCAAG TCTATTTCTA AGGTTTCAAC TACTGTTGTT 
TATCTTAGGA ACTCTTACAC TTATTTTCTT TAATATCGGA 
TTTAGTCTTT ACCTCAGCTT TTAGTCCCCT TCCTGCGGTA 
CGTTCGGATG GCTATTCAAA ATGGTGTGGC GCGTGGTGTG 
GGGTTCTGCT CCTATTGCAG CTGCAGCTGC CAAGACAAAT 
GATTTCCATG ACAGGAACCT TTATTGATAC CCTCATCATT 
CATCTTGGTA ACTCC 

<2) INFORMATION FOR SEQ ID NO: 83: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY : linear 




,(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCGGGGCTTT TGTACTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 



60 



WO 98/18931 



PCTAJS97A9588 



661 



ACTAAGCCCT 


TTAATAAACA 


(iTTAuA 1 /wv* 


ATTCAACTTT 


CGTTGATTCG 


TCAGTTTGAC 


X20 


CAGGCTATTT 


CGGAGATT^-C 




CGTTTGACCT 


TGGGGGAACC 


TGATTTTACA 


180 


ACGCCAGACC 


ATGTCAAGGA 






ATCAGAACCA 


ATCCTACTAT 


240 


ACAGGGATCA 


GTGGTCTGCT 


GACTCTACGT 






TAAGGAAAAG 


300 


TACCAACTGG 


ACTATGCTCC 


TCAAAATUAA 


nlLi luw A A 


CAATTGGGGC 


GACAGAGGCT 


360 


TTATCTGCCA 


CTTTGACGGC 


TATTTTGGAA 




AG(?PACTTTT 


GCCAGCTCCT 


420 


GCTTATCCAG 


GCTATGAACC 


GATTGTTAAC 


TTA«i 1 


CAGAAATTGT 


TGAGATTGAT 


480 


ACGACTGAAA 


ATGGTTTTGT 


CTTGACTCCT 




ACAAGCCCAT 


TTTGGAGCAG 


540 


GGTGATAAGC 


TCAAGGCCGT 


TATTCTCAAC 


TATCCAW.LA 


AT£*f"CACAGG 


AATTACCTAC 


600 


AGTCGAGAGC 


AGTTAGAGGC 


CTTGGCAGCT 


GTTTTACGCA 


Au A #tU*AAA 1 


AAA A V* A 4 V* A V. 


660 


TGTGATGAGG 


TTTACTCAGA 


. ATTGACCT AC 


ACAGGCGAAG 


1 1 vj l\> A 


rTAGCAACCA 


720 


TGTTGAGAGA 


CCAGGCTATT 


ATTATCAATG 


GTTTGTCTAA 


A IWiUA 1 Vak. W 


ATCAf*AGCTT 

n lUMV'Wv A A 


780 


GGCGTTTGGG 


GCTGATTTTC 


GCTCCTGCGA 


CCTTCACAGC 


LWAVfl 14V\Al. 




840 


AGTACTTGGT 


CACTGCCGCA 


AATACCATGG 


CGCAACATGC 


1 Vj\_ vjvj 1 Munn 


CCCTTGACCG 


900 


CTGGTAAAAA 


CGATGCGGAC 


CCATGAAGAA 


GCAATATATC 


CAALu I 


A/TiTATrAT 
Iftlnl \-/\ A 


960 


CGAAAAAATG 


ACTGCTCTTG 


GTTTTGAGAT 


TATCAAACCA 


WAvMi luLL 1 


A S- A A * ^ * * • • 


1020 


TGCTAAAATT 


CCAGCGGGCT 


ACAATCAAGA 


CTCCTTTGCT 


ITTPTYIAACC. 


A 4 * * * ^•* m 


1080 


GAAGAAGGCC 


GTTGCCTTTA 


TCCCTGGTGC 


AGCl~ l 1 I wA 


r^rrrTAfnccc 


AAGGCTACCT 


1140 


CCGCCTATCT 


TATGCAGCCA 


GCATGGAGAC 


TnTvAAAunn 


UbvA A V#*W%W 


GACTTGAGGA 


1200 


GTACATCAGA 


GAAGCATGAT 


TCAGTCTATU 


amAf*Tf"AAC 


A A W * V*>» * 


TTACAATCGC 


1260 


AATTTTCCTG 


AGGATGACAA 




nl i 1 1 t nVAVi 


AGCAGGTTGG 


CAAACGCATG 


1320 


TTTTTTG T C A 


AACACGCTGG 


TCAGTCTAAG 


CTGGCGCCTG 


TTATTCAGCC 


CTTGGTGCTG 


1380 


GCACGATTTC 


TCTTGCGAAT 


CAATGATGAC 


GGACTCAGTT 


ACATCGAAGA 


CTATCATGAG 


1440 


GTCATCACTT 


TTCCCAAGAT 


TAATAGTGAC 


CTCTTTGTCA 


TGGCCTATGC 


GACCTATGTG 


1500 


GCAGCTCTTG 


CAGATGCTAG 


TTTGCAGGAC 


AATCAGCAGG 


ATGCTCCCTT 


GTTTGCTTTT 


1560 


TTGCAAAAGA 


CTTTGGAGTT 


GATGGAAGCA 


GGCTTGGATT 


ATCAGGTTTT 


GACCAATATT 


1620 


TTTGAAATTC 


AAATTTTGAC 


TCGATTTGGA 


ATCAGCCTCA 


ATTTTAATGA 


GTGTGTCTTC 


1680 


TGCCATCGGG 


TTGGTCAGGC 


TTTTGACTTT 


TCTTTCAAAT 


ATGGAGCCTG 


CCTCTGTCCA 


1740 


GAGCATTATC 


ATGAGGATAA 


GAGACGTTGT 


CATCTCAATC 


CCAATATCCC 


CTATCTGCTC 


1800 
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AATCAATTTC 


AACCTATTGA 


TTTTGAGACT 


TTGGAGACCA 


TTTCGCTCAA 


CCCTCGAATC 


1860 


AACCAAGAGC 


TACGCCAATT 


TATGGATCAA 


TTATATGAAG 


AGTACGTTGG 


GATTCACCTA 


1920 


AAATCAAAGA 


AATTTATTGA 


TTCCCTAGCA 


CACTGGGGAC 


AATTACTAAA 


AGAGGAAAAG 


1980 


AAATGAAAAA 


AATCGCAGTA 


GATGCCATGG 


GGGGCGATTA 


CGCACCTCAG 


GCCATTGTTG 


2040 


ACGGTGTCAA 


TCAAGCCCTA 


TCTGACTTTT 


CAGATATCGA 


GGTTCAACTT 


TACGGAGATG 


2100 


AAGCTAAAAT 


CAAGCAATAT 


CTGACAGCGA 


CAGACCGCGT 


CAGCATTATC 


CATACGGATG 


2160 


AG AAGATTG A 


TTCGGATGAT 


GAACCTACCA 


GAGCTATTCG 


GAATAAGAAA 


AATGCCAGTA 


2220 


TGGTATTGGC 


AGCCAAGGCT 


GTCAAAGATG 


GTGAAGCAGA 


CGCTGTCCTT 


TCGGCTGGGA 


2280 


ATACAGCTCC 


CTTGTTGGCA 


GCAGGATTCT 


TCATCGTGGG 


TCCTATCAAG 


AATATCGACC 


2340 


GTCCTGGACT 


CATGTCTACC 


TTGCCTACCG 


TTGATGGAAA 


AGGTTTTGAC 


ATGCTAGACC 


2400 


TTGGTGCCAA 


TGCAGAAAAT 


ACAGCCCAGC 


ACCTCCATCA 


ATATGCGGTT 


CTAGGTTCCT 


2460 


TCTATGCTAA 


AAATCTCCGT 


GGCATTGCGC 


AACCACGCGT 


TGGTTTGCTC 


AACAACGGAA 


2S20 


CAGAGACTAG 


CAAGCGCGAC 


CCGCTTCGTA 


AGGAAACTTA 


TGAATTACTG 


GCGGCTGATG 


2580 


AAAGTTTGAA 


CTTTATCGGA 


AACGTGGAAG 


CGCCTGATTT 


GATCAATGGC 


GTTGCAGATG 


2640 


TTGTTGTGGC 


AGATGGTTTC 


ACGGGAAACG 


CTGTGCTCAA 


ATCCATCGAA 


GGGACAGCTA 


2700 


TGGGAATCAT 


GGGCTTGCTC 


AAGACAGCTA 


TTACAGGTGG 


TGGTCTTCGA 


GCCAAACTAG 


2760 


CTGCCCTCCT 


TCTCAAGGAC 


AGCCTCAGTG 


GTTTGAAAAA 


ACAGCTCAAT 


TATTCAGATG 


2820 


TTGGTGGAGC 


GCTCTTCTTT 


GGTGTTAAGG 


CACCTGTTGT 


CAAGACTCAT 


GGCTCAAGCG 


2880 


ATGCCAACGC 


TGTTTATAGT 


ACGATTCGTC 


AGATCCGTAC 


CATGCTAGAA 


ACAGACGTGG 


2940 


TTGCCCAGAC 


TGCGCGTGAA 


TTTTCAGGAG 


AATAAAACAG 


ATGACAGAAA 


AAGAAATTTT 


3000 


TGACCGTATT 


GTGACCATTA 


TCCAACAGCG 


ACAGGGAGAG 


GACTTTGTCG 


TGACAGAATC 


3060 


CTTGAGTCTG 


AAAGACGATT 


TGGATGCGGA 


TTCTGTTGAC 


TTGATGGAGT 


TTATCTTGAC 


3120 


TCTCGAACAT 


GAATTTACTA 


TCGAAATCAG 


CGATGAACAA 


ATTGACCAAC 


TCCAAAACCr 


3180 


AGGAGATGTG 


CTTAAAATCA 


TTCAAGGAAA 


ATAGCAATCG 


GAGTTCCAAG 


TCAACGGAAG 


3240 


TAGATGGTTT 


TTAGAAATGA 


GAAATATCGG 


ACAAGCTCCT 


AAAATCTTGG 


CTGACAGTGC 


3300 


TTATCAACGG 


CTCATGAAGA 


TATATCCTCA 


ACCACAAACT 


CCACGTAAAT 


CCAGCAAACT 


3360 


CAAGCCGCTA 


ACAGTTGAAG 


ATAAACCCTG 


TAATCATGCG 


CTATCTAACG 


AGATAAGCAA 


3420 


GGTTGAGAAT 


ATCTTTGCCA 


AACTAAAAAC 


GTTTAAAATC 


TTTTCAACAA 


CCTATCGAAA 


3480 


TCATCCTAAA 


CGCTTCGGAT 


TACGAATGAA 


TTTGATTGCT 


GGTATTATCA 


ATCATGAACT 


3540 


AGGATTCTAG 


TTTTGCAGGA 


AGTCTAATAC 


TAAAAAAGTG 


ATTAGAAAAC 


ATCTTTTTTA 


3600 



WO 98/18931 



PCT/US97/19588 



663 



AAAATAGAGA 
/ww% a nwnw» 


TGATTTTCAA 


ACAAAAAACC 


TAATTCAAGA 


CGTTTCGATG CCAATTCAAG 


3660 






TAGATACTGT 


TATACTAAAC 


TTGTCAACTT 


TGTAACAAGA 


3720 


f ft ft. 


AA^LAAA 


CAGGTATTCC 


TTATGAATAC 


AAAAACGATG 


TCACAATTTC 


3780 


& & ft ft T*lTf? A 
AAAU lAlwvn 


1 Aw 1 ViAuA Iw 


W A i WW 1 lUvVl 


TTGAAGGTCG 


CGGATGCAAT 


TGGGGAGATT 


3840 


TTCCwAAAGw 


Aww l viTTuun 


wwAwwAwwAw 


V i ML wA WW 1 V> 1 


TCAGCTAGGA 


ATTAAAACAA 


3900 




Aww 1 wwAwLA 




l www Aww A UL 


TATACTTGGA 


GCTGTGGCCT 


3960 


ft ********* ft o 

ATCCAGCuAC 




IAA1 lAiwun 


TTTTaAAAGT 


TTTATTATTG 


CTTTAGTAGT 


4020 


TGGTATATTT 


GGTCCTTATA 


TwwATwVATTT 


AATTAuAAAA 


AAATTTTTAA 


AGTCTTCGGA 


4080 


CAAGAAAACA 


GAAAAATCTG 


mm* ft ft ft ft ftftift 

TTAAAAAATA 


ft M ^ ft ft ft ft ^*V* ft 

ATCAAAALIA 


TAAATGATGA 


ATCTGAATCA 


4 140 


AAATTATTTT 


GCGCATCTAA 


ft ^ ft ****** ft im 


if* ft M ft f'ff A ft l**f* 

TATAGTAACG 


ACTCAAAAAA GGAGTAACTA 


4200 


TGAATCCTAA 


TTTAGAACGG 


TGTTATCTAT 


III^WM" ft ft ****** 

TCTGAtTTAGG 


AATAGATCAT 


ACCACAGGTA 




GCTTAGAAAT 


AGCAGAGACA 


mm ft A* ft ft ft ffWfi^ 

TTAGAAATTG 


ft ft y hp ft ft m ft ft ft 

AAGTAATAAA 


TAGGATGTCG 


TAAGTGTTAC 




TATCAATGAT 


TTATTTGTTT 


ft i"ii>im' ***^ 

CAAGCTTGCC 


TAGGGTGACA 


CTAAAAAATC 


AATTTCCTTT 




CAATAGCATA 


TTTTTAGTGG 


GC AGGACTCT 


T*r* tt /— 'i 1 /"* err* 


ATTTTTITAT 


CCAAAAAGTC 


4 440 


CACTTGGGAG 


GGAGATAbwL 


^■r* ft T^T^^V-/™* ft 


ft /-»/— ft a *"•***■ e*T~ ft 
AwwAAw 1 LLA 


GTTTTTGTTT 


AGTGATTGGG 


4 500 


GTAAGATAGT 


'tmn ft ******* » ^ ft 

TVillATCAGA 


•tV ft^ VM T*& kmR 

TG AlilTAATA 


^•m«-ww*«f^f— ft ft ft 


ATCAAATTCA 


AACCACGTCA 


H 30V 


ACGTCGCCTT 


***■**•*-**—**% ft ma m ft 

GCCGTATATA 


iviy* ft ^»ty* ftf 
TCTWACTuAL 


TTCGTCAGTC 


CTATCTACAA 


CCTCAAAACA 


4620 


w'l~w'l 1 i l\iAC 


WAUWCTACwW 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATTA 


4680 




wA IuaA I A i W 


AAA ill VjuVjn 


WirCTCATTA 

AaLu 1 V.A 1 1 A 


TCGTCCGCAG 


GTGGATCAGA 


4740 


IwwAL 1 wwww 


ft\^^^^n^r^*A 


A I JWa W W*\ A WW 


'IT'I'VTI^fli "i'A 

I 4 • & * WWW A 


CTATGGTAGT 


T ATT ATT TTT 


4800 






wV. 1 AAW4\WW/% 


CCATGGATCG 

w wn 4 wwn a WW 


GACCACCCCT 


TTGGGCTTGG 


4860 


TCAAGGTGGC 


AGAGGAGATT 


GGTTTTGAGA 


CGCGAGCCAT 


TAAGGCAGAT 


ATGACGCTTT 


4920 


TTGACTTGCC 


GGATTTAACT 


TTTCCTTTTG 


TTGCCCATGT 


GCTTAAGGAA 


GGGAAATTGC 


4980 


TCCACTACTA 


TGTGCTGACT 


GGCCAGCATA 


AGGATAGCAT 


TCATATTGCC 


GATCCAGATC 


5040 


CCCGGGTGAA 


GTTGACTAAA 


CTGCCACGTG 


AGCGTTTTGA 


GGAAGAATGG 


ACAGGAGTGA 


5100 


CTCTTTTTAT 


GGCACCTAGT 


CCAGACTATA 


AGCCTCATAA 


GGAACAAAAA 


AATGCTCTGC 


5160 


TCTCTTTTAT 


CCCTATATTA 


GTGAAGCAGC 


GTGGCTTGAT 


TGCCAATATC 


GTTTTGGCAA 


5220 


CACTCTTGGT 


AACCGTGATT 


AACATTGTGG 


GTTCTTATTA 


TCTGCAGTCT 


ATCATTGATA 


5280 


CCTATGTGCC 


AGATCAGATG 


CGTTCGACAC 


TAGGGATTAT 


TTCTATTGGG 


CTAGTCATCG 


5340 
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TCTACATCTT 


CCAGCAAATC 


TTGTCTTACG 


CTCAGGAGTA 


TCTCTTGCTT 


CTTTTGGGGC 


5400 


AACGCTTCTC 


GATTGACGTG 


ATTTTCTCCT 


ATATCAAGCA 


TCTTTTTCAC 


CTCCCTATGT 


5460 


CCTTCTTTCC 


GACACGCAGG 


ACAGGGGAGA TCGTGTCTCG 


TTTTACAGAT 


GCTAACAGTA 


5S20 


TCATCGATCC 


GCTGGCTTCC 


ACCATCCTTT 


CGATTTTCCT 


AGATGTGTCA 


ACGGTTGTCA 


5580 


TTATTTCCCT 


TGTTCTATTT 


TCACAAAATA 


CCAATCTCTT 


TTTCATGACT 


TTATTGGCGC 


5640 


TTCCTATCTA 


CACAGTGATT 


ATCTTTGCCT 


TTATGAAGCC 


GTTTGAAAAG 


ATGAATCGGG 


5700 


ATACCATGGA 


AGCCAATGCG 


GTTCTGTCTT 


CTTCTATCAT 


TGAGGACATC 


AACGGTATTG 


5760 


AGACTATCAA 


GTCCTTGACC 


ACTGAAAGTC 


AGCGTTACCA 


AAAAATTGAC 


AAGGAATTTG 


5820 


TGGATTATCT 


GAAGAAATCC 


TTTACCTATA GTCGAGCAGA 


GAGTCAGCAA 


AAGGCTCTGA 


5880 


AAAAGGTTGC 


CCATCTCTTG 


CTTAATGTCG 


GCATTCTCTG 


GATGGGGGCT 


CTTCTGGTCA 


5940 


TGGATGGCAA 


GATGAGTTTG 


GGGCAGTTGA 


TTACCTATAA 


TACCTTGCTG 


GTTTACTTTA 


6000 


CTAATCCTTT 


GGAAAATATC 


ATCAATCTGC 


AAACCAAGCT 


TCAGACAGCG 


CAGGTTGCCA 


6060 


ATAACCGTCT 


AAATGAAGTG 


TATCTAGTAG 


CTTCTGAGTT 


TGAGGAGAAG 


AAAACAGTTG 


6120 


AGGATTTGAC 


CTTGATGAAG 


GGAGATATGA 


CCTTCAAGCA 


GGTTCATTAC 


AAGTATGGCT 


6180 


ATGGTCGAGA 


TGTCTTATCG 


GATATCAATT 


TAACCGTTCC 


CCAAGGGTCT 


AAGGTGGCTT 


6240 


TTGTGGGGAT 


TTCAGGGTCA 


GGTAAGACGA 


CTTTGGCCAA 


GATGATGGTT 


AATTTTTACG 


6300 


ACCCAAGTCA 


AGGGGAGATT 


AGTCTGGGTA 


GTGTCAATCT 


CAATCAGATT 


GATAAAAAAG 


6360 


CCCTGCGCCA 


GTACATCAAC 


TATCTCTCTC 


AACAGCCCTA 


TGTCTTTAAC 


GGAACGATTT 


6420 


TGGAGAATCT 


TCTTTTGGGA 


GCCAAGGAGG 


GGACGACACA 


GGAAGATATC 


TTACGGGCGG 


6480 


TCGAATTGGC 


ACAGATTCGA 


GAGGATATCG 


AGCGCATGCC 


ACTCAATTAC 


CAGACAGAAT 


6540 


TGACTTCGGA 


TGGGGCAGGG 


ATTTCAGGTG 


GTCAACGTCA 


GAGAATCGCT 


TTGGCGCGTG 


6600 


CTCTCTTGAC 


AGATGCGCCG 


GTCTTGATTT 


TGGATGAGGC 


GACTAGCAGT 


TTGGATATTT 


6660 


TGACAGAGAA 


GCGGATTGTC 


GATAATCTCA 


TTGCTTTGGA 


CAAGACCTTG 


ATTTTCATTG 


6720 


CTCACCGCTT 


GACTATTGCT 


G AGCGG AC AG 


AGAAGGTAGT 


TGTCTTGGAT 


CAGGGCAAGA 


6780 


TTGTCGAAGA 


AGGAAAGCAT 


GCTGATTTGC 


TTGCACACGG 


TGGCTTTTAC 


CCCCATTTGG 


6840 


TCAATAGCTA 


GAAAGAGGAG 


AGGATGAAAC 


CAGAATTTTT 


AGAAAGTGCG 


GAG TTTT ATA 


6900 


ATCGTCGTTA 


CCATAATTTT 


TCCAGTAGTC 


TGATTGTACC 


CATGGCCCTT 


CTGCTTGTGT 


6960 


TTTTACTTGG 


CTTTGCAACT 


GTTGCAGAGA 


AGGAGATGAG 


TTTGTCCACT 


AGAGCTACTG 


7020 


TCCAACCTAG 


TCGTATCCTT 


GCAAATATCC 


AGTCAACTAG 


CAACAATCGT 


ATTCTTGTCA 


7080 


ATCATTTGGA 


AGAAAATAAG 


CTGGTTAAGA 


AGGGGGATCT 


TTTGGTTCAA 


TACCAAGAAG 


7140 
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TGTCCAACCG 


GAGTCCTATG 


CCAGTCAGTT 


CCACATGCTA 


AAGGATCAAA 


7200 




CCAGTATCTG 


CAAAACAGCC TGCAAGAACG 


GGAGAACCAC 


TTTCCACAGG 


7260 






GCCACCTTTC 


GCGACTACAT 


CAGTCAAGCA 


GGCAGTCTTA 


7320 






AATGAGACCA 


TCGCGTCCCA 


GAATGCAGCA 


G CT AGCC AAA 


7380 




AJVTCGCCAAC 


CTCATCAGTC 


AAACAGAGGC 


TAAAATTCCC 


GATTACCAGA 


7440 


CAGCTAAwTW 




ACAGGTGCTT 


CCTTGGCCGG 


TCAGAATCTA 


GCCTACTCTC 


7500 


TTTACCAGTC 


CTACAAb iv.1 


CAGGGCGAGG 


AAAATCCCCA 


AACTAAGGTT 


CAGGCAGTTG 


7560 


CACAGGTTGA 


A&iLALAuA 1 1 


TCTCAGTTAG 


AATCTAGTCT 


TGCTACTTAC 


CGTGTCCAGT 


7620 


ATGCACGTTC 


AGGTAllLA\* 


CAAGCCTATG 


CGTCAGGGTT 


AAGCAGTCAA 


TTGGAATCCC 


7680 


TTAAATCCCA 


ACALX XOGLA 


AAGGTTGGTC 


AGGAATTGAC 


CCTTCXAGCC 


CAGAAAATTT 


7740 


TGGAGGCAGA 


Va ri LAVi4j 1 n«« 


AAGGTACAGG 


GAAATCTTTT 


AGACAAGGGG 


AAAGTTACGG 


7800 


CGAGTGAGGA 


TGGGGTGCTT 


CATCTTAATC 


CTGAGACCAG 


TGATTCTAGC 


ATGGTTGCAG 


7860 


AAGGTGCCCT 


AlTAGCllaA 


CTTTATCCAT 


CTTTGGAAAG 


AGAAGGGAAA 


GCCAAACTCA 


7920 


CAGCTTATCT 


AAGTTCAAAA 


TATGTAGCAA 


GAATCAAGGT 


CGGTGATTCT 


CTTCGCTATA 


7980 


CTACGACTCA 


TwATwLLL>t»0 


AATCAACTTT 


TCCTAGATTC 


TACT ATT AC A 


AGTATTGATG 


8040 


CGACAG CT A C 


TAAuAwTviAu 


AAAGGGAATT 


TCT'I TAAAAT 


CGAGGCGGAG 


ACTAATCTAA 


8100 


Li iCGGAGCA 




CTTAGGTACG 


GCGTGGAAGG 


CCGCTTGCAG 


ATGATTACGG 


8160 


GCAACAAAAu 


11 ALL 1 ALU 1 


TATTATTTGG 


ATCAATTTTT 


GAACAAAGAG 


TAATGTTCGT 


8220 


U'l~l ill Av» Aw 




TTTAAACTGT 


GAGAAAGATT 


CTTCTTCCAG 




8280 


ACAATTTTTG 


AJLAJLACATCT 


&CTATTTATT 


CGGTTAAATT 


CTTGTGTTTT 


TTGGT1 TTTT 


8340 


Li 1 «V* 1 AAAA 1 




AATACGAAAG 


CCGAACTTTA 


AAATGTCAAA 


ACAATTGATC 


8400 


TATTCGGGAA 


AAGCTAAAGA 


TATCTATACA 


ACTGACGATG 


AAAATL1 I AT 


l A 1 I iVAAL i 


8460 


TACAAGGACC 


AGGCGACTGC 


TTTCAACGGT 


GTCAAGAAGG 


AGCAGATTGC 


AGGTAAGGGA 


8520 


GTCTTGAATA 


ATCAGATCTC 


ATCTTTTATT 


TTTGACAAAT 


TAAATGTGCC 


TGGTCTGGCG 


8580 


ACTCACTTTG 


TGGAGAAACT 


TTCAGACACG 


GAACAACTCA 


ATAAAAAGGT 


TAAGATTATT 


8640 


CCTTTGGAAG 


TCCTGCTCCG 


CAACTATACT 


GCTGGTTCCT 


TTTCAAAACG 




8700 


GATGAGGGAA 


TCGCCTTGGA 


GACTCCGATT 


GTCGAATTTT 


ACTACAAAAA 


TGATGATTTG 


8760 


GATGATCCAT 


TTATCAATGA 


TGAGCATGTG 


AAATTCCTAC 


AGATTGCGGG 


TGACCAGCAG 


8820 


ATTGCCTACT 


TGAAGGAAGA 


AACGCGTCGT 


ATCAATGAAC 


TATTGAAAGT 


CTGGTTTGCT ■ 


8880 
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GAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGCCAAG 
ATTATCTTGG CAGACGAATT TTCAC CAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 
CACATGGATA AGGATGTTTT CCCTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 
CTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 
AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGC 
TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 
TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 
CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGCTCAACT CAGAGAGTTT GGTTAGAGAG 
CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC CTATTGTGCA AGTATATGAT 
GTATTTGACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 
GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTCCTAA CTATGCTTTC 
TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCACCGTG CAGCTTCGTC ACAGGAAGCC 
TTGCTTTTCT TGGGAAGTTC GAGTGACGTG ACACTCAACA CAGCCCAACT TTACTTCCTC 
AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 
GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 
AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 
AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 
AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAACC TTTTGGACAC TTACTGGTCT 
GACCACTGCC CTCATACGAC TTTTGAGACA GACTTGAAAC ACATCGACTT TTCAGCTTCT 
AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 
GGTCGGTCTC AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 
CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 
GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT TAAAAACGAA 
ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGG AG CGGCTACCTG TATTGGTGGA 
GCTATTCGTG ATCCGTTGTC ACGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 
GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACA^CAACTC 
ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCAGATTGG GCTTGCAACA 
ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACCTATGGA ACTTGCTGCC 
GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 
ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 
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AAGGTTCAAA 


CAGTTGAGTC 


#IV»#T»ftf* ft^A^^^* 

TuTAuAunw 1 




ArtCTTCAAAA 


AGGAAATGCC 


10740 


ATCGAAGAAC 


GCAAGATTCA 


GCGCCTCTTC 


p*Nf* **r/*rtr"A 
t»u 1 M X uwWA 






10800 


AAGTCCAATG 


ACTTTGGGGC 


m ******* li'MIU ■^pf^ 

AGGCGGTCTC. 






GCCAGA.CGGT 


10860 


CTTGAAATCG 


ACCTCAACAA 


GGTGCCTCTT 


ft & » «T« IV /"*P* ft/^/^ 




TAf*J4rtAAATT 


10920 


GCCATCTCTG 


AATCACAAGA 


ACGGATGGCG 


ItiVu IwU X 4 V. 


f?Tf*f*Tf?AA£5A 


IV* WWW 


10980 


TTCGTTGCCG 


AATGTAACAA 


m ft ft ft ft fV* lh Mm 

AGAAAATATT 


GATGCTGTTG 


1 Vtj X\*uwu/ll» 


Au 1 *w\w l\lAA 


11040 


AAACCAAATC 


TTCTCATGCA 


CTGGAATGGT 


GAGACAATCG 


TTGACTTGGA 


PPPTPP<PWP 


iii fin 


CTTGACACCA 


ATGGTGTGCG 


CGTGGTTGTC 


GATGCCAAAG 


TTGTCGACAA 


ftftlTTTPl A A 
win l\i X LAAA 


111 Ati 


CTCCCAGAAG 


AGCGTCAAAC 


ATCTGCTGAA 


ACACTGGAAT 


LAuATALLC 4 


Tarftfi*T*prr a 

1 /%wVJU i 1V.IA 


1 1 52 ft 


TCTGACCTCA 


ACCATGCAAG 


TCAAAAAGGA 


TTACAGACTA 


TCTTTGACTG 


CTCTGTTGGA 


1 1 ion 


CGCTCAACGG 


TTAATCACCC 


ACTTGCTGGT 


CUx IACCAAC 


TCACACCAAL 


luAubLA I 


X1J4U 


GTGCAGAAAT 


TGCCAGTTCA 


ACACGGTGTG 


ACTCATACTG 


CGTCGGTCAT 




1 1 AAA 


TTCAACCCAT 


ATGT AGCTGA 


ATGGTCTCCA 


n* ft m ^*^^4ft^ 

TACCACGGTG 


CTGCTTATGC 


GCJ l 1ATCOAA 


114DU 


GCAACTGCTC 


CTTTGGTGGC 


TCCTGGTGCC 


AACTGGTTCA 


AGGCTCGTTT 


CTCTTACCAA 




GACTATTTCG 


AGCGTATGGA 


TAAACAAGCA 


GAG CGTTTCG 


GTC AGCC AGT 


ACiwTvUlVl 1 


t i con 
113oU 


CTAGGTTCTA 


TTGAAGCACA 


AATTCAGCTT 


GGCTTGCCAT 


CTATCuOlub 


T*ft. ft.<^ift./^*^0/^ 
1 AnuuAL 4 WW 


1 1 AAA 


ATGTCTGGTA 


CCTTTGAACA 


ATTGACCGTT 


rinnr i ri & ft, 

CCGCCAACCT 


TGGCTGCCTT 






ACGGCAGATA 


GCCGTAAGGT 


GCTCTCTCCA 


Ik % MMM Jk » ft^ 

GAAl T lAAAd 


I. 1A» X 1 VjUwOA 


JW\ 1 f\ A A /\w 


11760 


TACATCCCAG 


GTCAAGCCCT 


CTCTGCAGAG 


ATTGATTTTG 








GCTCAGTTTG 


AAGCCATCCA 


AGCTGACCAT 


AAAGTCACAT 




TrtTPA AATAf" 


11880 


GGTGGTGTAG 


TTGAAAGTTT 


GGCTCTTGCT 


ALv. 1 1 luuAn 




TGCACACCTC 


11940 


ACCTTGCCTG 


ft m ^**Vft?l%^* % ft ft 

AACTTGAAAC 


m /"roWfTV* ft. f* ft 






CTTgAgATCT 

V* * • * W * 


12000 


CCTGAAGAAA 


TTGCTGGAGT 


AGAGAAGGTT 


GGACAAACGA 


AAGCAGACTT 


TACACTGACT 


12060 


GTCAACGGTG 


TCAAGCTAGA 


TGGACACAAG 


CTTGACAGTG 


CATTTCAAGG 


GACATTGGAA 


12120 


GAAGTTTACC 


CAACAGAATT 


TACCCAAGCG 


AAAGAACTAG 


AAGAAGTACC 


AGCTGTGGCA 


12180 


TCAGATGTTG 


TGATTAAAGC 


CAAAGAAAAG 


GTTGAAAAAC 


CTGTGGTTTA 


CATCCCAGTC 


12240 


TTTCCAGGAA 


CCAACTCAGA 


ATATGATTCA 


GCTAAGGCCT 


TCGAAAAAGA 


AGGTCCAGAG 


12300 


CTCAATTTCG 


TGCCATTCGT 


GACCTTGAAT 


GAAGAAGCTA 


TTGTCAAGTC 


AGTTGAAACT 


12360 


ATGGTTGACA 


ATATCGACAA 


GACTAATATT 


CTCTTCTTTG 


CTGGTGGATT 


CTCGGCTGCG 


12420 
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ATGGTTCACC 


TAAGTTTATC 
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GTCAATATCC 


TGCTTAATGA 


AAAAGTGCGT 


12480 


GTGGCTATTG 


ATAGCTTTAT 


CGCCCGTGGT 


GGTTTGATTA 


TCGGTATTTG 


TAATGGATTC 


12540 


CAAGCCTTAG 


TCAAATCGGG 


TCTCCTACCC 


TACGGAAACT 


TTGAAGCTGC 


TAACAGTACT 


12600 


AGCCCAACCC 


TCTTCTACAA 


TGATGCCAAC 


CAACACGTGG 


CCAAGATGGT 


GGAAACTCGC 


12660 


ATTGCCAATA 


CCAACTCACC 


ATGGTTGCTT 


GGTGTGCAAG 


TGGGCGATAT 


CCACGCTATT 


12720 


CCTGTTTCGC 


ACGGTGAAGG 


GAAGTTTGTC 


GTGACGGCTG 


AGGAATTTGC 


AGAGCTCCGT 


12780 


GACAATGGAC 


AAATTTTCAG 


CCAATACGTT 


GACTTTAACG 


GTAAACCAAG 


TATGGATTCT 


12840 


AAGTACAATC 


CGAATGGTTC 


TGTCCATGCC 


ATCGAAGGAA 


TTACCAGCAA 


GAATGGTCAA 


12900 


ATCATCGGTA 


AGATGGGCCA 


CTCAGAACGT 


TATGAGGATG 


GTCTTTTCCA 


AAATATCCCA 


12960 


GGCAATAAAG 


ACCAACACCT 


GTTCGCATCA 


GCGGTTAAAC 


ATTTCACTGG 


AAAATAAGAC 


13020 


TTACAGATTT 


TCTAATAGAT 


AGTATCAGTA 


ATGTAAAAGT 


CATCTAAATC 


TAGCTCTTGA 


13080 


TGATTACAAA 


TGAAAATTAG 


GTATAAAAAA 


TCACATACCA 


AGTAAAATCT 


CTTAATGAAG 


13140 


AATGTGGTCT 


TTTCGGTATT 


TGGGGACATC 


CAGATGCTGC 


TAAGTTGACC 


TATTTTGGAC 


13200 


TCCACAGTCT 


TCAACACCGT 


GGTCAGGAGG 


GGGCAGGAAT 


CCTCTCCAAT 


GATCAAGGAC 


13260 


AACTGAAGCG 


CCATCGTGAC 


ATGGGGCTTT 


TATCAGAAGT 


TTTCAGAAAT 


CCAGCTAATT 


13320 




GACAGGAGCT 


GGTGCGATTG 


GGCATCTGCG 


TTATGCGACT 


GCTGGCGAAG 


13380 


f*TTCTGT AG A 


TAAGATCCAG 




TCCCTTTTCA 


CGATATGCAG 


TTTGCTTTGG 


13440 


CTCATAATGG 


AAATCTGACC 


AATGCAGCCT 


CTCTCAAGAA 


AGAACTGGAA 


CAAAGAGGAG 


13500 


CAATTTTCAG 


CGCGACTTCG 


GACTCTGAAA 


TCTTGGCTCA 


CCTCATTCCT 


CGCAGTCATA 


13560 


ATCCTAGCCT 


GATGGGCAAA 


ATCAAGGAAG 


CGCTCAGCCT 


TGTCAAAGCT 


GGTTTTGCCT 


13620 


ATATCTTGCT 


GTTTGAGGAC 


AAGTTGATTG 


CGGCTCTTCA 


CCCAAATGGA 


TTCCGACCGC 


13680 


TTTCGATTGG 


TAAAATGGCT 


AATGGAGCAG 


TTGTTGTATC 


TTCTGAAACC 


TGTGCTTTTG 


13740 


AGGTCATTGG 


TGCCGAGTGG 


ATTCGTGATT 


TGAAGCCAGG 


TGAGATTGTG 


ATCATTGATG 


13800 


ACGAwwCSCAT 


J Lnu 1 A 1 w*\^ 


AGCTATACAG 


ATCATACCCA 


GTTGGCCCTT 


TCTTCTATCG 


13860 


AGTATATCTA 


CTTTGCTCGC 


CCTGATTCTA 


ATATCCACGG 


TGTCAATGTC 


CATACGGCAC 


13920 


GTAAGAGAAT 


CGGAGCGCAA 


TTGGCGCGAG 


AATTTAAGCA 


TGAGGCAGAT 


ATTGTAGTTG 


13980 


GTGTGCCCAA 


TTCTTCCCTA 


AGCGCGGCTA 


TGGGATTTGC 


GGAAGAATCA 


GGCTTACCAA 


14040 


ATGAAATGGG 


TCTGATCAAA 


AACCAATACA 


CCCAGCGAAC 


TTTTATCCAA 


CCGACTCAAG 


14100 


AATTGCGGGA 


GCAAGGAGTG 


CGGATGAAAC 


TGTCTGCTGT 


TTCGGCTGTT 


GTCAAAGGCA 


14160 


AACGTGTGGT 


CATGGTGGAT 


GATTCCATTG 


TACGTGGAAC 


AACCTCTCGT 


CGTATCCTTC 


14220 
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nuU 1 W 1 




CCCACTGAGG 


TTCACGTTGC 


CATTCGAAGT 


CCTGCACTAG 


14280 


CUT A XW 1 lj 




ATTCATATCC 


AGACCCGTCA 


GGACCTGATT 


GCAGCCAATC 


14340 


ATAUuuTLvin 


A\»AAAW IvViU 


CAAATCATTG 


GTGCGGACAG 


TCTGACTTAT 


CTTTCAATTC 


14400 






GCTATCGAAA 


CAGATGCGCC 


GAACGGTGGT 


CTCTGTGTCG 


14460 


CTTACTTTGA 




W W#VVWwWW A 


TTTATGACTA 


CGAAGAAGAC 


TATCGTAGAA 


14520 


GTTTGGAAGA 


AAAuALUnu 1 


1111 AW/W\U 1 


ACGCGACAGA 


TTCTCCATTA 


AAGAAAAGGA 


14SB0 


AAAAATAAAT 


GAwAAATaAA 


AA luLA 1 A 1 




CACTACTGAC 


TAAAGGCTTA 


14640 


AGCATTTACT 


CAGTAGAuow 


TTTGTOCTAT 


AfifiATCAAAG 


CTAGAGCCCT 


G ACT ACT ATT 


14700 


TTTAGATAAA 


AAGATGGTTT 


AIL IAAAAA 1 


AWw 1 WV*W/*« * 


W * * * *> \*4 WW* 


AAAAGAAAAG 


14760 


GAAAAATAAA 


ATGGCAAATA 


AAAATuLul A 


W UW 1 A_AA 1 W 4 


GGTGTGCATG 


TTGAAGCGGG 


14820 


TTATCAACTT 


GTTGAACGGA 


TTAAAAAuLA 


Lu I u\*WWWV* 1 


ACGGAGCGTG 


CAGGTGTCAT 


14880 


GGGAGCTCTT 


GGTGGCTTTG 


GTGGTATGTT 


TGACCTTTCC 


AJVfIAf*TRGGG 


TTAAAGAACC 


14940 


CGTCTTGATT 


TCAGGGACTG 


& ****** i»f*nfr*r*. 
ACGCTviTvaai 


AAWWAAlrW 1 W 


i *•* * i vjt-n- a n 


TCAACTACGA 


1S000 


CAAGCACGAT 


ACCATCGGGw 


AOuAL 1 A* IUI 


ratf f ATKTGT 


GTCAACGACA 


TCATTGCTGC 


15060 


AGGTGCGGAA 


CCCCTCTATT 


TTLTCGALlA 


Ww IAuLuACA 




AACCAGCTAA 


15120 


GCTAGAACAA 


GTGGTTCCTw 


W I V* i WjUWjA 


ACCTTGTfyrC 


CAGGCTGGTG 


CTGCCCTCAT 


15180 


CGGTGCGGAA 


ACGGw 1GAAA 


I uLLuuvWAl 


w i wwwnn 


GACGACTATG 


ACTTGGCTGG 


15240 


TTTTGCGGTC 


GGTGTGGCTG 


AAAAA 1 W iv-A 


#\A 1 W A 1 1 U/%W 


GGTTCAAAGG 


TGGTAGAGGG 


15300 


AGATGTTCTT 


WVWVfUAW X 


W 1 1 LAAU X vlw 


GATTCACTCA 


AATGGTTACT 


CTTTGGTTCG 


15360 


TCGTGTCTTT 


IxWVju A 1 a AW A 




AGTCCTACCA 


GAATTCGAAG 


GCAAGAAACT 


15420 


1 J\J\\j\jf\J\\j I k 


W 1 Aw 1 iwnw. 




CTATGTCAAG 


GCTGTCTTGC 


CCCTCATCAA 


15480 


AGAAGACTTG 


GTCAACGGCA 


TTGC CCACAT 


CACAGCTGGT 


GGCTTTATCG 


AAAATGTCCC 


15540 


TCGTATGTTT 


GCAGATGACC 


TAGCTGCTCA 


AATTGATGAA 


AGTAAACTTC 


CAGTGCTTCC 


156O0 


AATTTTCAAA 


ACCCTTGAAA 


AATACGGTCA 


GATTAAACAC 


GAAGAAATGT 


TTGAAATCTT 


15660 


CAATATGGGT 


CTGGGACTTA 


TGTTGGCGGT 


CAGCCCTGAA 


AATGTAGAGC 


GTGTAAAAGA 


15720 


ATTGTTGGAT 


GAAGCAGTCT 


ATGAAATTGG 


TCGCAT CGTC 


AAGAAAGAAA 


ACGAAAGTGT 


15780 


C ATT AT C AAA 


TGAAAAAAAT 


AGCGGTTTTT 


GCCTCTGGTA 


ATGGCTCAAA 


TTTTCAGCTG 


15840 


ATTGCCCAAC 


AATTTCCAGT 


GGAGTTTGTC 


TTTTCAGACC 


ATCGTGATGC 


CTATGTGCTT 


15900 


GAGCGTGCAA 


AGCAGCTCGC 


CGTTCTGTCC 


TATGCTTTTG 


AACTCAAGGA 


GTTTGAGAGC 


15960 
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AAGGCAGACT 


ACGAAGCAGC 


CCTTGTCGAA 


CTCTTGGAAG 


AACACCAGAT 


TGACTTGGTT 


16020 


TGCCTAGCAG 


GCTACATGAA 


AATCGTTGGA 


CCAACCTTAT 


TGTCGGCTTA 


TGAAGGTCGG 


X6080 


ATTCTCAACA 


TTCATCCAGC 


CTACTTGCCA 


GAATTTCCAG 


GAGCTCATGG 


GATTGAGGAT 


16140 


GCTTGGAATG 


CTGGCGTGGG 


TCAGTCTGGT 


GTGACCATTC 


ACTCGGTGGA 


TTCGGGTGTG 


16200 


GATACAGGCC 


AGGTCATCAA 


ACAGGTTCGT 


CTGCCACGAC 


TAGCTGATGA 


TACCATTGAC 


16260 


AGATTTGAAG 


CTCGCATCCA 


TGAAGCAGAG 


TACAGGCTCT 


ATCCGGAAGT 


AGTGAAGGCT 


16320 


CTATTTACAG 


ATTGACTTTT 


TGATGATTCA 


TATGATATCT 


TTGATTTTAA 


ATTGGAGTGA 


16380 


GTGTTTCTTG 


AAGACGGCTT 


CAAACGGAGG 


TATTTGTAAT 


GTTAGAATCT 


AAAAAAACAA 


16440 


CTCGATATGT 


ATTTTATGTC 


TATCTGATGT 


TATTAACTTG 


GGGAATCTTA TTTAAGTTTG 


16S00 


AAACAAATCC 


TCAATTTATA 


GCATTTTTCT 


TAGCTCCAAG 


CTATATCAAT TGGATTCCAT 


16S60 


TTTCAGAACC 


ACTAATAGTC 


GATGGAAAAA 


TTCTTTTTGC 


TGAAATGTTA 


TTTAATCTGA 


16620 


TTTTCTTTAT 


TCCATTAGGT 


GTTTGTTTCC 


CTTTGATAAA 


AACTAATTTA 


TCTAGTTTAA 


16680 


GAATAGTCGG 


GACAGGTTTC 


TTGATTAGTT 


TATTGTTTGA 


GTGCTTACAG 


TATATTTTAG 


16740 


CAATAGGTAT 


AACAGATATA 


ACGGATTTGA 


CTTTAAATAC 


GCTAGGTGTC 


TGTGTAGGCT 


16800 


TACTGATTTA 


TCAAATTTTT 


ATAAGAGTGT 


TCAAATCACA 


GACTAGAAAA 


TGGATCAATA 


16860 


TCTTAGGTAT 


GCTTAGCCTT 


GGTTTTGCTT 


ATCTTGTTTT 


ACTGTTACTG 


CATTTACTTA 


16920 


CTCTTTAACT 


AATGATTAAA 


AAGGAGAATA 


TAATGACTAA 


ACGCGTCTTA 


ATCAGCGTCT 


16980 


CAGACAAAGC 


GGGCATTGTT 


GAATTTGCCC 


AAGAACTCAA 


AAAACTTGGT 


TGGGAGATTA 


17040 


TCTCAACAGG 


TGGAACTAAG 


GTTGCCCTTG 


ATAATGCTGC 


CCTGGATACC 


ATTGCTATCG 


17100 


ATGATGTGAC 


TGGTTTCCCA 


GAAATCATGG 


ACGCTCCTGT 


GAAGACCCTC 


CACCCAAATA 


17160 


TCCACGGAGG 


GCTTCTCGCT 


CGTCGTGACT 


TGGATAGCCA 


CTTGGAAGCG 


GCTAAGGACA 


17220 


ACAAGATTGA 


GCTCATTGAC 


CTTGTGGTGG 


TCAACCTTTA 


CCCATTTAAG 


GAAACTATCC 


17280 


TTAAACCAGA 


TGTGACTTAT 


GCTGATGCAG 


TTGAAAATAT 


CGATATTGGT 


GGGCCATCTA 


17340 


TGCTTCGTTC 


AGCAGCGAAA 


AATCATGCCA 


GTGTTACAGT 


TGTGGTAGAT 


CCTGCTGACT 


l7400 


ACGCTGTGGT 


TTTGGATGAA 


TTGGCAGCAA 


ACGGCGAAAC 


CTCTTATCAA 


ACTCGCCAAC 


17460 


GTTTAGCAGC 


CAAACTATTT 


CGTCACACAG 


CGGCTTATGA 


CGCCTTGATT 


GCAGAATACT 


17520 


TCACACCTCA 


AGTGGGTGAA 


AGCAAGCCTG 


AAAAACTCAC 


TTTGACTTAT 


GACCTCAAGC 


17580 


AACCAATGCG 


TTACGGTGAG 


AATCCTCAAC 


AAGACGCGGA 


CTTTTACCAG 


AAAGCTTTGC 


17640 


CTACAGACTA 


CTCCATTGCT 


TCAGCCAAAC 


AGCTCAACGG 


GAAAGAATTG 


TCATTTAATA 


17700 


ATATCCGTGA 


TGCAGATGCT 


GCTATCCGTA 


TCATCCGTGA 


CTTCAAAGAT 


AGTCCAACCG 


17760 
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TTGTGCCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 
CTTGGGACTA CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 
ACCGTGACGT GGATGCTGCG ACAGCTGAGA AGATGCACCG CGTTTTCCTC GAAATCATCA 
TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 
GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTCGAAGCA GAATACACAG 
GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTCACT 
GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 
GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 
TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTCT TCGCCTTCCC ATTGACCAAC 
CCAAAGATCG TCTGGACGGC GCGGTCCTTG CTTCACATCC CTTCTTCCCA TTTGCGGATA 
ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCCGT GCCTCTGTCC 
GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGCCTT GACTATGGTC TTTACAGGTG 
TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 
CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATC TTGGTAAATA 
ATTCGCAAAA GAGGTTGAGG AATGAAACTC CTTGTTGTCG GTTCTGGTGG TCCTGAGCAT 
CCGATTGCTA AAAAGTTACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGOG 
AATGATGGGA TGACTCTGGA TGGTTTGGAA TTGGTAAATA TCTCTATTTC CCAACATTAT 
AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTCGA CCTTTATCGG TCCAGATGAT 
GCCCTTGCTG CTGGTATCGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTGGTCCG 
ACTAGGGCTG CAGCGCAGCT GGAGTGCTCC AAGCATTTCG CCAAGGAAAT CATGGTCAAA 
TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 
ATCGAAAAGC ATGGTGCGCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 
GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAACCCG CTCATCAGAT GCTTTTGGAC 
AATAAATTTG GTGACTCAGC TGCGCGCGTG CTTATTGACG AATTCCTTGA AGGAGAGGAA 
TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCPCAGGAC 
CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACCG GTGGTATGGG TGCCTATGCG 
CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCGC TTGACACCAT TGTCAAGCCA 
CTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 
ATCCTGACAG CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCACAA 



17620 

17860 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18S40 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTGCTC 


AAAATATCAC 


AGATATCCTG 


i oc.cn 


GATACCAAGG 


AGCCAAATAT 


CATGTCGACC 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 




GCATCCAAGC 


GCTACCCGCT 


AGACTATGAA 


AGGGGCCTTC 


ACTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAGCAG 


AGCACTGCTC 


19740 


TCAAACCGCG 


GACGAGTTTA 


TATGCTCGTT 


ACCACAGCAG 


ATJtCCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGGAAGCA 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG 


TCGTGGTGAA 


AAGACCACAA 


CAGTGAATGT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


GAGACCTTAG 


GCTCAAAGTT 


TAGGAATGAA 


ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA 


TCAAAAAGAA 


AAATAAAAAT 


TCACAAAATA 


CGTTAATGAT 


CGTATGGTTT 


20100 


GCGACCGTTA 


GCGAGCTAAT 


ATAGAACAAT 


CACCGCCGTT 


GTGAAAGAAC 


GATTGGATGA 


20160 


TAATCCAATC 


GTTCAGGGAA 


ATTGCAAGAC 


CTTGGCTTTC 


CAATTTAGGC 


ATGAGACACC 


20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGCTGATTG 


TTCAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGGCCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAACTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


ACTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA 


CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGGTGC 


TGGTGGCCCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CACTCATTGG 


TGTGCCAGTC 


AACTCTCCTC 


CTCTTAGTCG 


AGTGGATTCA 


20580 


CTCTATTCTA 


TCGTTCAGAT 


GCCGGGTGGG 


GTGCCTCTTG 


CGACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


AGATAAGTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA 


AAATCGCAGA 


GGAGTCGTCA 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGGGTC 


AGATGATGGC 


20820 


CATTTCTGCT 


ATCTACATGC 


GCCACAAGGT 


TATCGCGCTG 


GATCCTGCGG 


CGGATTGCCC 


20880 


CGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATCTAGACG 




20940 


GTTGGCAGAC 


CGTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAAGG 


ACTTTTTGTC 


AAACAACGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGGACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTCTTGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT GAGATTTCTG TCATCGTGTC ACGAAATCGC AAGGAGGTGA CGTTTTTCCC 21360 

ACTTCAGGAA AATATCCACC GCAACAATAT CCTGTCTAAG ACCATCGTAC CACCCCGCAT 21420 

TTCTGAAAGT CTAGTAGACA AGGCTAAAGC TATGGCAGTG CGAATCGCAC AACAACTCAA 21480 

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 21540 

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600 

GTTTGACACC CATATTCTGG CTGTTCTCGG AGCACCATTA CCAGTCATCA AACTCCATGC 21660 

GCCAGCCCTT ATGCTTAATG TCCTCGCTCA GCATGTCGAC GCTGCTGAAA AATATGTCAC 21720 

AGAAAATCCA AGCGCCCACC TCCACATCTA TGGTAAAATA GAAGCAAAGC ATAATCCTAA 217 80 

GATGGGACAT CTGACTTTGT TTAGTGATCT GCCGGATAGT GTGGAAGAGT TTCGGGAAGG 21840 

CATTG A TTT T TAGGACAAGT CTATGATACA AATTATCGTT AATACATTTA TTGAAAAGTA 21900 

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 21960 

TAAATATGAA GAACTAGCTG CACAATACCC CGAAAATTAT TTACCTATCT ATAATGTACC 22020 

CCTGGATACG CATTTGAATA CACTAGATCA TTACCCGTCT GTGTTTATTG GAAAAGAGGA 22080 

GTTTGACTAG AAATCTTGGT TTACCTAGAT AGCTTATTCC CAACAGCTTA AGAAGAAAGG 22140 

AAAAATTAAC ACATGATCAA CCCTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 22 2 00 

GAAAATAAAT ACCCTGCTTG GCTTGACGTG GAAATCCTCT CTGACGACGC ATGGGCTGAG 222 60 

TTGGGGGAAA TCCCTAAGGA AGATGTGGCT TTGATTCGCA AGAAGGCGCA CTTTGACATC 22320 

GACCGTATTT TGGAAATTGA GCAGGACACC CCCCACGATC TGGTGGCTTT CACGCGTCCG 22380 

CTTTCTGACA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 22440 

CTGCTCGATA CTGCTTATGG TTACCTCTAC AACCAGGCCA ACGACATCAT CCGTCGTGAC 22500 

CTTGAAAACT TCACTAATAT CATCGCTGAC AAGGCCAAGG ACCACAAGTT CACCATCATG 22560 

ATGGGCCGTA CTCATGGTGT GCACGCTGAG CCGACAACCT TTGGTCTTAA ATTAGCAACT 22620 

TGCTACAGCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATGCGCCTGC TGGTCTAGAA 22680 

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTCCCAATA TCCCACCATT TCTAGAGCAG 22740 

TATGTCTGCC ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAACT CCTTCCTCGT 22800 

GACCTTCACC CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22860 

GCGACTGAGA TTCGTCGTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 22 920 

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACGCA ACCCAATCGG TTCTGAAAAT 22980 

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 23040 
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CTCTGGCATG 


AACGCGATAT 


TTCTCACTCA TCAGCTGAGC 


(IT ATf* A TV* Af* 




23100 


ACCATTTTGA 


TTGACTACAT 


GCTCAACCGT 


TTTGGAAATA 






23 160 


TTCCCACAAA 


ATATGATCCG 


AAACATGAAC 


TCGACTTTTG 


GTCTTATCTT 




23220 


GCTATGTTGA 


CATTGATTGA 


AAAAGGCATG 


ACCCGTGAGC 






«J<OV 


CAAAAACAGC 


CTACTCTTGG 


GACAACCAAG 


TAGACTTTAA 


ACCACTTCTT 






CAGAAGTAAC 


ATCACGTCTC 


ACACAAGAAG 


AAATCGATGA 


AATCTTCAAC 


CCAGTTTATT 


9 1 4 A A 
4 J4UU 


ACACCAAACG 


AGTGGATGAT 


ATCTTTGAAC 


GTCTTGGACT 


AGGTGATTAA 


TTAAAAAATA 


4 J4 DU 


AACAGCGAGC 


TTCAATCTCG 


W * V* & 4 • 4 * * 


TTTATCGAAA 


AGACTTAGTC 


TTCTTTTCTT 


*i ^ C9 A 


TTAGTGAGTC 


CATAGGCTGC 


TACTCTGGAC 


ATCAGTCCTC 


CGACTACTAG 


TCCTGCAGAA 


235BU 


TCGTGAGTTC 


CTGTTTCAGG 


/WtVj 1 1 1 i 1 


TPTfSTT ACC A 


CAGGAGCTGG 


ATCTTGAGGA 


23640 


AGAACTTTGC 


TTTCCTCAGC 






GTTGGCTTGG 


GATTTCTAGT 


23700 


TTTGGTTTTT 


CTTCAGCAAT 


AGCGGCTTGT 


CCGTTTTCAT 


CCCCTACATG 


TGTTACCATA 


23760 


GTTCCGACTT 


CGACTATTTG 






CGACACTATT 


TACAAGTGTT 


23820 


TTCACTTCCT 


TACCATCCGC 




nLAUnU 1 nun 


AGTTGCTACG 


ATGTCCATTG 


23880 


ACGCCCTTAG 


TAATGACTTG 


TGTTTTTCCT 


1 1 uAu 1 AAWA 


GTGGATTTTC 


ACAACTCACT 


23940 


GTGGTAAATG 


GAATTTCTTC 




1 UUAU i 1 AW 


CTTTTACCTC 


AGTAGTTGGT 


24000 


GCAAGACCAC 


TTTCATCACC 


I*. 1 I \* 1 V>M< 1 * 


ACAflCAf^Cflf* 

ALMlUAUWUVi 


CAACTTCAAC 


CACTTGGTTT 


24060 


ATAACTTCTT 


TGGTTACCTG 




ACTGTTTCTG 


TTGTTTTTCC 


ATTTTCAGTG 


24 120 


AGTACAGAGA 


TGTAATGAGT 






CTGTGATAAT 


ATTTTCCTGA 


24 180 


CCGGCTGGGA 


GGTTAGGATT 


X A w A A 1 i 4 V* 


ATAACTTCAA 


ATGGAATTTC 


TTCAGTTCTT 


24 240 


GTGATGAGTT 


CTGGTCTGGT 


TTCAACATTG 


GCAGCCACTT 


CATTTTCATC 


TAGGCTTCCT 


24300 


GAATGAGTTA 


CAGCTGGTTT 


GAGGCCTTGA 


AGAGCGGCTT 


TTAGGTTGGC 


TACAACCCTU 




TCAAGCTCAG 


CTTGTTTATT 


ACGGTTGAGG 


TTGTAATTTA 


GAGCTGTTTT 


AGCTGCUTLA 


«44 *U 


AGGGCCTCAA 


GACTTTCTTT 


ACTATATCCT 


TCTAAGTTTG 


TAGGAATTTT 


AGCTAATTCT 


24480 


TCGCGGAGAG 


CATTATAATT 


AGCACGAAAG 


TAGTCTTTGT 


TGTGGTCTGC 


AAAGGCAGTC 


24S40 


ATGAGTTCAA 


AGATTTCCTC 


TTCCTTGTAT 


TCAGCGCTTG 


CTCTATCTGC 


CCAGATTGAA 


24600 


AGCATACTTC 


CGACTGTTGG 


AAGATCTACT 


TCAGGATATT 


TGCTAGAAGC 


TAGTTGATTG 


24660 


AATGGTGTTT 


TTCCAGTATT 


CTCAATAGCT 


TTCTTGACGA 


AACCACCACC 


ATCTTCTGGT 


24720 


TTTTGACCAA 


GAATGTAGTA 


CCAGTCACCG 


TTGGTATTCA 


AGAATTTATA 


GCCTTTGCTT 


24780 


GCTAGGTATT 


GAGGTGATCC 


GAGGTTATAT 


CCCCACCAGC 


CTTTAGACCA 


GTAAGAAATC 


24840 
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AAGACATCTT 


TCTCAAACTG 


AACATCCTCC 


TTGTCTTCAT 


AGTAGAAGCC 


ATCGTTGAAG 


24900 


GCCATTGGTT 


GAACCCCTCT 


TTCTTTGGCC 


ATAGCTGCGA 


GGGTGTTGGC 


ATATTCGGCA 


24960 


AATTTGCCAT 


AGAGTTGATA 


CCACTTGAGG 


TAGTACCAGC 


CTTGGGCACT 


AGTCGCATCG 


25020 


TTCCCGTATT 


CGTCAGTACC 


AAAGTTGAAA 


ATCTTTGTTT 


TACCTGCAAA 


GAAGTCCATG 


25080 


TATTTACCGA 


TGAGGGCTTT 


TACAAAGTTC 


ATCGCTTCTT 


CGTTTTTCAA 


GTCCATAGTT 


25140 


GTTTTTGAAA 


CTTTATCAAA 


GTGGGCTTGA 


GGATTTTTAA 


TACCTAATTT 


TTCCATGCCA 


25200 


ACCAGCATAG 


CATCCATGTG 


ACCTGGACTG 


TTAATAGCTG 


CGATGAGACC 


GATGTCCTTA 


25260 


GATTTAGCGT 


ATTCAATTAG 


CTCTGTTACT 


TCTGCCTGTG 


TTAGTGCAGT 


ACCGTTTGGA 


25320 


TCCTCGTAGT 


AACCTTTAGT 


TCCTTCGATA 


ATAGCTTTTT 


TAACGTCATC 


ACTAGCATAG 


25380 


GTTTTTCCGT 


TGGCAGTAAT 


GGTCATATCA 


TCGAGTAGAA 


AGCGAAGTCC 


GTCATTTCCT 


25440 


AGAAGGAGAT 


GGACATCAGA 


ATATCCGAGC 


TCACTGCCCT 


TGTCTACGAT 


GCCTTTGAGC 


25S00 


TGGTTCAGAG 


TAAAGTATTT 


GCCTCCAGCA 


TCCATTGAGA 


TTACCTTGTT 


TTTGGCAAGT 


25560 


TTTTCAACCT 


CACGTTTAGC 


TTCTTCTTCT 


TTTTGAGCTT 


CAGGCGTGAG 


GCTCAACTTC 


2S620 


TTGACAGTTT 


CTTGAAGTTT 


AGCAATGGCT 


TGATCAATCG 


TATCTTCTTG 


GGCACCGCTA 


25680 


AGGTTGCTAT 


CGAGAGAGCG 


AATAGCTTTT 


TCAGCTTCTT 


TTACGGCCCT 


GACGCTTTCT 


25740 


GCAGTATAAC 


GGTTCAGGTC 


TTTTGCTACC 


TCCTTAAGTG 


CTTGCTCTGC 


AGATTCATAA 


25800 


TCAGCTGCGA 


AGTATTCAGC 


GTTGGCATTT 


GCAAAATGAC 


GCATGAGTTT 


GAAGAGGCGT 


25860 


GATGGTGAAT 


AACGTGCAGA 


TGGAGTGTCA 


GCCCAACCAG 


CTACCATACC 


ACCGATGATT 


25920 


GGGATATCAG 


CTCCTTCTGT 


TTTTGGTACA 


GAAGTGATTG 


GTGTGTTTTT 


AATACCATTG 


25980 


AGCCCCTGAT 


CCACATTGTA 


CCAGCCTTGG 


CCATCAGCGT 


TTCGTCCAAG 


AACGTAGTAC 


26040 


CAAGCATCAT 


TCGTATTAAC 


GATTTGGTGA 


CCTTTTTCAG 


CTAGTAGTTT 


AGAAGAAGCG 


26100 


ACATCGTAGC 


CTCCCCAACC 


ACCAGTCCAC 


ATAGAAACGA 


TGATGTCTTT 


GTCAAAACTA 


26160 


CCAAAGCTTG 


TGTCGCTATT 


GTAGTAGATA 


CCGTCGTTAA 


AACCCATTGG 


TTTGAGACCG 


26220 


TGCGATTTTA 


C AATACGAGC 


GAGGTCATTG 


GCGTAGGCAA 


TAAATTTTTC 


ATAGCCTTTT 


26280 


ACAGGGTAGC 


CTTCGTTTGG 


ATAGTATTTA 


TCAGCTTGAA 


GCACACTCCA 


ACCTTTAGCA 


26340 




CATTGGCATA 


TTCATCAAGT 


CCGATGTTGA 


AGATTTCAGT 


CTTTTTCGCG 


26400 


AAATAAGCAG 


CATACTTGTC 


GATAAGGGCT 


TTTGTAAAAG 


CGACACCTTG 


TTCGTTGTCA 


26460 


AGATCGACAG 


TACGGGCTGA 


TTTCTTCCCA 


AAATAGCTAA 


AGTTAGGGTT 


TTGGATTCCC 


26520 


AATTCTTTCA 


TGGCATTGAG 


AATCGCATCC 


ATGTGTCCAG 


GACTATTTAC 


TCTCGGAATG 


26580 
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ACACCGATAC 


CTTTATCTTT 


GGCATAGTTA 


ATCAGATCTG 


TCATTTGACT 




26640 


TGATTGCCGT 


TTGGATCGTT 


GTAATAATCA 


TTTGTACCTT 


TTTCAATGGC 


GCGTTTGACA 


AO /UU 


TCGTCACTCG 


CATAGGTCTT 


GCCGTTAGCT 


CTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 




AGTCCATCAT 


TTCCGACTAA 


TACCTGTAAA 


TCAGTGTAGC 


CATAATGTTT 


CCCTTTATCG 


26820 


ATGATTTCCT 


TGAGCTGTTC 


TGGTGACAAA 


TATTT ACGTC 


CACCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT 


GCAGCACGTT 


CCTTTCCTGC 


CTCTCTTGCC 


26940 


GGTTTCTCAG 


CCTCTGCTTT 


CGCTTCATCT 


TTTTTAGCTG 


CTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT 


TTCGCTTCTT 


CCTTTTTAGG 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTAGCAGT 


TTCTTTTTCA 


GCAGAAGTTG 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT 


GGTTTTTCTT 


CTGTTTTTGG 


AAGACTAGCT 


27180 


ACCTTATCAG 


TAGCTGGAGT 


TTCTGTTTCT 


ACACTTTTTG 


GAGCTTCTGG 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCAGT 


CCGATTTTCG 


GATGATTGAG 


GGGAATCACA 


AACCGTATGG 


27300 


ATGGTCGGTT 


GCTTTTCTGT 


AGTAGTAGGA 


GTAACTCCAT 


CGGCTGCAAC AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT 


GCTCCTACAG 


CCTATTTACG 


AATAGAAAAA 


27420 


CGCTGTTGTT 


TTTCATGTTT 


CATTGCAAAA 


CCTCCTGATT 


GCATTGTTAT 


ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT 


CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG 


TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


CTCGGAAAAA 


ATACCTATAT 


ATATCTACTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CCACAAATTA 


TAGAATTTTC 


CAAAAATAGG 


TAAGCCCTAC 




27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAACC 


GATTAGCTCA 


GGTTTTCTTC 


TTAGTGATCA 


27780 


CGGTCACATG 


AGATAAATTT 


AATCTTGTAC 


TAATCAGATC 


CTTTCTAACT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG 


GTGATTT T AG 


TTTGTAGCAC 


AGTAGGGAAT 


27900 


TGTTCATCGA 


CTCCGAGGAC 


TGAAGCTGCA 


TGTTCTGGAC 


TTGGAAAGAC 


TATTTCGTTG 


27960 


ATTTCTTCAA 


AGTGTTCATC 


ATTCATGTGA 


ATGTGGTAGT 


CTAACTTGAA 


ACGATTATAG 


26020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA 


TTTGCGTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT 


GATTCGCGGA 


TACGTTCAAT 


CTTGTAGTAG 


28140 


AATTGATCCC 


CGCGTAGACC 


CAATTTTTCC 


AAGTAAACAA 


GCTTGTTTCC 


GCGTTCAATT 


28200 


GAAAGAACAG 


TTACCTTATC 


ATCTTTACCA 


TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG 


GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


CAAGGTCGTT 


TAAGGCGCGA 


ACAACTCTGA 


TAGAGCTGAC 


ATCGTACATT 


28380 
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GAAATGACTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTCCCC ACAGATCATT 28440 
TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 28473 
(2) INFORMATION FOR SEQ ZD NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGCCT GGTATGCGAG GAT ACAGTT C TGAAAATCGC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGCAAATCC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGCT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTCGATC CTGTTATCGG 240 

ACCAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTCTTGGTAA GACAGCAGTT GTCGAAGCTC TAGCGCAAGC 3 60 

CATTGTGAAC GGAGATGTTC CTGCTGCT AT CAAGAACAAG GAAATTATTT CTATTGATAT 420 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTC GTGGAGACAG TGGTTCTAAA GCACTTGCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTCACACT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGCTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 780 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 1260 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


TAAGACGGAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATCTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCGTTTAGAT 


ATCTCTGAAT 


ACAGTGACCG 


CACAGCTGTT TCTAAGCTAA TTGGTACAAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATGACAATAG 


CAATACCTTA 


ACAGAACCTG TTCGTCGCAA 


ISOO 


TCCATACTCT 


ATCATTCTCT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA GTTCTAGATG ATGGTCGTTT 


GACAGATGGT 


CAAGCAAATA 


CAGTAAACTT 


1620 


CAAGAACACT GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1660 


AGAAGATGCG 


GATAAACCAG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCTAA 


1800 


GATTGTAGAT 


TTGATGTTGC 


CTGAAGTTAA 


CCAAACCTTG 


GCTAAGAAAG 


ACATTGACTT 


1860 


GGTACTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACGACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TCGTTGAACA 


AGAAATTCGT 


GATAAGGTGA 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATGCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


TCGTGAGAAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGGAGCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CCTATAGTAG 


AACTGTATTA 


TTCTAGTTTC 


2160 


AATATACTAT 


ACTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TAT ATCAAAA 


CCAGTCCTCG 


2220 


AAAAACGTGG 


ACTGGTTTCG 


TGTTTGGATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA 


ATGGTACGTA 


CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT 


TTTAAATCAC 


ATGACGTTCA 


AAGCCATCAT 


CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGCAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGAGCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTACCACTC 


GTGTGACGTC 


CCCACATAAT 


CATCTGGAAG 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGGGT 


ACGGATGAGT 


2640 


TTGGCTAAGA 


GCTGCTCGAT 


AGTGTGAAGG 


CCGGCAGTAG 


GGATAGAGTC 


TTCGTTTGGT 


2700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTPGACAATG 


GTGTCGTCAA 


GTTCAAAACT 


TTCGACAATA 


2820 


ACTTCTTTTG 


ACATGGTAAA 


TCCTTTCAGT 




CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGAGG 


CTGGAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTGCTAGT 


ATGGGTCAAA 


GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCAGTCA 


AGATGATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 
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GACTTCTTCC 


ATAGCGCGGG 


CAGCATCAGG 


ATTTTCTGTA 


GCTGTGATAG 


CAAGTGCAAT 


J 1<6W 


CAGGATTTCA 


TTTGAATGAA 


GGCCTGGATT 


GCGGCTACCG 


AGATGATCGA 


TTTTAAGACC 


1 1 OA 


TTGGATTGGC 


TTAACAACTT 


CACCCTCCAT 


TAGTTTTACT 


TCTTTAGCGA 


TGTCAGCTGA 


3240 


TTTTT TGATG 


GCGTTGATCA 


AGGCAGCGGC 


TGTAGGACCA 


AAGAGTTCTG 


AGTTCTTACC 


3300 


AGTGATCATT 


TCCCCATTTG 


GCAATTCAAA 


GGCTAGGGCT 


GGTCCACCAG 


TTTCTTCTGC 


3360 


TTTTTGGCGC 


GCAACGACAG 


CAACCTTACG 


GTCTGCAGGT 


GTGATACCGA 


GGTCGTTCAT 


3420 


GAGCAACTCA 


ATTTTCTTGA 


CGGCAGCTTC 


GCCAACTTTT 


TCAGCTTTGA 


AGTCAAGAAC 


3480 


TCTTTGATAG 


TAACGGCGGA 


TGATTTCTTG 


TTTAGAAGCT 


TCGACAGCGG 


CCTCGTCATC 


3540 


TGTAATAGCC 


AAACCAACCA 


TGTTGACACC 


CATATCTGTC 


GGTGAAGCGT 


ATGCTGATTT 


3600 


TCCGAGAATA 


CGTTCCAACA 


TGCGTTTGAG 


CACTGGGAAG 


ATTTCGATAT 


CACGGTTGTA 


3660 


GTTGACAGTG 


GTTTCTCCAT 


ACGTTTGAAG 


ATGGAAGGGG 


TCAATCATGT 


TGACATCATC 


3720 


AAGGTCAGCT 


CTGGCAGCTT 


CATAAGCCAA 


GTTAACTGGA 


TGATGAAGGG 


GAAGATTCCA 


3780 


AACAGGGAAG 


GTTTCAAATT 


TAGCCTACCC 


AGATTTCATG 


CCATTGATTT 


GGTCGTGGTA 


3840 


CATATTGGAC 


ATACACGTTG 


CCAATTTTCC 


AGAACCAGGT 


CCAGGAGCGG 


TTACGACAAT 


3900 


CAAGTTGCGA 


CTGGTTTTGA 


TGTAGTCGTT 


TTTGCCCATG 


CCTTCTGGGC 


AAATGATGTG 


3960 


ATCCATATCC 


GTCGGATATC 


CTTTGATTGG 


ATAATGAAGA 


TAAGAATCAA 


TTCCGTTTTT 


4020 


CTCAACTTGA 


TTGCGGAAGG 


CATCTGCAGC 


GGGTTGGCCA 


GCGTATTCTG 


TAATGACAAC 


4080 


GGAACCAACA 


AAAATCCCTA 


ATTCATTGAA 


TTTATCAATC 


AAACGAAGAA 


CTTCTTGGTC 


4140 


ATAAGAAATG 


CCTAAGTCGC 


CACGTGCTTT 


GGAATGTTCA 


ATGTTGCTAG 


CATTAATGGC 


4200 


AATCACAACC 


TCAACCTGCT 


CTTT CAATTC 


TTGCAAGAGC 


TTCATTTTCT 


TCTCAGGTTC 


4260 


ATAACCAGGA 


AGGACACGAG 


CAGCGTGGAA 


ATCTTCTAAC 


ATTTTACCGC 


CAAACTCTAA 


4320 


GTAGAGCTTG 


CCCTCAAATT 


GGTTAATGCG 


CTCCAAAATA 


TGGTCGCGTT 


GTAAATTCAA 


4380 


ATATTGTTCA 


GAACTAAAAG 


CTTGTTTTTT 


CATTTTTTTA 


CCTCTGGACT 


CTATTATAAT 


4440 


AAAAAATTGG 


AAGTTAGGAA 


ACTACGGAGC 


TAAAAAAGAA 


ATTAAAAAGA 


TTAAGCAAAC 


4500 


GCTTCCACAA 


AATTTTAAAA 


ACTGCTATCA 


TACACTATAG 


ATTATGAAAA 


TAATGAGGTA 


4560 


AACAGATGCA 


AGAAAAATGG 


TGGCACAATG 


CCGTAGTCTA 


TCAAGTCTAT 


CCAAAGAGTT 


4620 


TTATGGATAG 


TAATGGAGAT 


GGAGTTGGTG 


ATTTGCCAGG 


TATTACCAGT 


AAGTTGGACT 


4680 


ATCTAGCTAA 


GCTAGGAATC 


ACAGCAATTT 


GGCTTTCTCC 


CCTTTATGAC 


AGCCCTATGG 


4740 


ATGATAATGG 


. CTATGATATT 


GCTGATTATC 


AAGCGATTGC 


GGCTATTTTT 


GGAACCATGG 


4800 
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AGGACATGGA 


TCAGCTGATT 


GCAGAAGCTA 


AGAAGCGTGA 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


ft m m ft m ^^af^^ 

GAAAATACTG 




ACAGCCCTGA 


GCGAGACTAC 


TATATCTGCC 


GCCATGAACC 


CAATGACCTA 


GATTCTATCT 


iS Q Aft 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTrTTTCA 




GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


« auHii ft ft ^» & 

ATTTATGAGA 


c i nn 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


51 60 


TTGGCAAAAT 


TCCTGACCAG 


AAGGTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCCACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


S280 


GACCAACTCC 


AGAGATTCCC 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


5340 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACACAGTTAG 


5460 


GAGTTCAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAACCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GAGAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


ATCCGTTTGA 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTGTTCCG 


ATTGAAGAAA 


TCATGGACAG 


TATCCGTCTT 


ATTGGACCTG 


5*760 


ACAATGCCCC 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGCCTGA TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTAGA 


AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGGCT 


AAAGAAGTAC 


olZO 


TTGAAAAACA 


GGTCTTGGCT 


CCATGGGATG 




GGAATTACTA 


TAAATATTTT 


c i a a 
oloO 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCCTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GGAGAAAGAA 


CATGAACATA 


AAGAAGCG7G 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6480 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAACATTC AGGTTCACTT TTCTAATCCT 6660 

AGAATAAAGT CCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGACCTG 6720 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 6*749 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 bass pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



TCTACCCATG 


GACTTTCAGC 


CAT TCATTGT 


TCCATCTTCT 


AGTGGCGAAT 


CTTTTGATAC 


60 


AAACGATTCA 


ATTCACTTGG 


ATACTGAAAC 


TCTCCCGCAA 


ACATTTTTCT 


GGTTAACTCA 


120 


ATCCAGCTGA 


TATTTCTTTC 


AGCCAAAATA 


ATGGACAAGT 


TCTCCCAAAA 


TCGTTCAGCC 


180 


ATATTGCTTC 


TCCTTTAGTT 


AGATAAATAA 


TGTGTTTGCG 


CCATGTAAAT 


CAATTGTTTC 


240 


GTATCTCTTG 


GCAATAGAGC 


TCTAGCCTCT 


TCCAAATTCA 


GACTTGGATA 


AACTCGCTTA 


300 


TTTGAAACCC 


CAAGAGGAAG 


TCTGATGGTT 


ACTTCAGGAT 


TTTTTAAAAT 


TATCTCAACG 


360 


AAATCCGTTA 


ATCTTAGATT 


GTCACGGTTC 


TTAAATCCTA 


ATAAATTGGG 


AGATAAAAAC 


420 


TCAAAACAAT 


CTGAAGAATA 


CCTCATCATC 


TCAATTAATT 


TCTCCTTTGT 


CATTTCAGAA 


480 


ACTGAATGAC 


AACATACCTC 


TATGCCATAC 


TTTTCGAAGA 


AATCTAAAAG 


AAGTTGATTT 


S40 


CTTTG TCT AT 


TTTTACTTAG 


ATAGAGATCA 


ATCATGGGAG 


ACCTCCCAAA 


GATTCGGTTC 


600 


CAT TTG AT AT 


TCTGACACGA 


TTAAGGAATC 


TAATAAATTA 


AGGAATCTAA 


TAAATTTGCC 


660 


AAGTTAATCC 


GTTTCTTGTC 


TTCATCATAA 


GCTTTTACAG 


TTACTTGGGT 


TGTAAGTATT 


720 


CCCTCTTTTC 


CCTCGGCTCG 


ATAGCCTTGT 


CCATATAAAA 


CAAAAACGAG 


ATTTTGATCA 


780 


TCATCTACAA 


AGGCATCAAC 


CCCATTCTTT 


ATGTCTTGAC 


TTTCAAGGAA 


TTCCATAACG 


840 


TTTTGAAGAT 


AGGATTCGTA 


AAATAGTGGG 


TACTTATGTT 


TTTTATGGTA 


ATCATCTAAA 


900 


AATGTCACTT 


CAAACTCACA 


TGGAGAGTAA 


TTTTGACTTT 


GAACAGCCTA 


AAAGTGCCAT 


960 


CAAATTTGAA 


TTGGAATAAA 


TCAAATAAAT 


AGCCCCATCC 


TCATCAATCC 


AACCTTTGCT 


1020 


CAAAGACAAC 


TCCAACCGAT 


CTTTTAAAAC 


TGAGTAAACC 


ACCTTAACCT 


CCAGTTTCAT 


1080 


ATTCTTATAC 


CGTTCACTCT 


CAAATAAAAG 


TTTGGGGAGC 


TTATAATAAC 


CCTCTGATGT 


1140 


CTGATATTGA 


TTAGCGGTAA 


TACGCTTCAT 


TATTGTCCCT 


CCAAGACTAA 


AATTCCAACA 


1200 
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TTTCCAAATT CATCAAATCC GATTAAACCT ACTTGTTCCA 

CCTTTTACCC AAATCATTCA TACCTCTCTC AACTACATGT 

TCATGAGCCA CTTTCTTCCT CCTCATGAGC TCAGTTTTAC 

TTTTTCCTCC CTACATTTCC TCAAAAGGGC AGACTCCTCC 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT 

AGGAACAGGA AGATTCACCT TGACTTTTCT AATCCTAGAA 

GGAATAGGCA TAGAGACTAG ACAATTTGAG CAGCTGCTTG 

TCCCACCACG TGAAGAAAAA GATGCCGCAA GCGTTTGATT 

TCCACCTAGA TGTTTGAGAA AAACATAGAG ATTGTAGGCG 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG 

CTCCTTGATG AAATTCTCGG CTTCACCACG TCCACGATAA 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 
(B> TYPE: nucleic acid 
(C) STRAND EDNESS : double 
(D> TOPOLOGY: linear 



TTTCATCAAC 
AACTTACAAA 
TTTCTGCTGT 
CTTGGTGCGT 
TTTCTTCTAG 
TAAAGTGCTG 
CGTCCTGTTC 
GTTAAAGTTT 
ATACAGCTCA 
CCAAAAAATC 
AG 



TAACTGACTT 
ACCCCTGACC 
TCCAGTATCG 
CACACGATTT 
CTGGTTCATA 
AAAACAATTC 
GAACACATTT 
GGAAGTCACC 
TCATCATACG 
CCTCCTTCAT 



1260 
1320 
1380 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1842 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT 
CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA 
TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTCC 
CTGCATCCTC ACTCATCAAG ACTTGACCAT ACTCTTCTAA GACTGAGCGA 
TATCGTCTGC AAACTCCCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG 
TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTCTAG 
TTTCCATTTC AAAAGTCACT GTCTTGGCAA AACCTACCAA GTCAGCCAAC 
AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA 
GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GCCTTCGATA 
TTTTGTCTGC CATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC 
ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TACATTTTGG 
ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG 
CTCTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA 



TGTGAAAATT 
CGATTCGCTC 
AGATTGAGCA 
CTGATATCAC 
CTTGGATGAT 
AGTTCAGAAG 
TTACCAAAAG 

_ TCATCCTTGG 
ATTTGAGCCA 
TCAATCTTTG 
CTCAACATGC 
AAGAACTCTT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 

780 
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CTGTTTTTTC 


TTGAGCAACG 


AGAAAGTCAG CAATCTCCAT 


TCCACGATCC 


ATCAAATCCT 


840 


CTAAAGATAT 


CGTGATTTTT 


AAAGTTGTAT 


CACTAATTTG 


TTTCATTTTC 


ATTGCTAGTA 




ACCTCATACT 


TTCAGTTCTA 


TCTATTATAC 


TAGATTTTTA 


CGATTTTATC 


AAAAGAAOGC 


OCA 

950 


TCCTCTATAC 


GGATAGATTT 


TCCCTAGGGT 


CTTTCTATAG 


GAGACTCCAA 


AAGAAAATTT 


1020 


CTGCAGACAG 


ATAGAAAAAG 


CCTTCAAAAT 


CGGCTAAGAG 


CCGACTTTGA 


AGACCTTATA 


1080 


CATCAGAATA 


CTTATAATTT 


AAAGGTTCCT 


ACACCGAGGA 


TAGAACGATT 


TAACTTTCTG 


1140 


AGAATTTGAA 


GACTTTGCTC 


AAATTTCTTA 


TAACGAGTCA 


CTCCGTACTC 


TTCAACAAGA 


1200 


AGGACTGTAT 


CTCTTTCCAA 


AAGAGATCAT 


ACATCCTGTA 


AATCTACAAA 


ATGCATTCCT 


1260 


TTTAAAGCTT 


CTTGACTCTG 


TTTCAATTTA 


TCTAAGATAG 


CTTTATTTGA 


GCTAACGATC 


1320 


GTCAATTCCT 


GTCCAGTATT 


TTTCTATGAC 


AAAACATCTG 


CTAGGTTAGC 


AATTGTTGTA 


1380 


ATCTCTGTTA 


CAAAATCAAT 


TTGATACTGA 


GAAAAATCAC 


CTACTCTATT 


GATTGTTGGA 


1440 


TTAAAGAGAT 


AAACTAACAC 


ATTTCCCATC 


ACAACCAAAA 


TCACACAAAC 


CACTCCAATA 


1500 


ACAACTAAAC 


GAAGAATCAG 


ATTTTTCACA 


TTTAAGCCAA 


GCGCTGTTTC 


ACCATTTGCG 


1560 


TTCAATTCTT 


TAGAGTTGAT 


GGTTTCCAGT 


TTTTCAATTT 


TCACATTTGC 


ATAGGCATGT 


1620 


TTAAATTTCT 


CAATCAACCC 


ATCAATTTTT 


TTCTCTAACA 


AGTTATTGGC 


ATCTTTACTT 


1680 


GATGTCAAAA 


TTTTCACACC 


AACCCCTGCA 


TCGTCAATCA 


TATAGTAGAC 


GGTCAATTTT 


1740 


TTCCACCAAT 


AGTCATTCGT 


TGAATTTTTC 


AAGGTTGTTT 


CTGTCGTGTC 


TAATTCACTC 


1800 


GCAATTTTTT 


TCAACTCACT 


GCGTTCTACA 


TCATTCAAAA 


GATAAGCTCC 


ATTCAAATTA 


1860 


CCATCAATCA 


ATTTCCCATA 


AAAATCACTA 


TAACCACCAA 


TTTGATGATT 


CAAAATCGTT 


1920 


TTGTCCGACT 


CTTTTGGACG 


AGTGATTTTA 


TAGATAAGAT 


AAGTTGAATA 


ACTTGTTCTA 


1980 


TCTTTGACAG 


TGTTTTTATT 


CCTAACTGCT 


TTAATTCTAA 


ATGGTACAGC 


AATGAGAGCA 


2040 


AATAAAGCGA 


TGAGACCTAA 


AATATTTCCT 


TTTCGCTTTT 


TATAAAGATT 


TGCAAACAAA 


2100 


TCAGCTACTG 


AATAATCTTC 


AAACATGATT 


TTTTTCTCCT 


TTGTTTAGTA 


GATACTAGTT 


2160 


TTCCTTTGTA 


AGCATTTTTG 


CTACAAATAT 


AATCACAAGA 


ACAATTCCCC 


AGAATTGCAT 


2220 


TCTAAATAAA 


TTGAAGAAAC 


TTTCTGAAAA 


GCTGCTTCTT 


GGCATAAAGA 


ATACATTATT 


2280 


CAAGATCAGT 


AGGGATAAAG 


CAAATAGGAT 


TGTCCTTGAG 


CGATAGGCTA 


CTTGCAGCAT 


2340 


CGCTATAAAT 


AATACGCCGA 


GTAAGAAACT 


AAGCAGAAAG 


ACTCCAATCA 


TACCATAGTC 


2400 


GGTATACAAC 


TCCATGATAT 


AACTACTTCC 


GATACCATCC 


CCTTTCAAGT 


ATTCC1TGTT 


2460 


CAACACAAGA 


TAGGATAGAT 


TGTGGGCATA 


ACT ATT ACTA 


TCAATAGCTA 


GTTCCACACT 


2520 
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ATTGGTTCTA TCTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 25BO 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2700 

TAAGATATCC CAGAAACCTG TATGGCAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 2760 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA AT ACT ATT AG CTGCAATAAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3060 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAACTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTA C TTT T ATTTTCTTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATCCCTC CTGTGGTCAA CCCCAGAATC GAAATCATGA CAACTATAAA 3300 

CGCAAAACGA TAGGCTATTG GATGATAGCT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3360 

CGTCGGTCTT GATACCAGAA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3420 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3480 

AAGTAACCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TC G TT TC CCA TCAATCACTT GGTCACCCCC 3600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3660 

AATATTTATG AATAGCGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 3780 

TTT AT ACT G A TCATCTAGCA ACATCTTATC CAGAATAAAG AACTGGGCAT AGGCCAATCT 3840 

GAAAAAAGCG ACCTCTTTCA AGTCAGGATA GTTTTTCACA ACTTCATTAT AAAACTTTTG 3900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGCTCGTAA TACTATCCCC 3960 

TCTATCGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA T ACT TC TTGG CCAACTTGAT 4020 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG TCATCCTGAT TGGCTGACTG 4200 

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 4260 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4320 
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AATAAAAATC ACATAATCCC CCTCAGCCTG CTTCATCCCA TCATTTCCTC CTTGCGACAA 4360 

TCCTTCCTTC TTTTTATGAA CCACTGACAC CCTCTCATCT TCTTCAGCGA TTGAATCACA 4440 

CAAGCCACCA CTTTCATCTG TTCCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4S00 

CCTCTCCTTC TGAATGCAAC CTATCGATTT TTCTAGGTAC TGCGCCACAT TATACACTGC 4560 

CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATACTTTT TCTACTT G TT 4620 

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4 680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 4740 

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC CTAGAGATAA 4800 

AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860 

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920 

AATGTACATA GTCCTCAATC CCATACTCTT TGACTC G TTT TTTCAGTTCC TCTTCCATAT 4980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCCTATCA 5040 

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100 

CACCAGCAAT CTCGATATCC ATCTTCTCTT GAGATTTTTC TAGAATAGTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTCT AATTTAGAAG TATAATCTGG ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGC CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GACAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA TAAATTCTCA 5460 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTCCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCACCGATGA CAGCATCATG ATCCGGTACC 5760 

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CA C TTCTGAG ATAGACATCA 5 820 

TAGCCGATTG GAATAGCCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCACGACCTG AGCATAATTC TGTAATGCCA 5940 

TCACAATCAC CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGG AG AG ATT TTGGACTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG 
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TCAAAGCCTG 


ACTGCTATCC 


AAATAGCTAT 
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TTATACTCTT 


6660 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCA 


CCTTGCCGTA 


GGTATAGGTA 


ACTGACTTCG 


6720 


TCAGTCTTAT 


CTACAACCTC 


AAAACTGTGT 


TTTTACCAGC 


CTCCGGCTAG 


CTTCCTAGTT 
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TTTTCATTGA 


GTATTATCTT 
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GATATTTGTT 
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TTCTGGAATC 


GTCAATTTTT 
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6960 


ATAAACCGTT 


TGTTAGGAGG 


AAAGTATAGG 


GTACGTTGGT 
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CCAAAAACCT 


7020 
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TAGTGGGATC 


TTTAATTTGC 


TCTAAAATTG 
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CAATCCCTGT 
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AATTATCTTT 


CAAGACTTGG 


CGAACCATTT 


CTGCCCCTTT 


TTGCCCCTCT 
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ACTCGTAGGC 


TACGTTTAAC 


TTGTTATCTG 


TCTGTTTTCT 


ACCATTAATC 


ACTTGACTAT 


7500 


AACCATCTAT 


ATAGACCAAA 


TTATCACGCA 


TGAAACTGAC 


TAGCTTCATT 


TTCTTATCTG 


7560 


AGCCCCCGAC 


ATTTAATACC 


ATAATAGAGT 


CAGTTCGTGT 


CTCAACACTG 


TTCTGGCCGA 


7620 


TTCGACCATC 


AGTACCCATG 


ATTAAAATAT 


TAACTCCATC 


TCTAGTCTCC 


TGACCATTAA 
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AGACTTCTAC 


TTGAGCTGCC 


CGGGCATCAG 
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TGCGCTAGCA 


TCTTGGTAAC 
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CACGCAAAAA 


CATGAATACC 


ATGGCCAAAG 


CCACACAGAC 


CAAAAGTGAA 


AAAATCACCA 


7800 


TAAAAATTCG 


TTTAAGACGG 


AGCTTCCGTC 


TTTTCTTTTT 


TGGAGGGAAA 


GAGAGTGCTT 


7860 
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CATACCTAGG 
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CGATTCTCAA 


TCAGAGCATC 


CATATCAAGG 


TAATTAGGGT 


CCAAGCCTCT 


TGCAATAGTC 


9360 


GATTTTCCAG 


CCCCCATAAA 


CCCTAATAAC 


ACCTTAGCCA 


TGAATCAAGC 


TCTCCAAATC 


9420 


ATCAAAGAAA 


CTAGGATAGC 


TGGTATTGAT 


GGCTTCTGCA 


CGGTCAAGCT 


CCACCTCTCC 


9480 


ATCTCCAACC 


AAGAGGGCTG 


CGATAGCTGT 


CATCATCCCG 


ATACGGTGGT 
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ATTGACTCTA 


GCACCGTGAA 


CAGCTGATTT 


TCCTTTGATA 


ATCATCCCAT 


CTCCCGTAGG 


9600 
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ACTAATATCT 
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TCTTACCATC 


CACACTTAAA 
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ATTGTTTCAA 
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CATATCTTCA 


GTATGATTAC 


GGGTGTACTC 


TTTTTCGATA 


ATAACTGACT 


CCCCCTTAGC 


10140 


TTCTAAGGCT 


GCAAACATCA 


AGCCTGACTT 


CACTTGGGCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGGTTTT 


TCCTCCCTTT 


TAAGCGAAGG 


GGAGCCAAGT 


CTCCTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CACTGGAAGG 


GTCACACGGT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGCACACC 


10380 


TGAAATCAGG 


CGAATCGAGG 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 
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ACCAAGGTCA 


CGAAAAACCT 


GCATGGTCGA 


AAGAACCTCT 


TCACCTCGCA 


GAATATCATA 


10560 


AACCTTCGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGCTGGC 


TGATAGACTT 


10620 


GTCACCTGGG 


ACGCCCATAC 


TACCATCTAA 


ATGGCGAATG 


TTTGTTTTTA 


GTTTCATACT 
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CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAAGCCAGAA 
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CAATTTTTTC 


AATATCAGAA 


AGCTAAATGG 
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TTGCTTGGTA 


AAGAATTCTG 
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ACAAGAGCCT 


ATTTCCTTGA 


ATCTGTTTAC 


CAAAGCCTTC 


CATCTTAGCT 


TGGAAGGACG 
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TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAG 


CCAAGATAAT 
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CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTCCAATCGC 


ATCTTTAGCT 


GCTTTAACAG 


11040 


CCTTCTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGCACTA 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACC 


ACTGTATAGT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCACCTC 


TTTCCAAGTC 


TTGAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACGAT 


TTTCCTCGTT 


GATGTGCATA 


TTAACCAAGG 


AAGTTCCACG 


TAGCACTTCC 


11280 


AAAATCCGCA 


GGATGACATC 


TTCTTCATCA 


GGAACGTCAA 


CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


GTCTGGCGTT 


GTTCACGCGC 


TTGCTTAAAA 


11400 
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AACTTCCAAA 


TTTGCTCTTC 


ATCTCCCTTA 
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GACCAATCGC 
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TCCTTGAAAT 
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ATCCAGAATG 


ATCTCGCTAT 


TGGACAAGAG 


AATGGAGGTC 


11520 


CACATTCCTG 


CCTCGCTTTC 


CGCAATTCGC 


GTCATATCTC 


GAAAACCACC 


TCCCGCAAAG 


11580 


CGCCTTGCCA 


TCTCATGCTC 


TTGAGCATAG 


ACCGCAGTCT 


GCTCCATGAG 


ACTAGAAGCC 
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AAAATATGAG 


GAAAATGGCT 


AATCTGAGAA 


GTGACACGAT 


CATGCTCCTT 


GGCATCAATC 
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TCGATAAAAC 


GAGCATGAAG 


ACCTGAAAGC 


AGATCCTTCA 


TTTCCTTAAG 


CGTGTCCTGA 
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TTGAAGGTGT 


AAAGATATAA 


TAGGCATTTT 
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GACATCTGCC 
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GCGAACAGAC 
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ACCAGCATCT 
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GAAATAATAA 


CGCCTTCTCG 


CAAATCCAAA 


TTGGCCAACT 


CCTTAATCAA 


AGCAATACTT 
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TGTTTGATTG 


GCAAGCTGAG 


GATAATGACA 


TCTGCCAAAG 


GAGCAAAACT 


AGCAAAATCA 


12060 


TCCGTTGCAC 


GCTCAATCAT 


ACCTTCTTTC 


AAGGCGATAT 


CTCTCGAAGC 


TTGACTACGA 


12120 


TTATAACCTA 


AAATTTCATA 


ATCTGGATGA 


TCGCGTTTGA 


TACCAAGTGC 


CAT AGAGGCT 


12180 


CCAA7CAACC 


CAAGACCTGC 


GATATAGATT 


GTTTTTGCCA 


TAGGAACTCC 


TTAATAGTTC 


12240 


TTTGTATAGT 


CTCGGTGTTT 


GGCTACCGCT 


TCTTTTAGTT 


CCTCAAGATT 


ATCTGATGAG 


12300 


AATTTTTCGA 


GGATTTCTTG 


CGCCAGAACC 


CTTGCTACAA 


CTCCTTCCAT 


GACCATTCCT 


12160 


GCAGCTGGAA 


GAGCAGTCGG 


ATCACTTCTC 


TCCACGGTTG 


CCTTGTAAGG 


TTCCTGGGTT 


12420 


TCGATATCCA 


CACTCATAAG 


AGGTTTATAA 


AGAGTAGGAA 


TGGCTTTCAT 


GACCCCACGA 


12480 


ACAACGATGG 


GTTGCCCATT 


AGTCATACCA 


CCTTCAAAAC 


CACCTAGATT 


ATTGGTACGG 


12540 


CGAGTATAAC 


CGTCTTCTTT 


AGACCAGAGA 


ATTTCATCCA 


TAACTTCGCT 


GCCTTTACGA 


12600 


TAACCAGCCT 


CAAAGCCAAG 


ACCAAATTCC 


ACCCCTTTAA 


ACGCATTGAT 


AGAGACAACA 


12660 


GCTTGAGCCA 


ATCTTGCATC 


CAATTTTCTA 


TCCCATTGGA 


CATAGGAACC 


AAGACCAACT 


12720 


GGAACGCCTC 


CGACGACTGT 


CTCCACAACC 


CCACCGATGG 


TATCACCATC 


ACGTTTGATT 


12780 


TGGTCAATAT 


AGTCCTTGAT 


TTCCTGTTCT 


CCTTCTTCCT 


TGACAATAGA 


AACTTCAGAC 


12840 


TGGGCAGCTC 


TTTGCTTAAT 


TTCAGCGACT 


GTCAGATTTT 


CAGGAACATC 


GATTTCCTTG 


12900 


CCACCAAAGA 


CCACGACATG 


GTTGGCAATC 


TCCATATCCA 


GCTCAGCCAA 


GAGGCGTTTG 


12960 


GCTACTGCAC 


CAACTGCCAC 


CCGCATGGTG QTTTCACGAG 


CTGATGAACG 


CTCCAAAGAA 


13020 


TTTCCCAAAT 


CATCAAAACG 


GTACTTAATC 


CCCCCAACCA 


AATCGGCATG 


ACCTGGGCGA 


13080 


GGATGAGTAA 


TTTTCCGCTT 


GCTTTTAAGG 


CGGTCTTCAA 


TGTCCTCCGC 


AGACATGATG 


13140 
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TCCACCCATT 


TCTGGTGGTC 


CTTATTGATG 


ACATCCATAC 


TAATAGGCGC 


CCCTGTCGTC 


13200 


TTCCCGTGGC 


GAACGCCCGA 


AGTAAAGACA 


ACCTGGTCAT 


TCTCAATCTT 


CATACGACCA 


13260 


CCACCACCGT 


AGCCACCCTG 


ACGGCGTCTA 


AGGTCCTCAT 


TGATATCCTC 


AGCTGTCAAT 


13320 


GGAACTCCAG 


CTGGAATTCC 


CTCAATAATA 


GCTGTTAGAC 


GGGGGCCGTG 


TGATTCTCCT 


13380 


GCAGTTAAAT 


ATCTCATACA 


CTCTCCTTAT 


TTTACCAAGT 


AGTCTTTCAT 


CTCTTCCAGA 


13440 


GAAACTGGCT 


GAATGGTCGC 


TGAACCAAGC 


TCTGGCACCA 


AGACCAATTT 


CAAGGTGTTA 


13500 


CCACCCCCTT 


TCTTCTCATG 


AGTAAGAGCC 


TGATAAAGCT 


TGCCAACTTC 


CCAATTTTCA 


13560 


TAGTCAACAG 


GCAAACCGAA 


TTTCTGACAC 


ATCTCTGTGA 


TAGATTGGGT 


AATGCCAGCT 


13620 


GGCATGAGGC 


CTTTTTCCTC 


AGCAACCTTG 


GAAATCTCTA 


CCATTCCCAT 


GGCAACAGCC 


13680 


TCTCCATGCA 


TGACCTTGCC 


ATAACCGGCA 


GTCGCTTCGA 


TGGCATGGCC 


AATAGTGTGG 


13740 


CCAAAATTGA 


GGTAAAGACG 


AATACCATTC 


TCCAACTCAT 


CTTCAACCAC 


CATCTTGCGC 


13800 


TTCACCTGAC 


AAGAATGTTC 


AATCAAGGTC 


TCTGCATGTT 


CCAAAATACT 


CTCAACAGAA 


13860 


CCATTCAGTC 


CCGTCAAGAG 


AGCCCACAGT 


TCTGGATCCT 


CAATCAAGCC 


ATACTTGATA 


13920 


ACTTCACCCA 


TCCCTTCAAT 


CAACTCTCTT 


TTTCCGAGGG 


TTTCAAGAAC 


AAGTGGATCA 


13980 


ATCAGAACCC 


CATCTGGTTG 


CGCAAAGGTC 


CCCACCATAT 


TTTTAGCAAA 


TGGTGTATTA 


14040 


ACCCCTGTCT 


TTCCACCGAT 


AGAAGAATCA 


ACCTGAGCTG 


TCAAACTAGT 


CGGAATCTGA 


14100 


ACAAAGTGAA 


TACCCCGCAT 


ATAGGTAGAG 


GCTACAAATC 


CAGCCAGGTC 


CCCAACAACG 


14160 


CCACCACCAA 


GAGCAACGAT 


TCCATCGCTA 


CGACTCAGAC 


CTTGCTTGAC 


TAGAAATTCA 


14220 


TAGACTTTCT 


GAACAGTAGT 


TAAATTCTTT 


CTTTCTTCAC 


CTTCTAAGAA 


ATCAAAAACA 


14280 


GCTACCTGAA 


AACCAGCATC 


TTCTAGGCTG 


AGCTTGACCT 


TCTCTGCATA 


GAGACAGGCT 


14340 


ACATGGTTAT 


CTGTCACAAT 


GACTACCTTT 


TGCGGTTGCC 


AGAGTTCTCC 


CAACCACTGA 


14400 


CCAGCCTGGG 


CCATACAACC 


TTTTTCAATC 


TGAATATCAT 


AAGGATGCTG 


AGGAATATCG 


14460 


ATTCTGATTT 


TCATAGGAGA 


CTCTCCCTTT 


CTTTATTGGT 


ATTTTTCTGT 


TAAAGACTGC 


14520 


CAAATCTCTT 


CTGTCGGCAT 


TTCCTTGCCT 


GTCCACAGTT 


GAAAAGCTTC 


TGCAGCTTGA 


14580 


TACAGTAACA 


TTCCCAGACC 


ATTCACTGCT 


GGATTGCCCT 


GACTTCTAGC 


CCATTTCAAA 


14640 


AACGGTGTTT 


CAAAGGGTTG 


GTATATGATA 


TCTGCAACTA 


AAAGACTTTC 


TGGTAAGACT 


14700 


ATGTTTTCAG 


GAACAGGAGA 


GGATTGCCCA 


TCCATGCCCA 


CACTGGTGGC 


ATTAACTAGC 


14760 


AAATCCGACT 


CGGCAATCCT 


TGCTTGCAGT 


TCAGAAACAT 


ATTCTAAAGC 


ACACAAATCC 


14820 


ACTTTAAAAC 


CTGTCTGCTC 


CTGTAACTTG 


TCTAGGTAAG 


GTCTTGTTTT 


TTCCATAGAA 


14880 


ACGGAACGAA 


CAAAGACCGA 


AATCTGACTG 


ACGCCATCCA 


AAATAGCCTG 


TGCCAAGATT 


14940 
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GATTTAGCCG 


CACCACCTGC 


ACCCAGCAGG 


GTCATCTTTT 


TACCTOAAAT 




1S000 


GCCAAGCACT 


TAAAAAATCC 


CTTGCCATCT 


GTATTATATC 






15060 


TTGACAACCG 


TATTAACCGC 


ACCAATCAAG 


CGCCCTTCAT 






15120 


GGAATCACCT 


GCTCCTTATA 


GGGCATGGAC 


AGATTGATGC 


CAAACATCTG 




1S180 


ATATTGGCCA 


CTGTTTCTAC 


CAAGTCACTC 


GCTTCAATCT 


CCCAAGCCAC 




15240 


TTGGTAGCTG 


TCGCCTCAAA 


GGCTCTATTG 


TGGATGAAGG 


GAGAAATAGA 


ATCCTTAATA 


19JVV 


GGATTGGCAA 


CAACTCCAGC 


TAAACGTGTA 


TACCCATCAA 


GCTTCATCCA 


AAATCTCCCT 




GATTTTTTTC 


ATGCTAGCTA 


GAGAAATCTG 


CCCAGGGGCA 


CTAACCTCAT 


CCAGACTuCC 




AAAAGACCAA 


CTCGAACCAG 


TCACATCCGC 


ACTGATACGA 


GAGACCTTGC 


CCACCTTACC 


l34dU 


CATAGAAATG 


GTCACATATT 


CCTGTTCAGG 


ATTGAGGGTT 


TTAAAGCCTC 


GTGTATAGTT 


15540 


CATCAAGTCT 


AAGACATCCT 


GCTCCGTGTG 


AGCCATCACC 


GCAACCTTAA 


CAAGTTTTGG 




ATTTAGGATC 


GTCAACTCTG 


ACAAGATTTC 


CATCATGTTC 


TCAGGTGTTT 


CTTGGAAATT 


lboou 


ATGGTAACTC 


AAAACAAGAT 


TTCGGAAGTC 


CAGCATTTCC 


TCAAAAACAT 


CCTTGTAGCT 


15720 


ATAGTACTCA 


AAATCAATAT 


AGTCTGGTTG 


ATAGAGTTCC 


GCAACTTCCT 


TGATTAGATG 




GATATACTCT 


TCTGGAGAAA 


GGTCGATTTC 


TCCACCTTCG 


GACCCAGTTC 


GTAGCGTGAA 




AACCAACTCA 


CGGCCTGCGA 


ATTTTTCAAA 


AATGGCTGGA 


GCTACCTGCA 


AAATCCCTTC 


199UU 


TTTAGGCAGA 


TAGTCGGCAC 


GCCATTCAAT 


GATGTCGGCA 


TCCAGGTACC 


TCCTGGCATt 




CAGAGCCTGA 


GCCTCCTCTA 


AACTTCTTGG 


CATTACTGAA 


ACGATTAATT 


TCAu iALfA 




ACCTTCATAC 


TAATCACCTT 


GAGGTAATTA 


CTACTTTCAT 


CTT'riTTATT 


ATAGCCAAAA 


16080 


TCTGCTCGAA 


CACCATATTT 


GTTTAAAATC 


TGGTAACTTC 


TTCCTGCAAA 


ACUi i l ATUA 


16140 


ATTTGTTCTG 


TAAATTTCTG 


ACGGGAAACA 


TTGGCACCAT 


TGGTACTGGC 


An 1 V*>\. 4 1 V_ 


1 6200 


CCTCCCGGAT 


TTAAAATCTC 


AAGACTCTGG 


GAAATCAACT 


TGTGATAATC 


CTTGGCCACA 


16260 


GAGAAAGTTT 


CTTTTTTATT 


CCCAGCAAAC 


CTAGGCGGAT 


CTAGGACAAT 


CACATCGTAG 


16320 


GTCAAGTCTT 


TGCGTTTCGC 


ATATTTGAAA 


TACTCAAAGA 


CATCCATGAC 


TATAAAACGA 


16380 


TGCTCGTCTC 


TGCTGAGCCC 


ATTTGCCTGA 


AAATGCGCTT 


GAGACAATTC 


TCGTCAACGT 


16440 


TTGGCTAGAT 


CAACAGAAGT 


TGTATGGCTA 


GCTCCTCCCA 


TGGCCGCAGC 


TACTGAAAAA 


16500 


GCCGCTGTGT 


AGGAAAACAT 


ATTGAGTAAG 


GATTTACCCA 


TAGCCAAGCC 


GTCAACTAAA 


16560 


CTACCGCGAA 


CCTCATGCTG 


GTCTAGCAAA 


ATTCCTGTCA 


TCAAGCCATC 


ATTCATAAAG 


16620 


ACTTGATACA 


GGACACCATT 


TTCTAAAACA 


TTGAAAAAGT 


CAGGTGCTTC 


TTCACCATAA 


16680 
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ACATGGCCAG 


ATTCATAGTC 


CAAACCCTTA 


AAGCGGATTT 


TCTCATAAGC 


TCCT AAAACC 


16740 


TCAGGGAAAA 


CCTGTCTAAA 


GGCTTCTGAT 


ATAGTCTGAC 


GAATCTGATA 


AACATAAGAG 


X6800 


TTATACCAAG 


AAAAGACGGC 


CTAGTCGCCA 


TAAAGGTCCA 


CTCTCAGACC 


CCCAAAGCCA 


16860 


TCTCCCTCTT 


GATTAAAGAG 


ACGAAAGGCA 


GTTGTCAAAT 


CATCTTGATA 


GTAGGCGTTT 


16920 


CTCTTTTCTT 


TGGCTTTTCT 


AAACAACGTT 


TCAAAGAAAG 


CTTGATTGAA 


GGCCACCTTG 


16980 


TCTTTGCTGA 


TAAACCAGCC 


CAAGCCCTTG 


TTTTCCTGAG 


AAAGGTAGGC 


AGTCCCAAGA 


17040 


AAGTTTCCTT 


CCTGACCCTG 


CACCTCTACT 


TCCTGATCCT 


TAAGATTGAC 


ATTCTCAAGA 


17100 


TCACTGGCTT 


CTACTAAAAC 


TAGCCCCTTA 


GCAAGCTTCT 


TTTCAACCCT 


TTTGCPGACT 


17160 


CTTATTCTAT 


TCATAACTAC 


CATTATATCA 


AACTTTTAGA 


CAATTCTCAA 


AAAACAAACT 


17220 


ACCCTTGCTT 


TTTTACTCTT 


CTTTTAAAAA 


ATGGTATACT 


AGACTTCCTG 


CAAAACTAGG 


17280 


AAGTAAATGT 


GTAAGAATCA 


CAGTAAAAAA 


TGCTCTTCCC 


TCTTGGAGCA 


GCATTTCTTT 


17340 


TTATCAACGA 


AAATCAAATA 


GCAAACTATG 


AAACTAGCCT 


CAGGTTAACT 


CTCAGATTAT 


17400 


AGGTAGAGAG 


GTTGTATCAG 


CAATATGTGT 


CTGTCAAATT 


TAGTGACAAA 


GGTAGTAGAA 


17460 


GAAAGATAAA 


GAAATAAATC 


AGCTTCAGTA 


GGTATCTGGA 


AAATTTGATT 


TTATAGAGAA 


17520 


GCCTTTTGTT 


ACAAACTCAA 


TATACTATCA 


ATAAATAATA 


TTATAGAACC 


AACAATAATT 


17580 


ATAATTTCAC 


CTATCTGCAT 


CATTCTATTT 


CGAACTCTAA 


ATATATCTTC 


TATCAAAAAT 


17640 


ACTTGGAACA 


CACACATTAT 


AGGAATTAAC 


CTTTTTGAAA 


TTCAAAAATA 


TCCAAATAAA 


17700 


TAAACTATAA 


ACAACAAAAA 


TAGAACTATG 


TTATATTTCT 


TATTCAAAAC 


ATTCCTCCCT 


17760 


ATATATTTTT 


GATTACCAAT 


CTTAATCATT 


TACAACTACA 


TTCTAACAAA 


CTATAAAAGC 


17820 


GTTTGTCGAA 


TTGAATTTAT 


CAAGCAAGCG 


ACCAACCAGT 


TCATCTTTTT 


TCTATTTCTG 


17880 


CCAATATGCG 


TGACACGTAA 


TAATGATAGC 


CAAAAATAGC 


AAGAGCAAGC 


AAGACGATAA 


17940 


GAGCTCCTAC 


TCCCAAGCTG 


ATGGCAAGGA 


TAGGGGAGAG 


AGACTGAACC 


AAGAATATGC 


18000 


TCCCAATTAC 


AAGGGCCATC 


AGGATTGCAC 


TATAAATAAA 


CAATAAAACT 


ATGGCGAC fA 


18060 


TGCCATTTGA 


ACGATTCACC 


AGGTCCGTAA 


TCCTACTCCA 


ATTGGTTGAC 


AGATTTTTAA 


;ei20 


CGTCCTTAAA 


GTAATGGTGG 


CAAGAAAGGA 


TGACACTGGC 


AATGATCCAG 


ACTACAAGAA 


18180 


GGTAAATCAT 


CGAAATGATG 


GGCAAGCCTA 


GATATAGAGA 


AAGACCAAGC 


AAAGTCAGAA 


18240 


CTGGTAAAAA 


GGACTGGACA 


GCATATATAA 


TCCAAAATTT 


CACTTTCACA 


TAACGAGCAA 


18300 


AGTCAAAGGG 


TAAACTCTTA 


AGAAAATCAA 


CATTTTCCCT 


CTCCAAGGAC 


AAGGCAATTG 


18360 


AATGCAGGCT 


GCTGATATTG 


TTATTGACAA 


CTGCTATAAA 


GAGAGCTATA 


AAAAACAAGG 


18420 


GTAACCAGTA 


TGGAGGATGA 


ATGTCTGGAA 


CTATCTGAGA 


ATCTCGGATT 


TTGGAAATCA 


18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTCCTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18720 

TT AGTCCT AT CCAAGCGAGC AGACCCACTA AGCCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA CAAATGGAAT GTCTCTAACT TTGTCAACAA 18840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA AT AG A CTG AT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18960 

AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA TAATAAGCTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 19260 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT ATATTACTAA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 19380 

TTACTTTTTT 19390 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18436 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGG 60 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACACGAAATC 360 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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CTAACATTAC 


AAAGATTGAC 


AAGATCTTGA 


TTGACCAGCC 


AGCAGACTCT 


ACTGACCACT 


480 


TCCGCGATCC 


ACAAATTTTT 


AACTTTCAGC 


GTCAATATTA 


TGCCATTGTC GGCGGACAAG 


540 


ACTTGGAGAA 


AAAAGGTTTC 


CTTCGTCTCT 


ACAAGGCTGT 


CAATAACGAC 


TACACAAACT 


600 


GGCAACCAGT 


TGGCGACCTT 


GACTTTGCTA 


ACGACCGTAC 


TGCCTACATG 


ATGGAATGTC 


660 


CTAATTTGGT 


CTTTGTAGAG 


GAACAACCTG 


TCCTTCTCTA 


CTGTCCACAA 


GGATTGGATA 


720 


AGAAACTTCT 


AGACTACGAT 


AATATCTTTC 


CAAATATGTA 


TAAGATCGGG 


GCTTCCTTTG 


780 


ACCCTAAAAA 


TGCCAAAATG 


GTAGATGTGT 


CTCAACTTCA 


AAACATGGAT 


TACGGTTTCG 


840 


AAGCCTATGC 


AACTCAAGCC 


TTCAACGCTC 


CTGATGGGCG 


TCCTCTAGCA 


CTTAGCTGGC 


900 


TTGGTTTGCC 


AGATGTTTCT 


TACCCATCTG 


ACCGTTTTGA 


CCACCAAGGA 


ACCTTCTCTT 


960 


TGCTCAACGA 


ACTCACTATC 


AAAGACGACA 


AGCTCTACCA 


CTATCCAGTC 


GCTGCTATTA 


1020 


AGGACCTTCG 


TGCTTCTGAA 


CAAGCCTTCT 


CAAACCGTTC 


CCAAACCAAG 


AACACTTACG 


1080 


AACTTGAACT 


CAACTTGGAA 


GCTAATACCC 


AGAGCGAGAT 


TCTCTTACTT GCTGATAAAG 


1140 


AAGGTAAGGG 


ACTTTCAATC 


AACTTTGACC 


TTGTAAACGG 


TCAAGTAACA 


GTGGATCCTA 


1200 


GCCAGGCTGG 


AGAACAGTAT 


GCCCAAGAAT 


TTGGGACAAC 


TCGTTCTTGC 


CCTATCGAGA 


1260 


ATCAGGCTAC 


TACTGCTACA 


ATCTTCATCG 


ATAACTCTGT 


CTTTGAAATT 


TTCATCAATA 


1320 


AAGGAGAAAA 


AGTATTTTCT 


GGTCGTGTCT 


TCCCACATGC 


GGACCAAAAT 


GGTATCCTGA 


1380 


TTAAATCTGG 


AAACCCAACT 


GGAACTTACT 


ATGAATTAGA 


TTATGGTCGC 


AAAACTAACT 


1440 


GATGTCCCCA 


AACTTGCAGG 


CGTCAGTCCT 


ACTACCGTTT 


CTCGGGTTAT 


CAATAAAAAA 


1500 


GGGTATCTAT 


CTGAGAAAAC 


CATCCAAAAA 


GTCAATGAAG 


CCATGCGACA 


ATTGGGCTAT 


1560 


AAACCCAACA 


ACCTGGCTCC 


TAGTCTGCAA 


GGAAAATCAG 


CTAAGTTAAT 


CGGCTTGATT 


1620 


TTCCCCAATA 


TTTCCAATGT 


TTTCTATGCA 


GAATTGATTG 


ATAAATTGGA 


ACACCAACTC 


1680 


TTCAAAAATG 


GTTACAAGAC 


CATCATCTCC 


AACAGTGAAC 


ATGATTCTGA 


GAAGGAACGC 


1740 


GAATACATCG 


AAATGTTGGA 


AGCCAATCAG 


GTGGACGGCA 


TCATTTCTGG 


TAGTCACAAC 


1800 


CTAGGAATCG 


AAGACTACAA 


TCGTGTGACA 


GCGCCGATTA 


TTTCCTTTGA 


CCGAAACCTA 


1860 


TCGCCAGACA 


TCCCTGTCGT 


CTCCTCTGAC 


AACTATGCTG 


GTGGGGTTCT 


TGCTGCCCAA 


1920 


AC CTTGGTCA 


AGACAGGTGC 


CCACTCTATC 


ATCATGATTA 


CAGGGAATGA 


CAATTCTAAT 


1980 


TCGCCAACCG 


GACTGCGCCA 


CGCTGGTTTT 


GCATCCGTAC 


TCCCAAAAGC 


TCCTATTATC 


2040 


AATGTTTCCA 


GTGACTTTTC 


TCCCGTCAGA 


AAAGAAATGG 


AAATCAAGAA 


TATCTTGACC 


2100 


CGGGAAAAAC 


CAGATGCCAT 


TTTTGCTTCG 


GATGATTTGA 


CAGCTATTCT 


GGTCATTAAA 


2160 


ATCGCTCAAG 


AATTGGGCAT 


TTCTGTCCCA 


AAAGAGCTCA 


AGGTCATCGG 


CTATGATGGG 


2220 
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ACCTA CTTT A TCGAAAATTA CTACCCTCAA TTGGCTACTA TCAAGCAACC TTTGGAAGAC 2280 

ATTGCTTGTC TCACTATTGA TCTTCTCTTG CAAAAGATTG AAGCCAAGCA AGTCGCCACA 2340 

ACTGGTTACT TCTTACCAGT TACCCTATTA CCAGGAAAAA GTATTTAAAC ACAAGAAAAC 2400 

TCAGACCGAT TCGTCTGAGT TTTTATGATC TTAAATTTTC GAGATAGCGC TGGGCTGTCT 2460 

CTAGGTTAAA GGTTTTATCT GAGATGAGGC GCTCTACTAG GGGAGCAACT TCAGATTCAC 2S20 

TAGCCCCAGC TAGGAGAGCT ACGGATTTGC CCTGTAGTTT CATGTGGCCT TGCTGCATCC 2S80 

CCGTACTTAC CAAGGCTTTG AGGGCTGCAA AATTTTGAGC AAGACCGATG CACACGATAA 2640 

TCTGGGCTAA TTCTCTCGCA GAAGGATTTC CTAGTAGATC ATGACTGAGA ACTACACGTG 2700 

GGTTGAGGCC GATAGAGCCA CCCTTAGTCC CTACAGGCAT GGGCAGGCTC ATCTCACCGA 2760 

CCAATTCTTC TCTTTCAAGG TCCAGCGTCC AGCAGCTAAG ACCTTGATAG CCTCCATCTC 2620 

GACTGGCAAA GCCATGGGCC CCAGCTTCGA TGGCACGCCA GTCATTACCA GTGGCAATCA 2880 

AAATCGCATC AATACCATTA AAAATTCCTT TATTATGAGT AGCAGCTCGG TAAGGATCAG 2940 

CCTGCGCAAA CTGACTAGCC AACGCAATTT TCTCCCCAAT CTCTCGTCCT TGATCCTTTT 3000 

GGCGGCTCAA GTAGCGAAAG GCGATGCGAC AGCTTGCAGT CACCAGAGAA TCGCTCCCGT 3060 

AGTTGGACAG GATTCCCATG AGACTCTGTC CCTGACTGAG TTCTTCTAAG ACTCGTTTCA 3120 

AGGCTTCCAG CATGGTGTTG AGCATATTGG CACCCATGGC TTCCTGGGTA TCGACATGAA 3180 

TATAAACAAC GAGAAAGTCT GGTTCGCCTT TTATCTCCTC GACATGCACA TCACCCCCCC 3240 

CACCTCCACG TTTAACGATA GAAGGATACG CTTGATTGGC AAGCTCCAAG ACCTCCGCTT 3300 

TCTTGCTGGC AATCTTCTCT TGCGCTAGTT TAGCATTAGC AACTTGATAA AGGCCTACCT 3360 

GCCCAATCAT CTGTCGCTGA TGGACTTGTG CAGTAAAACC ACCTGCACGC TTGATGATTT 3420 

TGCTGGCATA CCTGGCCGCC GCAACCACAG AGGGTTCTTC TGTCACATAG GGAACGGTGT 3480 

ATTCCTGACC GTTGACAAGT ACCTCCGGAA CCACTGAATA AGGCAGAGAA AAAGTTCCCA 3540 

CTACATTCTC ACTCAGCTGG TCTGCCACAG TCACGCTCAT CTGTTCATCC TTCTCCAGAC 3600 

TAGCTTGTCT CTCAGGACTA AGGAGCGCCT GAGCTTTTAA CAGCTCGAGG CGCTCTTGGT 3660 

ATGATTTTTT AGAAAATCCA TTCCAACTTA TCTTC ATT AT TTTTCAACCT TGCTATAACG 3720 

GCGTTGGTGC TCGAGAATTT CAACCAAGGC AAAATCTTGA TTTTCATACC CAGCAAACTC 3780 

GGCAGACTTA GTTTCATCCA AGTTTACTTC CTCAAAAAAG ACCTTTTCAT AGTCTGCAAC 3840 

GGATAGGGCA GTTCGTTGGT TCAGCTTCTT CAAACGGTCT TTATCCAAAT AAGCTTCATA 3900 

TCCTTCAACC AATTCACCAC TGAAGAACTC AGCCACAGCT CCACTTCCGT AACTATAAAG 3960 
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CCCGATTTTA 


TCCCCAGCTT 


TCAAGCTATC 


TCTATTTTCC 


AAGAGAGACA 


AAAGTCCAAG 


4020 


GAAAAGTGAA 


CCTGTCTAGA 


TATTCCCCAC 


CTTTTGACTG 


TAGAGAATAG 


ACTGGTCAAA 


4080 


ATGCTTTTGT 


AAGACGTCTT 


TTTTCTCTTG 


AGGCAGGCTC 


TTATCCATGA 


TTTTTTTCAA 


4140 


GCCTTTTAGC 


GCTAATTTAG 


GATAAGGCAA 


GTGGAAACAA 


ACAGCCGCAA 


AATCATCCAA 


4200 


AGTAAGCTGG 


TAGCGTTTTT 


GATATTCAAG 


CCAAGTCGTT 


TTCAAACTAT 


CCAAGTATTC 


4260 


TTGGGTAGAA 


TAGACACCAT 


TTACATAAGG 


AGTTGTCGAC 


TAATTTGGTC 


GCCAGAAATC 


4320 


CATGATGTCA 


CGGGTCTGAG 


CTACATTGTC 


ATTATTAAAG 


GCCATCATGC 


GTGGATTTTG 


4380 


TGTAATCAAC 


ATAGCTACAC 


TTCCAGCACC 


TTGAGTTGGT 


TCTCCTGGAG 


TTTCAATACC 


4440 


GTATTTGGCA 


ATATCACTGG 


CAATGACCAA 


GACCTTGGAC 


TCCGGAGAAT 


TTTCCACATG 


4S00 


CAATTTGGCA TAATGGAGGG CAGCAGTCGC 


TCCGTAGCAG 


GCTTCTTTAA 


TCTCGAAACT 


4560 


ACGAGCAAAG 


GGCTGGATGC 


CCAGCAAGCC 


ATGCACAAAG 


ACGGCCGCAG 


CCTTACTCTG 


4620 


GTCAATTCCT 


GACTCGCTCG 


CCACAATGAC 


CATGTCAACT 


TCTTGTCTTT 


CTTGCTCAGT 


4680 


TAAAATAGAG 


TCACTACCAC 


TGGCCGCCAA 


GGTCACGATA 


TCCTCAGTTA 


GGGGCGCAAT 


4740 


ACTCAATTCC 


TTGAGTAAGA 


CTCCTTTACT 


TAATTTTTCA 


GGGTCAATTC 


CCCTCGCTTC 


4800 


TGCTAACTCT 


TGTAATTTCA 


ACACATATTG 


ACTGGTCGCA 


AAACCAATCT 


TATCAATACC 


4660 


GATTGTCATA 


TTTACCTCTG 


TTTTATCATT 


CATGTAAAAA 


AT CCTTCT AT 


ACT ATTTT AT 


4920 


CACAAATGGC 


AGTAAAAGAG 


ACAAAAAAGA 


CTTGATTCAC 


CAAATCAAGC 


CTCTTATTGG 


4980 


TCATCATTTT 


AAAGAATGAT TAGTTGCTAG AGAGTTCACC 


CATATAAGTA 


GCTTTATAAG 


5040 


CTCCATTCAC 


AGTTATCAGC 


TCCTGGAGGA 


TCAAATTTCC 


TGAGTAAGTC 


CTTCCCATCT 


5100 


CATCTACAAA 


TTTTTGATAA 


AACTGACTGG 


TCGGAATTTC 


TCTGACATCC 


TTATCAAATG 


5160 


TCTTATCAAG 


TGTTTTACTA 


ACCTTCTCAG 


CAATCAATTG 


ATGCTCTTGC 


CATCCACTTT 


5220 


GAAACTCTGA 


GCCCGAACTA 


GAAACCATGA 


CTGGGATAAA 


CAACAAGCTC 


AGTAGATTTA 


5280 


CAGACAATAA 


GGAAAGTAGT 


AGACTTCCTG 


CAAAACTAGA 


ATCCTAGTTC 


ATGATTGATA 


5340 


ATACCAGCAA 


TCAAATTCAT 


TCGTAATCCG 


AAGCGTTTAC 


GATGATTTCG 


ATAGGTTGTT 


5400 


GAAAACATTT 


TAAACGTTTT 


TACTTTGGCA 


AAGATGTTCT 


CAACCTTGCT 


TCTCTCCTTA 


5460 


GATAGCGCAT 


GGTTACAGGC 


TTTATCTTCA 


GCTGTTAGCG 


GCTTGAGTTT 


GCTGGATTTA 


5520 


CGTGGAGTTT 


GTGCTTGAGG 


ATATATCTTC 


ATGAGCCCTT 


GATAATCACT 


GTCAGCCAAG 


5580 


ATTTTACCAG 


CTTGTCCGAT 


ATTTCTGCAA 


CTCATTTTGA 


ACAACTTCAT 


ATCATGACTA 


5640 


TAGTTCACAG 


CGATATCCAA 


AGAAACAATT 


CTCCCTTGAC 


TTGTGACAAT 


CGCTTGAGCC 


5700 


TTCATAGCGT 


GAAATTTCTT 


TTTACCAGAA 


TCATTCGCTA 


ATTGTTTTTT 


AGGGCGATTG 


5760 
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ATTTTTACTT 


CCGTCACATC 


AATCATTATC 


GTGTCCTCAA 


AGCTGAGAGG 


ACVfCTTGAA 


5020 


ATCCTAACAC 


CACTTTGAAC 


AAGAGTTACT 


TCAACCCATT 


GGCTCCGACG 


GATTAACTTG 


S690 


CTTTCGTGCA 


TACCAAAATC 


AGCCGCAATT 


TCTTCATAAG 


TGCGGTATTC 


TCGCACATAT 


5940 


AGAAAGCGTT 


ATCAATTTAT 


TTATCTCATT 


TTTCAGAAAA 


TYLTTTTATT 


TCTGTAAAGT 


6000 


CTACGATACT 


CGATGTGTTT 


TTATATAATG 


ATAGAGTCTG 


AGAATCACTC 


TTCCGCTAGC 


6060 


CATTCCAATA 


GAG ATT AC C A 


AAGCCAACAT 


GACAACCAAG 


CTCGCACTTG 


CCAGTGCTTT 


6120 


ATTATAGTCC 


CCTGTCACAA 


AAAAGGCAGT 


TGTTCGGTAG 


GAGACATAAC 


CTGGAACCAG 


6100 


CCGTGCCAAA 


ATGGCCAAGA 


TAAAGACCAC 


AGCACGTGTC 


TTATAAAGAA 


TACTTAAAAT 


6240 


CTGGCTCACA 


CAAGAACCAA 


TAATGGCTGC 


AATGAAGGTA 


GCTACAATGA 


CATTGGTCGC 


6300 


TTCCTTGAGC 


AAGAGATAGA 


TTAGCCAGAC 


ACTCATGCCC 


AAAATCCCTC 


CAGGTAAGAG 


6360 


CATACACCGT 


TGCACATTCA 


GTACGATTAA 


AAAAGTGATA 


ATGGCAACAA 


AACTTGCTAC 


6420 


TGCTTCTAAT 


AAAAAGGTTG 


TTAGTGTCAT 


ATTACTTCAT 


CAATACCAAG 


GCGACAGAAG 


6480 


TTCCTGCCCC 


TAAAGCGAGG 


GTAATGAGCA 


GGGATTCAAA 


CATCTTACTC 


ATACCAGAGT 


6S40 


TTATGTGGTT 


GGTCATAATA 


TCACGGACCG 


CATTGGTCAA 


GGCAATACCT 


GGTACAAACG 


6600 


GCATGACCGC 


ACCAGCT AT A 


ATCAAATCTG 


CCGTTGAAGG 


AAAACCTGTG 


TAGCGAGCCC 


6660 


AAAACTGGGC 


AATTATCCCA 


AAGACAAAGG 


CTCCACCAAA 


GGCTGTCACA 


AAGGGAATTC 


6720 


GGATAAATTT 


TTCCACATAG 


AGGGAAAAGG 


CAAAACCAAA 


TAAGCTCGCC 


ACTCCTGCCC 


6780 


CAAGTGCGTC 


GTAGATATTT 


CCGCTAAACA 


TAACTGAAAA 


GAAAGGACCA 


CTAAAGGTCG 


6840 


CAGCCAGAGT 


TACCTGCAAC 


TTACTATAGG 


GAAGGGGTTT 


GGCTTGCAAG 


GCCGTCAATT 


6900 


GCTTAAAGGC 


TGTTTCTAAG 


TCAATCTCCC 


CCCCAACTAG 


CTGACCAGAA 


ATCTGGTTCA 


6960 


CATCGCAGAC 


TTTTTCGATG 


TTATAAGAAC 


AGGAGGTCAC 


GCGCTTCATC 


CGCAAATATT 


7020 


GGTATTTTCA 


ATAGAGAAAA 


AGATAGCGGC 


AGGCATGGCA 


AGGACATTGC 


AATCCACAAT 


7080 


CCCCTGCGAA 


TGCGCGATTC 


GAATCATGGT 


ATCTTCTACA 


CGATGGATTT 


CTGAGCCACT 


7140 


TTTAAGGAGA 


ATAGTCCCCG 


CTAGCATAAT 


CACATCAATG 


ACGCCATTTA 


ATTCTTTTGA 


7200 


TTCTTCCATG 


CTTTCCTCCT 


TTTATCAACT 


CCCTCTATTC 


TATCACAAAT 


CCGGACTCAA 


7260 


AAAAAATCTT 


TGCCATGAAA 


TCATGACAAA 


CATTGATTAC 


TCATTTTGAT 


TATCCATCTG 


7320 


CTTTTAAGGA 


GTAGCTGAAG 


TTGTTTTAGG 


TTTGTAGATT 


GAAATCTTGA 


CTCTAGTCTT 


7380 


ATTGAGGTCT 


ACCTTTTCAC 


CTGCTCTAGG 


ACTTTGTTCA 


ACAACCATGC 


CTTCTGCACT 


7440 


ACCTGCAGGC 


GCTGTCGTCA 


CTTCTACAAC 


TTCTAT ATT A 


GCTTCCTTAA 


TCCCAACAAT 


7500 
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TTGAATCAAA 


TTGTTCTTAG 


TAAACTCCAA 


GCTAGAACCA 


ATCTAACTCG 


GCATGGCAAC 


7560 


ACTTGTAACT 


TTTTTAGCTA 


CTGTCAAGAC 


AATTTGACTA 


GGTTTACTCA 


CATCATAAGT 


7620 


CGTTCCCGCA 


CCTGGACTTT 


GTTTCATAAT 


CGTTCCTGGT 


TCGCTTTCGC 


TGGACTCTTC 


7680 


TTCCTCTATC 


TTAATCAAAT 


TCTCAGGAAC 


CTTCTTCTGC 


TTGAGTTCTG 


AGATTACTTC 


7740 


TGTAGAGTTC 


CGTCCAATAT 


AGTTCCCTAA 


TTGAATCGTC 


"GTKGCTTTTT 


TAGCTACTGT 


7800 


CAAAACAATT 


TGAGTTGCCT 


TGCTCAAGTC 


ATAGGTCCTA 


CCTTCTCGTA 


GACTTTGCTT 


7860 


CACGACCGTT 


CCAGCCTCAC 


TCTCATTCGA 


CTCTTCTTCC 


TCAATTTTAA 


TCAAATTATC 


7920 


TGGAACTTTT 


TTCTCTTTTA 


ATTCCGCAAT 


GACATCAGAG 


GATTTCCGAC 


CGACAT AATT 


7980 


ACTAATTTGG 


AAAGATTGCT 


TGCCTGATGA 


GACAACCAAA 


TTGATTTTCG 


TTCCTTCTTT 


8040 


TCGACCAGTT 


CCAGCGCCAG 


GATCTGTACG 


CATAATCCGC 


CCTTCTTCCA 


CCTTTTCACT 


8100 


ACCCTCTGTC 


TTCTCCTCAC 


CAATCTCAAA 


ATTGGCTTTT 


TTGAGCCTTG 


CCTTGGCCTC 


8160 


TCCAACTGTC 


TGACCTGCCA 


CATCTGGAAT 


GGCAATGGTT 


GCACGACTTC 


TGGATAGTAT 


8220 


CCAAATAAGA 


GAAGCTGCCA 


CCAATACAAG 


GCTGGCCAAC 


AAAATCAGGT 


AACGCATCTT 


6280 


AAATCTATGT 


TTTTTCGGTG 


CTTGTGGTTG 


GTAAGTTTCC 


TCTGTCACAG 


CCTGGCTTGG 


8340 


GTTTTTGATT 


GATTTGTGTT 


CTGTTTGCCC 


TTGAACCTTA 


GGAATAGATG 


TCAAGGTACT 


8400 


CTGAGAAACC 


TTCGGCAAGG 


TCTTGGTATC 


TGCCTTGCTC 


GTTTCATCAA 


AGATTAACTT 


8460 


ACTTTCATTT 


CTACGATTGT 


AGGACAACCT 


ACTAGACAAG 


TCCACATACA 


TCTCTGAAAC 


8520 


CGAGCGGTAG 


CGATTGGTCA 


ACTTTTTAGC 


AGTTGCCTTG 


ATAATAACAT 


TTTCTAAAGC 


8580 


CTGAGGTACA 


GATGGATTTT 


CTGCAATAAC 


GGACGGCAGG 


GGTTTCTGGA 


AATCCTGGAG 


8640 


GGCAATGGTC 


ACCGCGCTAT 


CCCCGTCATA 


AGGGATATGG 


CCTGTCAGCA 


TCTCATAGAA 


8700 


AATAATCCCC 


ATGCCATACA 


TATCACTCTG 


CACAGTCCCC 


TTCCAACCAC 


GCGCCTGCTC 


8760 


TGCTGACAAG 


TAATGAACTG 


AGCCCAACAT 


CGAGTTAGTC 


TGGGTCAGAC 


TTGTCTCTGC 


8820 


AAAGGCTACA 


GCAATCCCAA 


AGTCTCTGAC 


CTTGGCAGTC 


CCATCTGGTG 


TCAAGAGGAT 


8880 


ATTTTGAGCT 


TTCAAGTCCC 


TGTGAACAAT 


TCCTCGAGTA 


TGGGCCAAGC 


GCATAGCCAA 


8940 


GAGAATTTGT 


CCCATCATAC 


GGACTGCTTC 


TTCATTAGAA 


ACAGGATAAT 


GTTCCTTGAT 


9000 


ATAGCGTTTG 


AGGTCCAGTC 


CAGCCACATA 


CTCCATAGCT 


AGGTACTGTT 


GACCGTCTTC 


9060 


CTCGCCAATA 


TCTGTTATCC 


CAACGATATG 


ACGATGGTCT 


AGATCTGCCA 


TAGCTCTCGC 


9120 


TTCACGCTGA 


AAACGAGCTA 


C AGCT ATCGG 


GTCCGTCTGG 


TAGTTGGTCC 


TCAGAACCTT 


9180 


CACTGCCACT 


TCTTCCCCAT 


CTAAGATTAA 


GTCTTTGGCT 


AGCTAGACAT 


CCGCCATACC 


9240 


TCCTCGACCA 


ATCTGTTTGA 


CAATCCGATA 


GCGTCCGGCA 


AAAATCTTGC 


CGATTTGGAT 


9300 
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CATTCTGCAT 


CCTCCTCGTT 


CATAGAAACA 


AGGGCAACCU 


A /V% A w A A V* b W 


TAAACCTCCT 


9360 


GCATTGTTAG 


CAAAACGAAC 


AAGTGTCTCC 


GTTTTATCTG 


fTAAAGGAAT 


AT^ A {7 1 lyiyrr 


9420 


ACAATATCAC 


GAATCTCACT 


GCCTGAAATC 


ATGTTGGTCA 




ATTGAGCAAG 


9480 


AGATAGTCAC 


CTGACTCAAG 


GATAACTGTC 


CCAAAATCAG 


V»U l\tnn X I l *- 


ITrTTTPTftP 

A A %» A A A A A 


9540 


CCAATAGACT 


GGGTGATAAT 


ATTTTTTTGC 


GGATGAGCTT 




TGGTGTCAAT 


9600 


TGACCAGCCT 


TGAGCAATTC 


ATTAACCAAG 


GAATGATCGC 


TCGTCAACTU 


ATGGTATTCT* 


9660 


TCTCCACGAA 


TCAAGCCGAT 


ACGCGAATCA 


CCAATATGAG 


CATAGATAGC 


CTGATTATCA 


J <*u 


ATAATAGCAA 


GGACTTCCAA 


AGTAGTTCCC 


ATCCCTCTGT 


AAGCTTCATC 




9780 


TGGTGAATCT 


TTTGATTTTC 


AATTTCTAGG 


TAATGGGCGA 


ACCATTCACG 


CACTTCATTG 


QftAA 


ACTGTATCGA 


TCTGGGTATC 


AACCCAAGCT 


ACACCCAGGT 


CTGTGACCGC 


CATTTCACTA 


OOAO 


GCGATATTCC 


CTGCCCGATG 


ACCTCCCATC 


CCATCAGCTA 


AAATAATCAT 


GGTACGTCCA 


aa An 

7>OU 


GCTCTATTGA 


CATAGTGCTT 


GACATAGTCT 


TGGTTATTTG 


TTCUITTC TG 


ACCAACATCT 


IUUav 


GTTAATAATG 


AAATTTCCAT 


GTGTCAGTTC 


CTTCCTAATC 


CGATATCTTG 


UjAAATTGM. 




TGATGAAGAA 


TCCATCACTT 


CCATACAATT 


CAGGTGTAAT 


CAGGATACAG 


CCGTCTTTCA 


10140 


TGATATCCTT 


ACATTCATGT 


TCTAGTTTTA 


CCTGCTCGAA 


jijru — i j — t — - 1 — - ft 9fk^ ft 

CTCGGGATGA 


V- 1 V. 1 1» 1 A/*A« 


10200 


AGGCCTTAAC 


GACTTGAAAA 


TTCTCCTCTG 


AGACGATAGT 


GCACGTGCTA 


TAAGTTATTA 


10260 


TACCACCTTT 


GCCTACTATT 


TGACAAACAC 


TACCTAATAT 


TTCTAACTGA 


ATTTCCTGCA 


10320 


AGGACGCGAA 


ATCTCCCCTT 


TCTTTATTGT 


ATTTGATATC 


TCGTTTTCGG 


CGCAAAAGAC 


10380 


CGATTCCTGA 


ACAAGGAGCA 


TCCACCAAAA TCTTATCAAA 


GGAATCCTGG 


TCAAAAAACT 


10440 


CATGCACCTT 


TCTGGCATCC 


AATTTTTGAG 


TTTGAACCCG 


ATCTTCAACT 


CCCAGACGTT 


10500 


GGGCATTTTC 


TTCAATTAAA 


TCCAACTTGT 


GGTCGTACAA 


GTCCACAGCA 


GTAACCTGAC 


10560 


CTCTCGTAAG 


ATAAGAGGCT 


ATATGGGCTG 


TTTTCCCACC 


TGGAGCCGCA 


CAGGCATCAA 


luo *u 


GCACTCGCTC 


ATCACCTTGT 


AAATCAAGCG 


TCGGAGCAAC 


CAGCTGACTG GACTCATCTT 


10680 


GGATGCTAAT 


GGCTCCATCC 


GCAAACAAAT 


TATGCCCTGC 


AAAATGCCCC 


TGCTCCTTAA 


10740 


CCAGACCACT 


GGTTGCTAAA 


AGGGAATTAT 


TCGCCTCCAA 


CAAGGCTTGG 


ATTTCCTCTT 


10800 


TTCGACTTAG 


GTCTGTTACA 


CGAATACTGG 


CTTTGTTTCG 


CACTAACAGG 


CTTTCAAAGA 


10860 


TGGCTTTTGC 


TCTCTCCTCT 


CCGTATTCTT 


CCTTGAGTTT 


GGCAACTAGC 


CAAACTGGGA 


10920 


GAGAATAGGC 


AATGGAGTCA 


CGCTTGTTTT 


TTCGCTTGAT 


GCTAGCAATA 


TCTGGCCAGC 


10980 


CTTCACGCAA 


GATACGGCGA 


AGGACAGCGT 


TGACCAATTT 


TTCACTGCCT 


TTTTTACGGA 


1104 0 



WO 98/18931 



PCT/US97/19588 



GTTTGCCCAA 
GGACTTGGTA 
CTTCGATAAA 
CCAGCTCGGT 
TTAACCCGAT 
AACTTCTAGC 
ACTCCGTTGA 
AGGGATAGAG 
TCACCTGGAT 
TTAAGGAAAC 
CGATTGGTTT 
ACCTGACTCG 
TCCAAAAGCA 
TCATCTGTCA 
ATTTCCATGA 
CCACCACCAC 
TCAAGGACTT 
AGCTTCATAA 
AGTCCTGCTT 
ACAGCACGGT 
CCTTTTAAGA 
TCTCCTTCTT 
TCCCGTTCTT 
CGGTATTTAA 
GTTGGCCCCA 
TAGGCACGTT 
TAGTAACGTG 
TAATCTTGAT 
ACTTGACCTG 
GTTCTCTCAG 



TTCCACTGCT 
GGCACTCATG 
GTGGGATAGG 
CACTAAGCCC 
ATTTGAATAT 
CGTTTCTACT 
GGAAGGAAGC 
CCCCTTCAGC 
TTCCCTGACC 
TATGGGCAAC 
TGTTCCAGTC 
TATCCTGCGG 
AATCACGACC 
TCGGAATGCT 
TGGTCACACC 
GGTCTCTAGG 
TGCTTGGGAG 
GATCTTCCAT 
CCTTGGCAGC 
CTGGCTGGGT 
CTGTTGCTGA 
ATAAAAATTG 
GAGTCAAGGC 
TTAAAATCTG 
GAATGGGACT 
TGAAAACCTC 
GATAGCCGAG 
CCTTGGCAAA 
CCTTTTCAGC 
AAGAACGGAA 



TCATTAACCA 
AGAAGAAGGA 
TACCATTCCA 
TTGTCTGCTG 
GCTTGGTTCA 
TTACTCACCA 
AATGTCCATC 
CGTTGCGACA 
TTCTACTGGT 
AGGCCAGGGG 
CAGTTTTTCT 
TTCAGGTTTC 
AACTAGCGCC 
GCGACGAGAA 
AGCTTCCTCA 
AAGGAGGGAG 
AAACTGCCCA 
CTCTCGACTT 
CTGCTTGACT 
CACAACGCCT 
AAAGTCGGGG 
CTGCGGCTCA 
TAAAACCTGG 
GTAATGATAG 
GGTCTCTGAC 
TTCTTCTTTC 
TTGTCCTCTG 
TCGAATAGCA 
CCCACCTGAT 
ATCACGCAGA 



700 
CAGCATGATC 



CATAGACCCA 
GAGTCAGTTT 
CCAAAACTTG 
CAAAAACATC 
AATCGTTCTC 
TTAGGCTTAC 
ATCAATTCTT 
AGGGCTTCAT 
TTCATTCCAC 
TCCTCTGGCT 
ATATCACCAG 
AATTTTTCAA 
ATCATATCTC 
TCCCCTTGAA 

gcatgaacgt 
aaagcagcag 
ccacataatt 
cggctttctt 



AGAATTTCGT 
GTCCCCATAA 
TGGTCAATGC 
TTGAGGGTCG 
AGGTTCTGGG 
AAGCCTCACC 
TTGTGAGAAA 
ATTCCCATTT 
TAGTGCTCCG 
CGACCTGCCA 
TTCAAGGCCG 



TGGAATCTTG 
GCTGTCTAAC 
ACGGGCTACC 
ACTTCCCTTT 
CTCTAGCACT 
CTACAGTCAA 
CAGCTGGCTG 
TCTTGCCGAT 
AAATCTTAAA 
GAATTTGGTT 
TTATATTTGG 
CAATATAGGC 
ACAAGGTGCC 
CTGCATCCAT 
TCAAGGCATA 
TGACAGCAAA 
TCACAATTCC 
TTTCAGGTTG 
GGATAACTTT 
AACGGTCATC 
AGATTAGTTT 
TGAGACGGAG 
ACCCCAGCTC 
TACGGGCAAT 
GCAAAATGTT 
GGGTAAT ACC 
CATAGGCATA 
GATTGTAGGA 
CCTGAGTCAA 
TATCCGCATT 



TCCAAATAGC 
TGGTCTCTGT 
GTTCCATACA 
AGATGCTTAT 
GCTAGAGCTA 
TGTACCTCCA 
CACTTCTTTG 
AGAGAGAATC 
GCGGTCGCCC 
AAAGAGTTGA 
AGAGAAGGTA 
AGGCAGAGTG 
AACATTGTCC 
TTCCTTAACC 
ATGGATAGGC 
CTCCATGCTA 
ATCTGCTCCT 
GTAGATAGAT 
TTT ACGACCA 
TGTCAAAAGT 
TGTC AT ATCT 
CTCACTATTT 
ATCTTCTAAA 
CGGTTTTGGC 
CATGACTTCA 
AATCGTGAAA 
AAAGCCTTCG 
CTGTATCAAG 
GAGCTGGAAG 
TAGAACTCCG 



11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

L2480 

12540 

12600 

12660 

12720 

12780 

12840 
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ACTAGCGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTGCAATCA TCTGACTACC 


AAGTAAAATA 


X2900 


TCCGCTTCCC 


CTCCCCCAAA 


CTGGTCAAGC 


AAGCCTTGCT 


GACTGCCTTT 


CTTTCGAGTC 


12960 


GTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTCATCATAA 


13020 


GCCTTCTGAG 


TTCCCCTCCC 


AT ACT AACGA 


ATACTGCGGC 


TCTTACAGTT 


AGGACAGACC 


13080 


TGAGGAATAT 


CCTTCGAGAA 


ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATGC 


13140 


AAGCTCAGAG 


AAATATCGCA 


CTTGGGACAA 


GTATCCACCG 


TCCCACACTC 


CCGACACATG 


13200 


ACAAACCTAC 


AATAACCACG 


GCCATTGACC 


ATGAGAACCA 


CCTGCTCTTT 


TTTAACCAGA 


13260 


CGGTCTTGGA 


TAGCCTCTAG 


CAAACGAGGC 


GTAAAGTTTG 


ACGTCTCATT 


TTCTCCGATA 


13320 


TAGTCTCGAA 


AGTC AATCAC • 


TTGAACCTCA 


GGGATTGTAG 


CCAAAGGATT 


GGCACCTTGG 


13360 


GTTAGACCTA 


AGTGTTGATA 


GACGCCTTTG 


CCAGCACCTG 


CCCGGCTCTC 


TAAGCTCGGC 


13440 


GTTGCAGATC 


CAACTACCAG 


ACTTGCTTGA 


TTATACTCAC 


CCCCTAAAAT 


AGCTACCTCT 


13500 


CTGGCATGGT 


AACGGGGATT 


GCTGTCCTGC 


TTATAAGCCG 


CTTCATCCTC 


TTCATCAATA 


13S60 


ATCATGACAC 


CCAGATTTTT 


CAGAGGAGCA 


AAGATAGCAG 


ATCTGGCACC 


AACAACAACT 


13620 


TGGGCATCGC 


CACCCTCCAC 


CTTGCCCCAT 


TCATCATACT 


TTTC AC C ATT 


GGATAATCCT 


136B0 


CAGTGAAGAA 


TGGCTACCTT 


CTCCCCAAAA 


CGTGCTATAA 


AACGCTCGGT 


CATCTGAGGA 


13740 


GTCAAGGAAA 


TCTCAGCTAC 


CAGCAAAATA 


CCTGTCTTCC 


CCTTATCCAG 


GCCACCTTGG 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTGTAA 


TCCCTTGAAG 


TAGAAAGGGA 


13860 


GGTTGAGAAC 


TGCCAATAGA 


ACTCACAACC 


GCATCACGCG 


CCTGTCTTTG 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCCAGCG 


TTGAACTTCC 


13980 


TTTTGGACTA 


TCGTAACAGC 


ACCTTGATCC 


ACAAACAAGT 


TGACTTCCTC 


TCGCGAGTAG 


14040 


GACTCTAACA 


ACCTACCCAA 


GGAACCGCTC 


TCTGCATGAG 


ACACCAGATA 


ATCTCTCAGT 


14100 


TCCAACTTTT 


TCTTGGCACG 


TCTACAAATC 


TCAACACCTT 


CTAATTGAGC 


ATGCTCAACC 


14160 


TCATACCAAG 


ACTGGGTCTT 


CACCTTCITT 


TGATCCACTC 


CCTCATATTC 


CAGACCAAGC 


14220 


AGGCCTTTTC 


TAGTCAAACG 


CATCATTTCA 


GCTTCCTTGG 


CAAGGTCTAG 


TCAAGAAAAG 


14280 


gctagccaat* CTTCTGAACC 


AAACAGGCGC 


ACTCCTTCTT 


CCTCACTCAA 


GCCTTCCACA 


14340 


CGATAGACAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


CAAGCATGGC 


CTTGAGGATA 


14400 


CAGATTTTCT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


CCAGCCAGAG 


TTGTTCTGGC 


14460 


CTGAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATCTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCAGGCG 


ATTACCCTTA 


14S80 
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CCAAAAGGCA CATGAACCCG CATCCCAACT TCCACCATTC CCTCAAATTC CTCCGGAATC 14 640 

CTGTAACTAT AGGGCTGCTC CCTCTGCATC AAGGCCACAT CTACGATAAT CTTACCTACC 14 7 00 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTCCAA TAGAAAAAAT AAGATTGACT 14760 

CCCCCCAACC TTAAATTTTT TCACCATCTT C TT T TT C TTT AGCAATTTGC TCTTTGATTT 14820 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTT C TTCGAT ACGGCGACGC ACTCCTTCAC 14880 

GTTTTCCTTC TGGATCTGGG TGAATTCTAA CCTT TC CTGA TTCGATTTCT TCTAAAGCGC 14940 

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GCCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 1S060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGCG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT C C TT G GGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 15360 

CTCCTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

CAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTC ACATAGGTCA 15480 

GAGGAGTTCC ATAGTAGTTA C CG A CAT ATT CTCCCTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CCAGTACGGA AGAAATAGTC AACACCCTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAC AATATTGAAA TTGGTTTTCA CAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTCT GAAATAACAT TTCTCTAGAA 1S760 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTACTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTCCACGA AGCTCGCGAA 15900 

TCACCGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TGTGAGCCAA GATTGTGACT TTCTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

CGGCTGCGCT CAAGGCATCT GCTGCAGCCA CCATAACTCC TATCACGCTC T C AG CTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTTACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 16320 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCCCTCGCCA 16380 
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TGATACCAGC 


CAACTTAGCA 


ACCTCAATCG 


AATGGCGCAA 


AACAill K»T 


CCATATwwwi 


16440 


TACGGAACTG 


CAAACGTCCC 


ATAATCTTCA 


TCAACTCTGG 


ATGAAGGTTT 




x a jvu 


TTTCATAGGC 


ACCAGCCTCA 


CCCTATTCAC 


GAATCTTATT 


GTCAATCTCT 


TGACGGTTTT 




TCTCAACCAA 


CTCTTCGATA 


CGAGCTGGAT 


GTATACGACC 


ATCTTTCAGC 


AACATTTCCA 


looZU 


TAGTCATACG 


GGCAATCTCA 


CGACGAATCG 


GATCAAATCC 


TGACAAGGTC 


ACC Ac nvi\* 


16660 


GTGTATCCTC 


GATAATCACA 


TCGACCCCTG 


TCAAACTTTC 


AAAGGTACGA 


ATGTTACGAC 


16740 


CTTCACGACC 


AATAATGCGT 


CCCTTCATAG 


TATCCTCTGG 


CAGATGAACT 


GTTGAGTTTG 


16800 


TTGACTCCGC 


TACATATTCA 


CCAGCCATAC 


GTTGCATAGC 


TTGAACCAAG 


ATGTCCTTGG 


16860 


CCATTTTGTC 


ACAACGTTCC 


TTCACCTCrr 


GCTCAGCTTC 


GCGAATGCGA 


CTGGCAATCT 


16920 


CCCTGGTCAA 


GTTTTCCTCT 


GTCTGAGCCA 


AGATAATATC 


TCCTGCTTCT 


GCCTGAGACA 


16980 


CCGCACCAAT 


ACGCTCTAGT 


TCTGCTTCTP 


TTTGTCTTTC 


GACTTCCTCT 


AATTGCTCTT 


17040 


CACGCCCATC 


AAGGTTTTTC 


GCTCTATCAG 


AAATACTTTG 


TTCTTTTTGT 


TCAAGTGTTT 


17100 


GTTCTTTACT 


CGTCAAATTG 


TCGTCCTTAC 


GCTCAAGGCT 


ACTAGCTCTC 


TCTGTCAAAC 


17160 


GACTTTCGAT 


TTGTTTGAGT 


TCTTGACGTT 


CTGATTTGAA 


TTCAGCGTCC 


ACTTCTTCAC 


17220 


GGTATTTTCT 


GGCTTCTTCT 


TTGGCCTCCA 


ATACTCCTTC 


TTTTTTAAGA 


GACTTGCTTT 


17280 


CACGTTTGGC 


TTCATTAACA 


AGTAAATCCG 


CTTCACCCTC 


AGCTTGTCCA CGTAAATTAG 


17340 


TTGCTTCTTG 


TTCAGCATTT 


AAAAGCATCA 


ACTCTGCAGC 


TTCCTGAGAT 


GATTTCATCT 


17400 


TAGCTGACAT 


GCTGACATAT 


CCAATGACTA 


AACCAATGAT 


GACGGCAAAA 


ACAGCAATCC 


17460 


CAAGCGACAT 


GATTTCCATG 


TTTTTACCTC 


ATTTTATTGT 


TATTCCGAAT 


GACATACATT 


17520 


CTTTTACATT 


CTACCATAAA 


AAAGTGATTT 


TCACAAACCT 


AAAATAGAAT 


ATGTTTTGAG 


i icon 


GAATTTGGAA 


CACATTTACC 


AAAATAAACT 


TGTTGTTTAG 


AAATAGTACT 


TTAGTAGAGA 


X f 04U 


CTTGAGAAAA 


AGCCTACCTT 


TCAATAGACT 


TAGTAATGAT 


CTTTAAACGA 


CAAGAAAGCC 


17700 


ACGCTATCTC 


CATCCATCAT 


ATAAATCAAG 


CGATTTTCTG 


CATCAATACG 


CCGTGACCAC 


17760 


GCTCCTTGGT 


AATCATATTT 


CAGTGGTTCT 


GGTTTACCTA 


TTCCTGTAAA 


GGGATCACGT 


17820 


TGAATATCCT 


TGATTAGTTT 


ATTGATTCTT 


TTTAACGTTT 


TCTTATCCTG 


ATTTTGCCAG 


17880 


TAGCAATAAT 


CTGCCCAGGC 


ATCTTCTGTA 


AACTTGAGCA 


GCATTTCTTA 


CTCCTCAATA 


17940 


ACATGCACCT 


GAGTACTTCC 


AGCACGAACT 


TGAGCCATTC 


CTCGCAAAAC 


CTTATCAGAA 


18000 


AGTTCCTTAT 


TTTGAGCAAT 


TCTCAGGGTT 


TCTTGGATAC 


TATCCCACTC 


ACTCTTTGAA 


18060 


AGGAC7ACAA 


TGTCCTCATC 


TGGATTTTTA 


TTCACCACCG 


TCAAAGGCTC 


AAATTCATCA 


18120 



f * 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGACTAAAG CACTGCTTCC 18180 

ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CCAAAAGAAA CAAATTCTCA 18240 

GGAACTTCTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATCAAAATC AAAGAGCAAA 18300 

CTAGCAAACT AGCCGCAGCC TGTACTTGAG TACGGCAAGG CCACGTTGAC GCGATTTGAA 18360 

TTTGATTTTC GAACACTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTACACT 18420 

TAGTTTGTTT CTATTC 18436 
(2) INFORMATION FOR SEQ ID NO: 88: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : double 
(O) TOPOLOGY: linear 



(XX) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


88: 






ACGTAGAAAA 


. ACTATTTCTA 


TCACAGATAA 


TATTCCGTAT 


GTTGTTCGAG 


CTATTGAAAT 


60 


AAACGTCCTA 


GGTATCTTTC 


TCAGTCTATG 


TGACTTACAA 


GGGAAAACTC 


TTTTCGAGAC 


120 


AGAAATTTTG 


AATGAAGATT 


ATCCTATTTC 


AGAAATCAAT 


TCCACCATTA 


CCAATATGAT 


180 


AAAAACAGCT 


ATACACTACG 


TCCCTTTGGA 


AACAAAATTA 


CTTGGATTTG 


GCTTATCAAT 


240 


ACCTGGACAT 


TATAACAAAG 


ACTCCGGAAG 


TATCATTACA 


AACAACCCCA 


TATGGGAATC 


300 


TTTTAATTTA 


TTAAATGTAA 


TTAAAACATT 


CAATTTTCCT 


TTTATTGTAA 


AAAATAATAT 


360 


CGATTGTATG 


GCTATAGGAC 


AATACCTTTT 


TAATCCACAC 


AATACCCCCC 


ATAACTTTAT 


420 


TTTCCT A CAC 


GCTGGATTAG 


GTATTTACAC 


TTCCTTTTTC 


ACAAAAGAAA 


AAATAGGAGC 


480 


CTCTAAAAAT 


CCTTATATCG 


GAGAAATTCG 


ACACACCATT 


CTCGAATTGA 


ATGGGCAATA 


540 


TTGTGAATGC 


GGAAAAAAAG 


GTTGTTTACA 


AACATATATT 


TCGCATGCTT 


GGTTAATCAA 


600 


ACACGCCCAA 


TTATTATTTA 


AAAATTCCCA 


ACTAACTGTA 


CTAAAAAGCC 


TTGTAAAGAC 


660 


TCAAAAACAC 


ATTCATTTAG 


ACACCCTTTT 


AACGGCTTAT 


AATTTAGCCG 


ACTCCGCTTT 


720 


ACGTCAACAA 


ATTCATAAAG 


CACTCAATTT 


ATTAGCCACT 


TCTATTGCAA 


ATCTCCTCCT 


780 


CATCAATCCT 


CCTGATAAAA 


TCTATATCAA 


CAGTCAATTG 


CTTAATTATC 


AACCTTTCAC 


840 


TCATGAAGTC 


AGGGATAAAA 


TCCAAGACCA 


GCTCCACTTC 




CTCGTAATAT 


900 


AGAAATTGAA 


ATTTTACCTT 


ACAACAAACA 


TCGTGGAAGT 


ATAGGACCTT 


GTGCATTAGC 


960 


TATCGTCGCT 


TTTTTCATAG 


AACATAGCAA 


TGTATTACAA 


GATATTATTT 


CACCTTAATA 


1020 


TATTAGAAAT 


CTATAGACCT 


GTTTAAATCA 


ACTATAACCT 


GTAGTAGATA 


TCTCGTATTT 


1080 
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ACACAATATC AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 

vrrrATArrA aaataacttt tcttctagct gcattttatt attataaaaa cattcatcat 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 
TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 
TTTACATATA GTATTTAGCG CCACATACTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 
TTTCAATAGC CCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 
AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 
GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAC 
TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TCCCTACATG 
AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 
AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 
TAAAGTTACA AAGATGTACG ACCACACTCT TGTAAATCAT ACTGTTCGCG AATATATTAC 
TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 
GTGCTTTGG C ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 
AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 
GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAACACAAA TAAGCTAAAA 
AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 
AATACTAGTG TAACTATCCT TCCAGTCAGA ACCTTGTCAA ATCACACCGA AAATTCTTCT 
AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 
AGGATTGACT TCCTAAATT A T ATACTTT AG TAACCTTTTC GGATAAGAAA AAAGGTTCAT 
TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 
AGCAACAAGA AGATTTTCAC TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 
TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 
TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAACCCA TTGGTTTCGC 
TATTTGCGAA AAGTTATTTT TGAAG ACT AT TCTTArrTAA CAAACCAAGA TGTAGAAAAG 
TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA CCTTGAAGTA TGCCTTTCAA 
GAG C AT ACT C CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 
GTTCAAAAAT TGATGCAGAA ATACAAACAT GGAT AAAATG AAACCGGTCT TCCAAGCCCT 
AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2260 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 
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CTTAGAATAT 
TTTCTAGTGG 
CCCATATCAC 
ATTACATTTT 
CATCAATAAG 
CCCTGAGTGC 
CCTCGAAACA 
TCACTGTTCA 
TCGTATTATC 
TGCTCATCAG 
TGAGCATGAT 
AGTGACTGTT 
GATATCATCA 
TATGATAGAG 
TATGACTTAG 
GTACAACATC 
CGAAAATCCC 
CGTAAACTCA 
GACATTTTAG 
CAACTCTTGC 
GACAATCTTA 
AAAGAAAAGG 
AATAATCTCA 
AAAAAACGGA 
CGAGCTTAGC 
TAGCTTTCCT 
AAACAGGATT 
AACACGCCAT 
TGGATCTGAT 
GCTAGGAATC 



CATGGTTTAC 
CTAAATCCCC 
CTATAATGAA 
ATCACAAAAT 
GATACACACA 
GGAAACCAAT 
ACTGGTATGC 
AAAATCATGC 
AACCAAAAAA 
CTGCCCATTT 
TTTTCCCGTC 
TCAATCGGCA 
CTGTTCTTGA 
CCGTCCCATG 
CTAGACAACT 
TTAGCCGTGC 
ATGAATACAA 
GCGATAAACA 
ACAAGCTTTT 
TGTTTCACTT 
AGCAGGTTCA 
TTATCAACGC 
TCAAACTTAT 
TTTTTATCGC 
TTTTTTTCAA 
TTCATTTCTT 
CCCAGAAATG 
ACAGATTGTC 
TTTTGATAAA 
GGATCGCAGG 



GTCCCACACA 
TATGGATATT 
AAGCGACAAA 
TACTAGACAT 
AGGAAATCAT 
TGAAGAAATA 
CTTCTAGAAT 
TCGCTGAAAC 
TTGCGCAAAA 
CAACTTCAAC 
TTCCTGAGAT 
CATGGAGATG 
AGGTAGAACA 
TCGCGTCAAA 
TTTCCCGTCT 
TATGACTCGT 
OGCTATCAAG 
TTTTTATCGC 
GAGCTATTCA 
TCAGAATAAG 
TCCTATTTTT 
CCTTCAACTA 
CAAGCGCAAT 
TCTGAATATC 
CCCACTACAG 
TTTATTTCCC 
CCAGTACCAC 
CTTCTTTCGT 
CAGCGACTCC 
TTATCAAGGT 



706 
AGATGTTGAT 



ATGGAGCCTA 
ACAACTCATT 
TAAAGACCCT 
CGCCAAACTG 
TGACTTTCAA 
TCTCCTTAGA 
TTCTATCGTC 
GTTGATTGAG 
TGTCATTCGC 
TATGTCCTGG 
AGCTTTATTG 
CAAGCTGTCA 
ATT ATT ACTA 
GCTAAAATCG 
GTGCGTGTCC 
CCCTACTGGA 
CCTACTTTTC 
CAAGACTTGA 
GAACCGGAGA 
CAGACTGTCT 
CACTATTCTA 
GCCTTTGGTT 
AAAAAAGAAA 
TTGACAAAGA 
TCGTAGTAAA 
TAGCCCACGG 
AAAAGCAACC 
TTCAAAATTC 
CATGATATTT 



GCTTTTATGG 
TTTTTGTGTA 
AGAAAGAATC 
AATATCCAGA 
GACTACGACG 
AAACCGTCTA 
AAACGCCGTT 
AAGAAGAATC 
AAGATTTCTA 
AACCTCAATG 
GACGTTGAAA 
CGCAACATTT 
TCCCAGATCA 
TGGATAT G TT 
TTCTTGATCG 
AAATCATGAA 
AACTCATTCA 
GTATGCATTT 
AACATCACTA 
AATTTTTCGA 
TTAAAACCTT 
ATGCCAAACT 
TTCCAAACTT 
GGACAAAATT 
GCCGGAAAAA 
CGTGCTAGCT 
TACAACCATT 
CTCGCAACTG 
ACTAATAAGC 
TTAGAGCTAA 



CTCT AT AAT A 
GAAAAAAAGT 
ATATGGAACA 
TTTTAGACAT 
CCCCATCTTG 
AGATCCCTTA 
TCAAGTGCTA 
ATCAAATTCC 
TGACCGATAT 
ATTCTCACTT 
CAGTCCGGGG 
TGAAAAGCTC 
CTTTCTTAAA 
TAGTCCTTAC 
CTTTCACATT 
TCAGTTTCAT 
ACAGGATAGC 
AACCAATAAA 
TCAGCTCTAT 
ACTTATCGAG 
CCTCAAAGAT 
CGAAGCGACC 
TGAAAACTTC 
TGTCCTTTCT 
GGAACAGCCT 
TCCACAAAAC 
GAGAGGTTGC 
CAGCTGTTTG 
GTTTATTAAA 
CCGATTCTAA 



2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 
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TTTTTCCCAT 


TCCGACGCTA 


AAATAATCTC TGTGTCCATC 


ATCTGATATT 


CTACAATTTC 


4 ft ft A 


CTGGCCATTA 


TCATAATAAA 


GAGCATCTCC 


AACTTTTAGC 


inn m ft * ft It* 

TGATCCAAAT 


/V*fY!fi &. & & A & 
uuuuwnAVWUI 




CACATCGCTT 


GGCTCTGCAC 




AATCACTGAG 


CGAATC.Cl.iiut 


TAWLA La»\— A«-t 


a Ann 

HOW 


AGGCAGCGGT 


GTACCATCCA 






AAATGATGAT 


AATvivWTwL 


A OCA 
400U 


CAAATAAACC 


GGCTCCATGA 


111 wwA/v*^» * 


TGGAATAGAC 


AAGTAACCAT 






AGGGTCCTCA 


GACACTTGGT 


AArlunU\ IV- 


nln ivU V- 1 V»V- 


GCCAAAAAAG 


/* n ft /"» ft ft.ffi 

GATCTAwAAT 


470U 


GCGATTTTGC 


GAAGCCAAGC 


GTTGATTGTA 




TGCTTCTGTT 


GTTCTTGGTA 


5040 


CATTTCACTT 


GTCATCCATT 


TuALAAA lu I 




TTCACCTGTC 


CAAGAGACTG 


5100 


CAACACCATC 


TGTCCAAAAC 


AATAAATAGu 




GCTACCAACA 


TCAACAAGTA 


5160 


TCCCAATAAC 


GCTCGTAGTT 


TACTCCTTGA 




CTCCAATTGC 


TlWf AGTC 


5220 


CTTTCACAAT 


CCCTCGATTA 


CGATACACGC 


GATACAviCAA 


GAGAAGGATG 


ACCGCCATCG 


5280 


CTCCTACTAA 


7AACCACAAC 


CAGAATTGCw 


CACGCTtTCT 


CACCGCTCGA 


TTCCGCTCTC 


5340 


CAATTGCTGC 


CGTATACGGA 




LALbl AV.LAA 


CAGACGATGA 


CTGTTAATCA 


5400 


TATACGGTCT 


ACAAGTCAAC 


AAGGTtGCAT 




ATGTTGAATC 


AAGACAGGCT 


5460 


CAAACTCATT 


CGGCTCCACC 






GTAGGCCAAC 


ACCTGATCTA 


5520 


AAACGTGAAG 


ATAAAAGATA 




TCATCTTATC 


CAATTGACTG 


AACAATTCTG 


5580 


CCGTTGGCAA 


TCCTCTGTGA 




CTGTATGGGTP 


ATTTTCACCT 


CCAACAGGCA 


5640 


GCGAAGCCCC 


TTCTAACAGC 


CCTGCCCCTT 




GTCCTCACTC 


CTTCCGACAT 


5700 


ACATCGGAAT 


TTCCTGATCA 




TTTCCACATA 


GCCAATCCGC 


TCATGGACCT 


5760 


TTAGCATATT 


GGCATATTCT 




TCTTTTTCTC 


TTGC'i'V. ™TA 


AAAGGATCAA 


coin 


GAATTTCAGA 


TGGTTTCAAG 


GTCGCATTGA 


AGGCTTGAGC 


CAAGCGCCAA 


CGCTCCTCAA 


3BBU 


CTTCTGCCTT 


ATCCATCTGG 


GAAACCGTCT 


CATCAAACTC 


TTTAATAACC 


TCGTTTGACT 


5940 


CAATACGATA 


ATAATAACGA 


GACACCAATG 


GATATATCGC 


AACGGCGAAT 


CCTACTAAGA 


6000 


AAATCACAAG 


AAGGATCAGC 


GGATGTTTCT 




GCCTTTTTTT 


CGTGAACGTC 


6060 


TACTGTTGTC 


CAtCCTCCAC 


CTTCACTTCC 


TTCCTTGCTG 


CTTTCAGCGC 


CTTCAAAGCC 


6120 


TTTTCCGGTT 


CTTTTTTCTT 


CTTCCGCAAG 


CCTCGAATAA 


TCCATAAAAG 


AATCACAATC 


6180 


AAACCAACTG 


CCACATAAAA 


CAGGTAGCGA 


TAGAGATGAC 


TGAGTTTGTT 


TGCTGCAATA 


6240 


AATTCTTCCT 


CAACCTCTGC 


TACGTACGGT 


ATCCGATCCC 


CCCGAACCAA 


TAGACGATGG 


6300 


GTATTGATCA 


TGTATGGCGT 


ACAAGTCAGC 


AAGGTCACAT 


AATCATGACC 


TGGTACAATC 


6360 
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AATAAATCAT CAAACTTCGT CGCCTCAATC ACCTTTACTT CATCCACTTG ATAGCCCATC 6420 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTCCT CAAATCCGTA 6480 

AACATCTTAC CTGTTGGCAA ACCTCTATCT GCCGTAATCA CCCCATGGGT CCAATTGCCT 6S40 

CCGATCGGCA CAGAACTTCC CTCTAGATGC CCACCCCCTT CCTCCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGCGGA TTTCCACATG CCCCATCCCC 6660 

TCATGGATTT CTAACATACG TG CAT ACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGCATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTCTGC CAATTTCATT 6780 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT G C TT TT TCCT TATCAAACTC AGCAATTT G T 6840 

TGATTTGATT CCACTCGATA ATACAAGCCA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA CCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 

(2) INFORMATION FOR SEO ID NO: 89: 

<i| SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 
IB) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GAGGGAGCTT AAGAAGTTAC CACCGTCCTC TAGCGCCTTA TCCGCATCAA ACTTAAOGTT 60 

GATATTTTTA AAACTGTCGC CAGCTTGTGA TACCAT G CTT T G TT T AACCT CATTTAGGGT 120 

TTTAGTGAAA TCTGCATTGC TGAGGATATC ACTCTTTGAG AGATTCAAGG CAAAATTGAT 180 

GATGATATTG ATCTGGTTTC CTCTTATGAC CTGATCAACT TTGTAATTTT TTAAGCTATC 240 

TTCAACAATC TTCCGGATAT CTTCTTCTCT CAGATTTCCC TTACTTTCTT TAGCTTTGGC 300 

GAGTCCTGAC TTGATATCAG CTAGGGCAAC CTTTAATTTA TTAGCATCAT AGCCTGATTT 360 

GTCCTTGTTT TCACCATTGA TATCTGACAA ACCTTTTAGC TCTTCTTGAG CCAAATCTTT 420 

ATTAGCTTGT GGCACCTTGG CTCCATTAGC CTCTAGCGAA TAGTAAATCC CTGCTAAAGC 480 

ACTTTCTCCT GTAACTCGAA TACCCSCTCC TACAGTGATT TTGCCATGTT CCATACCCAC 540 

CGTTACTGCT GCGTTTCGCT ACATATCCTG AGTCACCTTA GTGATATTTT CTGCTGTTTC 600 

AATCTTGACC TCAAGTGGCC ATTTCTCACC T AGC TTTTG A ATCTTGGCTG ATGAATACAA 660 

CTCTAAGCTA GAGTCATTGC CCACATTCAT GATTTTAGAA TAAACATCAG CTGTCATGGT 720 

CTTGAGTTCT TTCCTATCTC TTGAGGCATT GTAGCCCAGT TTTTTAAGAG TTTGATTTTT 780 
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TTGCTCTTCA 


GATAGGGAGG 


AACCTAGGAC 


ATATTCAGCT 


TGGACATAGG 


TTTCATCGAT 


840 


AACTTTTTGA 


ACATCTGTTG 


CTGCATGGAC 


CCTATTCATA 


GCTGTTACTG 


CCCACAAGAT 


900 


CGCAGCCCTA 


GTCAGAAAGA 


GTTTCTTTCT 


CATAGGGAAT 


TTCCTCCTTT 


ACTTCTTTAG 


960 


AGTAATATAT 


CTATCTTAAA 


GAAAACTTAT 


AACAAAAACA 


CCTGGTCTAG 


CCAGATGTTG 


1020 


AAAAGAGAGT 


GAAACATTTG 


ATGATGTAAA 


GGTTAAGTCG 


TACCTGTCTA 


GAATAATAAT 


1080 


AGTTTCCTCC 


ATTTACATAG 


AGTTCAGCAC 


CGTGAAAAAT 


GGAAATGGGG 


TGAATATAAC 


1140 


TATAAGTCTT 


TCCAGTCCTA 


TTACCAAGCA 


AGGGGGCAAC 


AGTCTCACGA 


CACTACTCTT 


1200 


TGGCTAGAGC 


CAGGGTATTT 


TCCTTGCCAT 


TTTGGGCGAT 


AAAATCGATA 


TAGGCAGGTC 


1260 


CAAAATTATA 


GGCTTGAACA 


GCTGTCCAGA 


TATCTACCCC 


CTTCTTCTGC 


GCCAGATAGA 


1320 


GATTGCCTGT 


CAGAGTTTGA 


ATGCCTTGCC 


GAATGCTAGA 


GGCATTATCA 


TTGATGGTGT 


1380 


TGGTGCAACC 


ACTTGCAGAC 


TCACTAGACT 


GCATAACATC 


GCCTTCTTTT 


CCTTTTGTTT 


1440 


CAGTATAAAT 


CATAGCAAGC 


ACAAGCTCTT 


CGTTTGCTGG 


GGTGTCTTGT 


TCACTCAATA 


1500 


TTTCTCGCAC 


CATGGGTTGA 


TAGGTCATGA 


CTTCTTTGAC 


ATCTTGATGA 


ACGCGGTAAG 


1560 


CTTTATAGCC 


AGCAAAAAGG 


AAGACTGCTA 


GTACAAGCAC 


TCTTCGAATT 


CGTTTAAACA 


1620 


TTATTTACTT 


TGGATATCCT 


CGATATTTTT 


GATTAAGATA 


GAGTAGGTTC 


CATTTTCGTT 


1680 


TTGCATAAAC 


TCAACAGACT 


CGGCGTCTTG 


ATAGACGTTA 


TTGGGAACGA 


TGAGCTCAAT 


1740 


TCCATTTGAT 


AAGGAGAGTT 


TTTGGTTTTC 


AAATTTCTTT 


AATTGGCGAC 


TGGCATCAAT 


1800 


TTCATCAAAT 


TGAACAGGTT 


CTCGTACGGC 


TTCTTTGACT 


TGGTCAATAA 


AGCTCAAACG 


1860 


AGCCGTCAGA 


TTGTTGTCAA 


AAACGTCATT 


AGCCAATTTC 


TCAGGTGACA 


ATTCATTGCT 


1920 


TTCTTCTAGG 


TTGrrCAAAA 


TAGCTGATTT 


GACCTTGGAT 


TGAAATTGAA 


AATCATCTGT 


1960 


GTTAAAAGAT 


TTAGCAATTC 


TCTGGGCTGT 


TTTTTCCAGT 


TCCTTGATAG 


ATTTTTTAGC 


2040 


AGAAATCTTA 


GGACCGACAG 


CAACAAGATT 


ATCTGAAAAA 


TAGTTCAAAA 


AAGTCCCGTT 


2100 


GTACTTGATT 


CGTTTTTCAA 


TCAGGTGATA 


CTTCCTACTC 


TGAAGATTGA 


CCACCAAGGC 


2160 


CTCATCAGCT 


CCTGTTCCAA 


ATCCAGGCAG 


GTTATTCTGA 


GTTAGCTTGA 


TTGGATTATC 


2220 


AACTTCTCCT 


CCGAGGTGGG 


TCAAGGTCTC 


CCGCAGGGCA 


ATTCGCAAGA 


AAGCGAAATG 


2280 


TTCTACACCT 


TCTTTAGAAA 


ATTGCACAAA 


AATCAAGTCA 


TTGGTCTTGA 


GATTTTCAGA 


2340 


AATGCTAAAC 


TCCTCTTTCC 


AGACATTAGC 


CAGCGTTACT 


GATGTCTCCA 


ACAAATCCTC 


2400 


TGTAATATGA 


TTGAAGAAGG 


GATTTTCTTC 


TTCGAAAATC 


CCAGTCTTGC 


CTTCATCTGA 


2460 


ATACACATGT 


TCAATTTTTT 


TACGCAGGTA 


TTCTTCGATT 


TTTGGAGTAA 


TATTGAGAAA 


2S20 
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CTTATCTGCT 


AAGAACACTT 


CGGTATCATC 


CGGACTGAAC 


TGGTGAATAA 


TGGCTTTCTT 


2580 


AATATAAATG 


TCCATAAAAG 


TTTTAGTCCT 


CGTATAATGC 


GAAGGCATCT 


GTCAATTCTT 


2640 


TGACTCCACT 


TCTCACTTCT 


TCTAATACAG 


CCTCATTTTC 


TGAATTCTTA 


AGGGTTTTAA 


2700 


TGATGAGTTC 


AGCCACTTTG 


CGACTTTCTT 


CTTCACCAAA 


TCCACGTGCA 


GTAATGGCTG 


2760 


CTGCTCCGAT 


ACGAATCCCA 


CTTGTCTTGA 


ATGGTCACAA 


GCTTTCGTAA 


GGGATTGAGT 


2820 


TTTTATTTAA 


GGTAATATTG 


ACTTCATCCA 


ACAAGTTTTG 


AGCAACTTTG 


CCGTTTTCTA 


2880 


CAAC 'ITT ACT 


CACATCAACA 


AGGAAGAGAT 


GGTTTTCAGT 


TCCACCTGAA 


ATAATACGGA 


2940 


AATCAGCCTC 


TTGCAAGAAG 


ACATCTGCCA 


"TAGCCTTGCT 


GTTCTTAATT 


ACATTGGCAG 


3000 


CAT ATTCC TT 


GAAGGCTGGA 


TCCAAAACTT 


CTTTGAAGGA 


AACTGCCTTA 


GCCGCCACAA 


3060 


CATGCTCTAA 


AGGACCGCCC 


TCAATACCTC 


GGAAAATACC 


TGAATTGATT 


TTTTTAGCAA 


3120 


GTTCTTCGTC 


ATTGCTCAAA 


ATCAAACCAC 


CACGAGGTCC 


ACGAAGGGTT 


TTGTGGGTCG 


3180 


TTGTTGTTGT 


GATATGAGCG 


TATGGAACTG 


GGCTTGGATG 


AAGGCCAGCC 


GCAACCAAGC 


3240 


CACCGATATG 


CCCCATGTCC 


ACCATGAGCT 


TCCCACCGAC 


AGCATCTCCG 


ATTTCACGGA 


3300 


ATTTTGAAAA 


ATCGATAATT 


TGAGAATAGG 


CTGAAGCACC 


AGCTACAATC 


AGTTTTGGTT 


3360 


TTACTTCTTG 


GCCTTGTTTC 


AAGATAGCAT 


CAAAGTCTAA 


GAGTTCCGTT 


TTAGGATCAA 


3420 


CACTATAAGA 


AACAAAGTTG 


TAGGTTTGAC 


CAGAGAAGCT 


AACAGG AGCC 


CCATGAGTCA 


3460 


AATCACCACC 


TGATGCCAAA 


TCCATTCCCA 


TAACCGTATC 


ACCTGGCTCA 


ATCAAGGACA 


3540 


TGTAAGCCGC 


ACAGTTAGCT 


TGGCTTCCTG 


AATGTGGTTG 


AACATTGGCA 


AATTTAGCAC 


3600 


CGAAAATTTC 


TTTTGCGCGT 


TCAATACCAA 


GAGTCTCTAC 


AACCTCTACT 


ACATCAGTTC 


3660 


CACCATAATA 


ACGGCGTCCT 


GGGTAACCCT 


CGGCATATTT 


ATTTGTCAAG 


ATAGACCCTT 


3720 


CAGCTGCCAT 


AACAGCCTTG 


CAAACTACGT 


TTTCCGAAGC 


AATTAACTCG 


ATATTATTTT 


3780 


GTTGGCGTTC 


TTCTTCTTTO 


GCAATAGCAT 


TCCAGAGATC 


AGCATCATAT 


GCTTTAAAAT 


3840 


CATCTTTGTC 


AAAAATCATA 


GGTCTTCTCC 


TTTATTGTGT 


GACTAGTCCA 


TTAGTTTGAT 


3900 


TTTACAATAA 


GAAAATCAAA 


CTAACAGATG 


CGAATAAACC 


GTTTCTGCAT 


TTTATCACAA 


3960 


GTATAGCCAA 


CTTTTTCATA 


AAATGCATGA 


GCACCCAGAC 


GATGATTGGC 


AGAATTTAAG 


4020 


CGGATAAACC 


CATAACCACA 


TCTTTTTCCT 


TCTTCTTCCA 


ACCCTTGTAG 


TAAACTTTTA 


4080 


CCAATACCTT 


GACCTTGCGC 


TTGAGGTGAA 


ACTGCTAAAG 


CTAAGATATT 


AAATCCTGCT 


4140 


TTGGAATAGA 


GTGATTCGTA 


AACTTCAGCG 


TGGACATATC 


CAAGTAAGAC 


ATGATTAGCT 


4200 


CCATCCTCAT 


AGCCAAGTAG 


GAAATGATGG 


GAATCCTGAG 


ACAGTCTAGC 


TAGTTGGCTA 


4260 


GCCGTTTCCT 


CTGGACTAAA 


AGTATAACCC 


AAACCCTCTT 


GGTTGATGTC 


ACATATAGCT 


4320 



WO 98/18931 



PCT/US97/19588 



711 

TTCACATCAG TTTCTCTTAA ATCTCTTACC ATCTCATTCC TCCTCAAAAG AAATCTTTGC 4380 

CAACCGAGCA AGAATATCTT CTCGCTTAAT GGCCCCTTGA CGTAAGATTT TCACCTTGTC 4440 

TCCCGACAAA TTCAAAATAG TTGAATCCTG TCCAGTTAGA AAAGCATCGT CTTCCAGACC 4500 

CAGAACCTCT TGGTCAAAAT CCTCTAGAAT TTGATTAAAG GTCACTCCAC TCCCCTGACC 4560 

TGAGATATTG GCACACGGCC CAATCAAGGG ACCTGTCTCT CGAATCAAAT CAAGGGTAAT 4620 

GGGATGACTA GGCATCCGAA ATCCAACAGT TGCAAGGCCA GAATTGACCC AATAGGGAAC 4680 

TCGGTCATTA GCTTCGAGAA TAATGGTCAA CGGACCTGGT AAAAAGATCT CTACAAGTTT 4740 

TTGAAGATAA GTTGCCTGAT TCTTTGAAAA GTACAAGATG TCCTCTAAAG AGGCAACATT 4800 

GAGATTGAGC GCCTTGTCTC TACGTCGACG TTTAAGCTGG TAAACATGGT CAACTGCTTT 4860 

TTCGTCTAGC GCCTTAGCAA AGAGACCGTA AACTGTCTCT CTAGGCAAAA CCACAGCTCC 4920 

ACCATTTTCC AACTCTTGTC TAATCCTGTC CAT CATC AAC GACAACCATC CTATCTTGAC 4980 

CAAATTGGTC CTTGAGTCTT CGTACTCGCT TTTCAGGAAG ATGTTTCCTA AAAACTTCAG 5040 

GAACACTTTG ACCTTGCTTG TATCCAATTT CAAGGTAAAT CTTACCACCA TCTTTGAGAT 5100 

AGTCTTTTGC ATCTTCCGCA ATTCTACGGT AAATAGCTAG GCCATCCTCA TCTGCAAAGA 5160 

GAGCTAGATG AGGCTCCGAA TACAAGACAT TCAAGCCTAC CTCTGACTCA TCTTCACGAG 5220 

AGATATACGG TGGATTGCAA ACAATTATAT CAT ATTTTTC AGAAATTTCT GTAAAACAGT 5280 

CAGATTTTTT TAAAAATATT TGAAGATTTT GATTTTTAGC ATTTTCCCTA GCTACATCTA 5340 

AAGCATCTTG GGAAATATCT GCTGCCCTCA CTGACCAATC TGGTCTGTTT TTTGCTAGAG 5400 

CGAGAGCAAT AGCTCCACTA CCTCTTCCGA TATCTAGGAC CATAAGATTT TTCACAGGAT 5460 

TTTCAGCCAC GATAAGCTCC ACCAACTCCT CTGTTTCTGG ACGAGGAATC AAAACCCGTT 5520 

CATCCACCTT TAAATGCATT CCATAAAAAT CTCCCTCTCC AATGATGTAC TGAGCTGGCT 5580 

TGTGAG CTGC TAGTTGCTGG TAAATATCTT CTACAAATTG TTTTTCTTCC TCTGTTCTCA 5640 

CCTCCTGCTG GAGGGCAAAA ATAAAGTCTG TAAAAGATAG ATTTTTCAGA CTACGATAGA 5700 

CAAAAGAGAG GCTTTCCGCT TCCTCTCCTT CTCTTATCAA CTCTTCTTCA AAATTTGAAA 5760 

ATAATTGAGC TAATTTCATT ATTTGTTTAA TTCTTCTAGT TTTTGTGTTT GCTCATAAAG 5820 

CACCAAGGCA TCCACAACTT CGTCCAATTT A CC AG AC AAA ATCGTATCTA GTTTTTGGAG 5880 

GGTCAACCCG ATACGGTGGT CTGTGACACG GTTTTGTGGG AACTTATAAG TTCGGATCCG 5940 

TTCTGAACGG TCACCAGTAC CGATTGTCGA CTTACGCTCA GCGTCCTGCT CATCTTGAGC 6000 
AATCTGAGCA AAGTGGTCAG CAACACGGGC ACGGATGATT TTCATGGCCT TCTCACGGTT 



6060 
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CTTCTGCTGG 


GTACGTTCTT 


CCTGCATCTC 


AACCTTGATA 


TTGGTTGGCA 


AGTGAACGAT 


6120 


ACGAACGGCA 


GTCGCAACCT 


TATTGACGTT 


CTGTCCACCA 


GCACCAGAGG 


CGTGATAGAT 


6180 


GTCGACACGA 


AGGTCTTTTG 


GATCAATGTC 


CTATTCAACC 


TCTTCAACTT 


CTGGCATAAC 


6240 


AAGAACTGTC 


GCTGTCGAAG 


TATGAACACG 


GCCTTGGCTT 


TCTGTCACAG 


GAACACGTTG 


6300 


CACACGGTGG 


GCACCTGATT 


CATACTTAAG 


CTTAGAGTAT 


ACAGACTGAC 


CTCAAACCAT 


6360 


AGCAACCACT 


TCTTTAAAAC 


CACCGACACC 


ATTCATAGAG 


GCTTCCATGA 


CTTCAAAGCG 


6420 


CCAACCTTCC 


GCTTCCGCAT 


ACTTTTGGTA 


CATACTTACC 


AAATCTCCAC 


CGAAAAGTGC 


6480 


CGCTTCGTCT 


CCACCAGCTG 


CTCCACGGAT 


TTCAAGGATG 


ATATTCTTGT 


CATCGTTTGG 


6540 


ATCCTTTCGA 


AGGAGCAAAA 


TTTTCAGTTT 


TTCTTCATAT 


TCTTCTTTTT 


CAGCCTTGCC 


6600 


ATCTTTGACT 


TCTTGCTTGG 


CCAATTCTTC 


CAAGTCCGCA 


TCTCCGCCTC 


ATTCCTTAAT 


6660 


CATCTCTTCG 


GCATCGACGA 


TATTTTGAAG 


GACTTGTTTA 


TACTCACGGT 


AGGCTATTAC 


6720 


GGTGTCACGA 


TTGGAAGCTT 


CTTCTTTTGA 


AACCTCCATA 


AAACGCTTGG 


TGTCTCAAAC 


6780 


GACATCAGGG 


TCACTCAGCA 


AT TCTCCT AA 


TTCTTCATAA 


CGGTCTTCTA 


CAACTTGTAG 


6840 


TTGATCATAG 


ATGTTCATTT 


TTTCTCCTTA 


TTTCTCAATT 


CTTAAATCAT 


AGATTGCTAC 


6900 


TACTTCATTC 


TCGGATATTT 


CCCCAGTTTC 


TTTAAATCCA 


TAACTGAGCT 


AACAAAATCT 


6960 


TGCCTGTTCA 


TTTTCTGGTT 


CATArGACAA 


CCAAAGTTTA 


TTGCTTAAAC 


CTGCTGGCGC 


7020 


TGTTCGAACA 


TAGTCTAGTA 


CTTTATCCAT 


AATTGGTTTA 


AAATATCCTT 


GATTTTGAAA 


7080 


ATTCTTATCA 


ATCATAAAAC 


GAAATACTAA 


ATAATTTCCA 


CTACTAATTC 


CGATCTTTTT 


7140 


ATCATAAGCT 


ATCATCACAA 


AACCTATAAT 


TGCATCATTA 


TCATAAACTG 


CCAATGGAGC 


7200 


TACAAAATCT 


CCATTTTTAG 


TGTAGACGTA 


TGCTTCAGCT 


AAACTAATTG 


CGTTGGTTGC 


7260 


AATCAATTGT 


TTTTGATATT 


CCTTGACATC 


CAAATTTAAA 


ACATCAAAAT 


AATTTTCCAT 


7320 


TGTAACATCT 


CTTAGTTCAA 


TTGTCATAGT 


TTTGCTCCTT 


GTTAGAGGTT 


ATCATTGGCG 


7380 


CAAAATAATG 


TTTACGGCAA 


ACTCAGATAT 


AGGTTTCGTT 


ACCACCAATC 


TGGATCTGVT 


7440 


CTCCATCGTA 


AACGGGCAGT 


CCATCCTGTG 


TTCGCAACAC 


CATGCTCGCC 




7500 


AATACTGACA 


GATGGTCTTG 


ATTTCGTCAA 


TCTTGTCTGC 


TAAAAGCAAG 


AGATATTTGG 


7560 


AACCTTCGAA 


CAATTCATTG 


CGAAAGTCAT 


TTTTCAAGCC 


AAAAGCCATG 


ACGGGTATGT 


7620 


CTAACTCGTC 


CACAACACGA 


GCTACCTCGT 


AAACATGGTG 


GCGTTTGAGA 


AACTGGGCTT 


7680 


CATCGACCAA 


AACACAGTAA 


GGTTTTTCTG 


GTAGGTCTCG 


GATATAGCCA 


AAGATATCCG 


7740 


TTGTTTCCTC 


AATCGCAAgG 


GCAGGGCGTT 


TCATGCCAAT 


TCGACTCGAC 


ACATAGCCAA 


7800 


CGCCGTCACC 


CGTATCCAGA 


GCCGAGGTCA 


TAATCACAAC 


ACCTTTTCCT 


TGCTCCTCGT 


7860 
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ACTTATAGGC 


CACTTTGAGA 


ATCTCAATCG 


TTTTACCAGA 


GTTCATGGTC 


CCATAACCAT 


7920 


AGTACAACTG 


TGCCATGTTT 


CTTGCTTCAC 


GTCCATTTCT 


AAATTTTTGC 


TACATTCTAG 


7980 


TATATCATAA 


TTTTCTTAAG 


CTTTAAACGG 


CAAAATGTGG 


TAAAATAGAA 


GAAATCAAAA 


8040 


ACTACTGGAG 


GAAGCTATTA TGCCATTTGT 


ACGCATCGAT 


TTATTTGAAG 


GACGCACGCT 


8100 


CGAGCAAAAG 


AAAGCTCTTG 


CTAAGGAAGT 


AACGGAAGCA 


GTTGTCCGCA 


ACACTGGAGC 


8160 


CCCTCAATCT 


GCTGTCCATG 


TCATCATCAA 


CGACATGCCA 


CAAGGAACTT 


ACTTCCCACA 


8220 


AGGGGAAATG 


CGTACTAAAT 


AAGCTAGCTT 


AAGCAGAATT 


GCTTAGGCTT 


TTTCAATCTC 


8280 


CAAGTAGCAT 


TCATTGAAGA 


AATATCCTAA 


ATTTGTTACA 


ATTTGAAAAG 


AAACTTGCAG 


8340 


AATTTCCAAG 


AAAAGAGCTA 


TTAATTAAAG 


GAAACATTAT 


GATTACACGT 


GAATTTCATA 


8400 


CCATCGCTGC 


TATCTCTACT 


CCACTAGGTG 


AAGGGCCTAT 


TGGTATTGTC 


CGCCTGAGCG 


8460 


gAACAGACAG 


TTTTGCTATT 


GCGCAAAAGA 


TTTTTAAAGG 


AAAAGACTTC 


AACAAGGTTG 


8520 


CCAGCCACAC 


TCTCAACTAC 


GGTCACATTA 


TTGATCCTCT 


GACTGGTAAA 


GTCATGGACG 


8580 


AGGTTATGGT 


TGGGGCTATG 


AACTCTCCAA 


AGACCTTCAC 


TCGTGAGGAT 


ATTATCGAGA 


8640 


TTAACACCCA 


CGGTGGGATT 


GCGGTGACCA 


ATGAAATTCT 


CCAGCTAGCT 


ATTCGTGAAG 


8700 


GGGCTCGGTT 


GGCAGAACCT 


CGTGAATTTA 


CCAAACGTGC 


TTTTTTAAAC 


GGTCCCCTAG 


8760 


ACTTGACACA 


GGCAGAGGCT 


GTGATGGATA 


TCATCCGTGC 


CAAGACTGAC 


AAGGCCATGA 


8820 


ACATTCCGGT 


CAAACAATTA CACGGCTCCC 


TTTCTGACCT 


CATTAACAAT 


ACCCGTCAAG 


8880 


AAATCCTCAA 


TACACTTGCC 


CAAGTTGAGG 


TCAATATCGA 


CTATCCTGAG 


TATGACGATG 


8940 


TTGAGGAAGC 


CACTACTGCT 


GTTGTCCGAG 


AGAAGACAAT 


GGAGTTTGAC 


CAATTACTAA 


9000 


CCAAACTCCT 


TAGGACAGCA 




AAATCCTTCG 


TGAAGGAATT 


TCAACGGCTA 


9060 


TCATTCGACG 


TCCCAACGTT 


GGGAAATCAA 


GCCTTCTCAA 


CAACCTCTTG 


CGTGAGGACA 


9120 


AGGCTATCGT 


AACAGATATC 


GCTGGGACAA 


CACGAGATGT 


CATCGAAGAC 


TACGTCAACA 


9180 


TCAATGCTGT 


ACCTCTCAAA 


TTGATTGATA 


CAGCCGGTAT 


TCGTGAAACG 


GATGATATCG 


9240 


TTGAACAAAT 


TGGACTTGAG 


CGTTCGAAAA 


AAGCTCTTAA 


GGAAGCTGAC 


CTAGTTCTGC 


9300 


TAGTACTAAA 


CGCTAGTGAA 


CCACTAACCG 


CCCAAGATCG 


CCAACTCCTA 


CAAATCAGTC 


9360 


AGGAGACTAA 


TCGCATTATT 


CTTCTTAACA 


AAACTGACCT 


GCCTGAAACG 


ATTGAAACTT 


9420 


CGGAACTACC 


TGAAGATGTC 


ATCCGCATTT 


CAGTTCTTAA 


AAATCAAAAC 


ATCGATAAAA 


9480 


TCGAAGAGAG 


AATCAACAAC 


CTCTTCTTTG 


AAAATGCTGG 


TTTGGTTGAG 


CAAGATGCTA 


9540 


CCTACTTGTC 


AAACGCCCGT 


CACATTTCCT 


TGATTGAGAA 


GGCCGTTGAA 


AGCCTACAAG 


9600 
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CTCTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT CCTTCAAGTT GACTTGACCC 9660 

CTACTTCGGA AATTCTAGGA GAAATCACTG GAGATGCTCC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAC CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGCATTTTAC GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GCATACACAC AAGGAAATCA TCCCCAAACT 10020 

GGACTACGAC GCCCCATCTT CCCCTGAGTG CCGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT CCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATC ATTTTTCTTG TCTTCCTGAC ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



GTTTTCGCTT 


CTGGAAATTA 


TCAGATGGTT 


GGAAAAGCCG 


TCCACATCAA 


CATAGTGTTC 


60 


GGAGATTTAA 


GTTTAAATTG 


AAGAAACTAA 


CACAGAGGAA 


ATGGAGTATA 


GACCTAACAA 


120 


GACGTATTGA 


GCAACTGAAT 


TTGTCTATTC 


GACGATGGAT 


AAACTATTGC 


TCATTGGGAA 


180 


ATATGAAAAG 


TATAGTCCCC 


AGCATAGATG 


ACCGCTTGCG 


TACTCGCCTA 


CGAGTGATTA 


240 


TCTGGAAGCA 


ATGGAAGAAG 


AAATCGAGAC 


GATTATGGGG 


ATTGCTTAAG 


TTAGGAGTTC 


300 


CTAAATGGAT 


AGCAGATAAG 


GTATCTGGCT 


GGGGCGACCA 


TTATCAATTA 


GTAGCTCAGA 


360 


AGTCGGTACT 


TAAACCTGCT 


ATATCAAAAC 


CACTCCTGGA 


AAAACGTGGA 


CTGGTTTCGT 


420 


CTTTCCATTA 


TTACCTTGAA 


CGACATGCCT 


TAAAAGTTAG 


TTGAACCGCC 


GTATGCCAAA 


480 


CGGCACGTAC 


GGTGGTGTGA 


GAGGGGCTAG 


AGATTATCCC 


CTACTCGATT 


AACTCCCCTG 


540 


AAATTTATTT 


TAATTATGCA 


AATTTCACGT 


ATTTTTGATG 


CTGAGACGAC 


GATCCTGGGA 


600 
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ACTTTTCAGA 


TATTTTTTTG 


ACTATCTAAA 


TCTATCATTA 
TTTTTCCAGT 


GAAAAGCTTA 


CAGCGCCAAA 
ATTGAAGATG 


660 
720 


GGATTTGAGC 


GTTTTTCTCA 


TTTTTAAGAC 


ctctttttcc; 


TAATTATTCT 


ACTAACTAAC 


TAACTTCTTA 


GTACTAGCCA 


ACAACGATAA 
TTCATGTAAA 


TCATAATTCC 
AAACCTCACT 


780 
840 


TCCTAAAATT 


AGGAATAATA 


AAGGCAATAG 


ri'rlTGTTTT 


TTTGTTTTCT 


GCTATTTTAT 


GCTAAAATAT 


TAAAAATCAA 


ATTTAATTCC 


AAAGTTTGTA 


900 


ACTAAAGGGG 


GAGCGCTACA 


TGTCTAATTC 


ATTTGTCAAG 


TTGTTAGTCT 


CTCAATTATT 


960 


TGCAAATTTA 


GCAGATATTT 


TCTTTAGAGT 


AACAATCATT 


GCTAACATAT 


ACATTATTTC 


1020 


AAAATCACTA 


ATTGCCACAT 


CACTAGTTCC 


TATCTTAATA 


GGAATATCCT 


CTTTTGTTGC 


1080 


GAGTCTTTTA 


GTTCCGTTGG 


TTACTAAAAG 


GTTAGCGCTA 


AATAGGGTTT 


TATCTTTATC 


1140 


TCAAlTlwA 




TATTGGCGAT 


ACTGGTAGGA 


ATGTTTACCG 


TAATGCAATC 


1200 


CGTAGCGCCT 


TTGCTGACCT 


ATCTATTTGT 


TGTTGCAATT 


TCCATACTAG 


ATGGTTTTGC 


1260 


AGCACCCGTT 


TCCTATGCTA 


TTGTGCCACG 


CTATGCGACC 


CATTTGGGTA 


AGGCTAATTC 


1320 


AGCCTTATCA 


ATGACTCGTG 


AAGCTGTTCA 


ATTGATAGCT 


TGCGGATTAG 


GTGGACTCTT 


1380 


GTTTGCAACA 


ATTGGTCTGT 


TACCTACCAC 


CTGTATCAAT 


TTAGTCTTGT 


AT ATC ATTTC 


1440 


TAGCTTTCTG 


ATGTTATTTC 


TTCCTAACGC 


TGAACTGGAG 


CTGTTAGAGT 


CAGAAACTAA 


1500 


TCTTGAAATT 


TTGCTCAAAC 


CTTGGAAGTT 


AGTTGCTAGA 
AAATACGATT 


AATCCTAGAT 


TAAGACTTTT 
CCATTATACT 


1560 
1620 


TGTATCAGCA 


AATTTATTGG 


AAATTTTTTC 


TGGGTTlvrr 


TGTTTTTGTA 


ACCGAGTTAT 


TAAATAAAAC 


GGAAAGTTAC 


TGGGGATATT 


CTAATACAGC 


1680 


ATACTCTATT 


GGTATTATAA 


TTAGTGGCTT 


AATTGCTTTT 


AGGCTATCTG 


AAAAGTTCCT 


1740 


TGCTGCTAAA 


TGGGAACCCC 


AATTATTCAC 


CCCAAATCTA 


AAAACCATCC 


AGAATCCTTG 


1800 


CCTTAGCTTA 


GATCCTGGAT 


GGTTTCTTTT 


TTCACCCAAT 


GGCTGTTTTT 


TACTAGACAA 


1860 


AAAAGAGTTT 


CCCCTTTATG 


GTATAAGTGT 


AGAAAAAAAC 


ACAAAAAGAA 


AGCAAACTCA 


1920 


CATGAACAGT 


TTACCAAATC 


ATCACTTCCA 


AAACAAGTCT 


TTTTACCAAC 


TATCTTTCGA 


1980 


TCGAGGTCAT 


TTAACCCAGT 


ATGGTGGTCT 


TATCTTTTTT 


CAGGAACTTT 


TTTCCCAGTT 


2040 


GAAACTAAAA 


GACCGGATTT 


CTAAGTATTT 


AGTAACGAAT 


GACCAACGCC 


GCTACTGTCC 


2100 


TTATTCGGAT 


TCAGATATCC 


TTGTCCAGTT 


CCTCTTTCAA 


CTGTTAACAG 


GTTATGGAAC 


2160 


GGACTATGCT 


TGTAAAGAAT 


TGTCAGCTGA 


TGCCTACTTT 


CCAAAATTGT 


TGGAAGGAGG 


2220 


GCAGCTTGCT 


TCACAGCCAA 


CCTTATCCCG 


TTTTCTTTCC 


AGAACTGACG 


AGGAAACAGT 


2280 


CCATAGTTTG 


CGATGCCTCA 


ACCTTGAATT 


GGTCGAATTC 


TTTTTACAGT 


TTCACCAGCT 


2340 
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AAACQAACTC ATTGTACATA ACGATTCTAC CCATTTCACA ACTTATCGCA AGC 2393 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

<A» LENGTH: 4762 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



fxi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


91: 






TTTGTATCTT 


TTTAGGTCTC 


TTTCAATCCA 


AACCCTTTAA 


ACTATACGTC 


ATTTCGGTTC 


60 


CTCCAAGTCT 


TGTGGTAATT 


TTAGGTTTGA 


TTTTACTTTT 


CTTTTCACAA 


GAGCCTCTGC 


120 


ACGCTTCTTA 


TTTGATGGTC 


GTCTTCCCTG 


TTTTCCTACT 


TTTATTGGTA 


ACCAATATTA 


180 


AGAGTCAACA 


GAGGGGCCGT 


AGTGCTAGAA 


GAAGCCGAAG 


AGAAACGCCA 


TTATGCCTAT 


240 


GGAGTCGTTT 


CTTCAAAGGA 


AATCTATATC 


TGCTAGTTTT 


TGCGTTTCTC 


TATCTTTTCT 


300 


CTGTTCCTTT 


TTTGATGAAG 


TTTCTCCTTT 


ATCCACTACC 


TTATCAAGAA 


CCTAATCGTC 


360 


TTGCTGATTT 


GGTAAAAGAG 


GAGACAAATA 


CGGAAGATGC 


TATCTCATGC 


ATGGGATGAT 


420 


ACTCCGACTC 


TTTATCGTAA 


GAGTGAGCGC 


TTGTCCCATC 


GGCCATTTTG 


TCCCCGTTGC 


480 


ACTATACAGC 


AACTGAGGAA 


AATCGTAATA 


AGTTACTTAA 


TGACTTCAAA 


GAAAAACAAC 


S40 


CTAAGGTGAT 


TGTGGTAAAT 


GATAAGGTGG 


TAGTCTGGTC 


TGAACTGGAA 


ACACTCTTAA 


600 


AAGAAAATTA 


CCAACAAGTA 


AACACTGATT 


ACTCAGAGTT 


TAAAGTCTAT 


AAAATTAAAT 


660 


AACCAAATCA 


ATATCTTGTG 


TATTTTTAAA 


AATTTTAGGA 


TTTTTAACAC 


AAGATATTGA 


720 


TTTTTCTTTT 


TAGAGTGCTA 


TAATACTTTT 


TAGAAAGAAC 


ATTTTAGAAA 


AGAGCATGCA 


780 


TATGATTGCA 


CTAGAAGAAA 


AAATTACAAT 


TTTGCCAACT 




AGAAACGAGA 


840 


TGGGAGACGT 


GTTGTATTTC 


ATGTGGACAA 


GATTGACAAG 


CCTCTCCACA 


AGGCGGCTGA 


900 


CAAGGTTATG 


GATGTGACAC 


CCCTGGTTGA 


AAAATGCCTC 


AATGATCTGA 


CTGACCGAAT 


960 


TATTACAGAA 


ATTCATAGTC 


GCTTTCCACA 


GGGAATTAAG 


ATTTACGAAA 


TTCAAAATAT 


1020 


CGTAGAACAT 


CAACTCCTTG 


AAGCCAAAGA 


ATATGCGCTG 


GCTCAGGACT 


ATATTACTTA 


1080 


TCGGACACAG 


AGCCATTTTG 


AGCGCTCAAA 


AGCGACGGAT 


ATCAACTTTA 


GTATTCATAA 


1140 


ACTTCTCAAC 


AAAGACCAGA 


CAGTTGTCAA 


TGAAAACGCT 


AATAAAGACA 


GTGATGTCTT 


1200 


TAACACTCAG 


CGTGATTTGA 


CAGCAGGGAT 


TGTTGGGAAA 


TCAATCGGAC 


TGCAAATGCT 


1260 


TCCTAAGCAC 


GTAGCCAATG 


CCCACCAAAA 


GGGGGATATC 


CACTATCACG 


ATTTGGACTA 


1320 


CAGTCCCTAT 


ACCCCTATGA 


CCAACTGCTG 


TTTGATTGAT 


TTTAAGGGTA 


TGTTGGAAAA 


1380 



{ 

i 
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TGGTTTTAAC ATTCGAAATG CAGAGGTAGA GAGTCCCAAG TCT ATCCAGA CTGCGACACC 1440 

ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 

CCGTATCGAT GAAATTTTGG CGCCTTATCC ACAGAAGAAT TATCAAAAAC ATCTCAAAGA 1560 

TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 

GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1660 

TGGACAAACA CCTTTTACTT CGTTAG G TTT TGGTCTGGGA ACCAGTCGTT TTGAACGAGA 1740 

AATTCAAAAA CCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGCC 1800 

TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGACG AAGGAACTCC I860 

CAACTATGAC ATCAAGCAGT TGGCTCTAGA CTGTGCAACC AAGCGGATGT ATCCAGACGT 1920 

CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 1980 

TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGCTGTAGAA GTCAATTCA6 GTCGCATGAA 2040 

TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 2100 

TAAGTTCTGG CAAATCTTCA ACGAGCCAAT CAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2220 

TTTTGGCCAT CCTCTAGGTA AAGAAGAAAC TGTTGACCAG CTCTTTAAGA ATCCTCCTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TCAACTACCG ACAGTTTTCT TTGGTAACAG 2340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATACAC AA G TT T OCCT CTATTCCTGA 2520 

TATCACAGAC AACGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2580 

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCGGAA CCAGGTCCCT CAGGTGGTTT 2640 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2700 

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACTCCGATTG ACCGTTGCTA 2760 

CAAGTGTGAC TTTGAAGGCG ATTTTCAACC AACTGAGAGA GG G TTTGCTT CTCCAAACTG 2820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTCGCT - ACCTAGGTAA 2880 

TCCTCAAGCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCCTTA TCACGAGAAA 3060 

CACTCTAAAG CTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 3120 
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AACAACAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180 

AAGGATGGAA TTACGCACAC CAAGATTAGC GGATAAGAAA GCT G TTTTAG ATATGATGAC 3240 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300 

GTATGAAGAC TGGTTACAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360 

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 - 

CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTC C GTC AGGGCTTGCA 3540 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600 

TAGCAGAGCA CTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 3660 

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCCTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTCTA TGTTTCACTG CGAGCGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900 

CAGACCTTCC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGCAA GGAATTGCCA GACAAGGACA 4020 

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTCTCG ATGGAACATA TGATCGAACT AACAGAAATC 4140 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 4200 

AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 4260 

TGAAGAGAGA ATGAAGAAAA AGGACTTACT AGACCAACTA GTCTCAGAGA TCGAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGCTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4380 

GGAATTCTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACCCGCCG AATCAAAAGG TGACAGCCAG 4SO0 

TCTGCCAGTG GCCCATGAAC TGCAGAGTTT GCAGAGAGAT ATC C TT G CTT CCAGGCGGGT 4S60 

ATGGATGTCT TGACAATTGA ACAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680 

ACCATCTGTT TCTACACGGA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 4762 
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3832 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92i^ 



GATGCACCTT 


TCGACCCACA 


TATTCCAGAA 


AATTACTTTA 


AAGATGATGA 


TCTTAATCAG 


ou 


GTACCTTCTC 


TTTGl'TGGTC 


TTCATCTGCA GCCCTCTTTT 


TCAGTAATTG 


GGTAGACCAT 


JL2U 


GCGGTCTATC 


AGGAGACGCC 


TTTTGATTGG 


AGAAAGATAG 


AAGATGATGC 


ATCTGCATAT 


i on 


GGGTATTTAT 


AAGAGGAATT 


ATGACATATT T AG ACGCTTT TAAATCAGGT 


ACCTTGCTTT 


240 


TACCGAGTGC 


CCTGCTCTTG 


CATTTTAAGG 


AACTCTTTCC 


TTCTAGCGAC 


GATTTTCTGG 


300 


TTTGGCAATT 


TTTCTATTTG 


CAAAATACGA 


CACGCTTAGA 


AGAAATGTCG 


CCAAGCCAGA 


360 


TTCCTCAAAG 


GATTGGCAAG 


GAAATTTCGG 


ATGTCAACCA 


GTCCATTTCT 


AATCTGACGG 


A ^ t% 

420 


AAAGGGGACT 


GCTCCAGTAT 


CCTACTATCG 


AATTAAATGG 


CGAAATTGAA 


TTGCTCTTTG 


480 


ATGCTAGTTT 


GGCCTTGGAA 


CGTTTGGATG 


ACCTGTTTGG 


AGCAGTTCAT 


TCAACTTCAG 


540 


ACCAGCTAAC 


ACCTCAAAAC 


CAGCTCAAGC 


ATTTGGTGGA 


AACCTTCCAG 


CAGGAGTTGG 


est ft 


GACGATTGTT 


GACGCCTTTT 


GAGATTGAGG 


ATTTGACCAA 


GACACTAAAG 


GAAGATGGAA 


o bw 


CCAGTGCTGA 


CTTGATTAAG 


GAGGCTCTTC 


GTGAAGCTGT 


TTTGAATGGA 


AAACCAAACT 




GGAAGTACAT 


TCAGGCGATT 


TTGAGAAACT 


GGCGCCATCA 


AGGAATCAAG 


ACTGTGGCTC 


"7 an 

'OU 


AAATTGAGGC 


CAAGAGAGCA 


GAAAGAGAAG 


CAAGCAATCC 


TCAGTTGACA 


CAGGTATCTG 


840 


CAGATTTCAT 


AAATCCCATG 


GATCTCTGGA 


AGGATTAATC 


CATGCAAGTA 


CGCTTGAAAT 


900 


CCGAGTAACA 


TTTCCAAGCT 


CTGTATAATT 


GTGATAGAAT 


AAATAGAAAA TAAATTGAAA 


960 


AAAGAGGTAT 


GTGAAATGTC 


ACGTAAACCA 


TTTATCGCTG 


GTAACTGGAA 


AATCAACAAA 


1020 


AATCCAGAAG 


AAGCTAAAGC 


ATTCGTTGAA 


GCAGTTGCAT 


CAAAACTTCC 


TTCATCAGAT 


1080 


CTTGTTGAAG 


CAGCTATCGC 


TGCTCCAGCT 


CTTGATTTGA 


CAACTGTTCT 


TGCTGTTGCA 


1140 


AAAGGCTCAA 


ACCTTAAACT 


TGCTGCTCAA 


AACTGCTACT 


TTCAAAATGC 


AGGTCCTTTC 


1200 


ACTGGTGAAA 


CTAGCCCACA 


AGTTTTGAAA 


GAAATCGGTA 


CTGACTACGT 


TGTTATCGGT 


1260 


CACTCACAAC 


GCCGTGACTA 


CTTCCATGAA 


ACTGATGAAG 


ATATCAACAA 


AAAAGCAAAA 


1320 


GCAATCTTTG 


CGAACGGTAT 


GCTTCCAATC 


ATCTGTTGTG 


GTGAATCACT 


TGAAACTTAC 


1380 


GAAGCTGGTA 


AAGCTGCTGA 


ATTCCTAGGT 


GCTCAAGTAT 


CTGCTGCATT 


GGCTGGATTG 


1440 


ACTGCTGAAC 


AAGTTGCTGC 


CTCACTTATC 


GCTTATGAGC 


CAATCTGGGC 


TATCGGTACT 


1500 
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CGTAAATCAG 


CTTCACAACA 


CGATGCACAA 


720 
AAAATGTGTA 


AAGTTGTTCG 


TGACGTTGTA 


1560 


CCTGCTGACT 


TTGGTCAAGA 


AGTCGCAGAC 


AAACTTCGTC 


TTCAATACGG 


TGGTTCTGTT 


1620 


AAACCTGAAA 


ATGTTGCTTC 


ATACATGGCT 


TGCCCACACG 


TTGACGGTCC 


CCTTGTAGGT 


1680 


GGTCCGTCAC 


TTGAAGCTGA 


AAGCTTCTTG 


GCTTTGCTTG 


ACTTTGTAAA 


ATAATCAGTA 


1740 


AGTAGCAAAA 


GCTAGGTGGA 


ACAGCATTCA 


GATGTCTGTT 


ACATTTTTTA 


TAGGAGAGAA 


1800 


AGATTGAAAA 


CAAAAATTGG 


ATTAGCAAGT 


ATCTCTTTAC 


TAGGCTTGGC 


AACTAGTCAT 


1860 


CTCGCTGCAA 


ATGAAACTGA 


AGTAGCAAAA 


ACTTCGCAGG 


ATACAACGAC 


AGCTTCAAGT 


1920 


AGTTCAGAGC 


AAAATCAGTC 


TTCTAATAAA 


ACGCAAACGA 


GCGCAGAAGT 


ACAGACTAAT 


1980 


GCTGCTGCCC 


ACTGGGATGG 


GGATTATTAT 


GTAAAGGATG 


ATGGTTCTAA 


AGCTCAAAGT 


2040 


GAATGGATTT 


TTGACAACTA 


CTATAAGGCT 


TGGTTTTATA 


TTAATTCAGA 


TGGTCGTTAC 


2100 


TCGCAGAATG 


AATGGCATGG 


AAATTACTAC 


CTGAAATCAG 


GTGGATATAT 


CGCCCAAAAC 


2160 


GAGTGGATCT 


ATGACAGTAA 


TTACAAGAGT 


TGCTTTTATC 


TCAAGTCAGA 


TGGGGCTTAT 


2220 


CCTCATCAAG 


AATGGCAATT 


GATTGGAAAT 


AACTGCTACT 


ACTTCAAGAA 


GTGGGGTTAC 


2260 


ATGGCTAAAA 


GCCAATGGCA 


AGGAAGTTAT 


TTCTTGAATG 


GTCAAGGAGC 


TATGATGCAA 


2340 


AATGAATGGC 


TCTATCATCC 


AGCCTATTCT 


GCTTATTTTT 


ATCTAAAATC 


CGATGGAACT 


2400 


TATGCTAACC 


AAGAGTGGCA 


AAAAGTGGGC 


GGCAAATGGT 


ACTATTTCAA 


CAACTGGGGC 


2460 


TATATGGCTC 


GGAATGAGTG 


GCAAGGCAAC 


TACTATTTGA 


CTGGAACTGC 


TGCCATGGCG 


2S20 


ACTGACGAAG 


TGATTATGGA 


TGGTACTCGC 


TATATCTTTG 


CGGCCTCTGG 


TGAGCTCAAA 


2580 


GAAAAAAAAG 


ATTTGAATCT 


CGCCTGGGTT 


CACAGAGATG 


GTAACCGCTA 


TTTCTTTAAT 


2640 


AATAGAGAAG 


AACAAGTGCG 


AACCGAACAT 


GCTAAGAAAG 


TCATTGATAT 


TAGTGACCAC 


2700 


AATGGTCGTA 


TCAATGATTG 


GAAAAAGGTT 


ATTGATGAGA 


ACGAAGTGGA 


TGGTGTCATT 


2760 


GTTCGTCTAG 


GTTATAGCGG 


TAAAGAAGAC 


AAGGAATTGG 


CGCATAACAT 


TAAGCAGTTA 


2820 


AACCGTCTGC 


GAATTCCTTA 


TGGTGTCTAT 


CTCTATACCT 


ATGCTGAAAA 


TGAGACCCAT 


2880 


GCTGAGAGTG 


ACGCTAAACA 


GACCATTGAA 


CTTATAAACA 


AATACAATAT 


GAACCTGTCT 


2940 


TACCCTATCT 


ATTATGATGT 


TGAGAATTGG 


GAATATGTAA 


ATAAGAGCAA 


GAGAGCTCCA 


3000 


ACTGATACAG 


GCACTTGGGT 


TAAAATCATC 


AACAAGTACA 


TGGACACGAT 


GAAGCAGGCG 


3060 


GGTTATCAAA 


ATGTGTATGT 


CTATAGCTAT 


CGTAGTTTAT 


TACAGACGCG 


TTTAAAACAC 


3120 


CCAGATATTT 


TAAAACATGT 


AAACTGGGTA 


GCGGCCTATA 


CGAATGCTTT 


AGAATGGGAA 


3180 


AACCCTCATT 


ATTCAGGAAA 


AAAAGGTTGG 


CAATATACCT 


CTTCTGAATA 


CATGAAAGGA 


3240 


ATCCAAGGGC 


GCGTAGATGT 


CAGCGTTTGG 


TATTAAGCCA 


TCATTTGAAA 


GAGGGATGTG 


3300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT 
TTCCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA 
AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG 
ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG 
CAAAACTTGT GCTTTGGTCA AGCTGGGTTT ACTTTCCTAG 
GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT 
GCAACTTTTC TCATATAAGT TCTTTCTCTT GCTATTGGTT 
CTACTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG 
TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA 
(2) INFORMATION FOR SEQ ID NO! 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CCTCGAGTAA 
CAAATACCTT 
GTTTCATCTT 
GTCAGTTCTA 
TTTGCTGATG 
TCTTAAATTA 
TTCCTTACTA 
AGGTAAAGAA 
AAGCAGCGCC 



ATTCAAGTTC 
GTGATTGGTA 
ATTTCAATCT 
CCTGCGACCT 
GATTTCCATT 
TCAGTCTATT 
CTATACTAAG 
AATGGTAGAA 
GG 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3832 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TGAAAAAATC CTCATGAACC TGGCGCCAAT AGACAAGTCT CTTGTTTCCC 
TATAGGCATG GTCAGCTGAC ACTCGATTGA AGGGTTTAAC ACAAACCTTT 
CAATGCAGAC AGCCTGATTT TGACTATCTA AAATGACATC GAAGGTCCCT 
CTGGTTCGTC TTCTAGCACA TAGAGGTCAT AGCCTGATGC TGTTGCTGTC 
TAAACACCAA ATCCGCTAAA AGCTCTGGTT CAACTCCAAA AGCCCAGGCA 
CTCCGATCAA AGGATTGATT TGCTTGTATT TATTCCACAT TTCTTCCGGT 
CTCCTTTCTA ATTTTTTACT TTCITCTTTT atgtgtttaa gatgatctcg 
TCTAAATCAA AAATCTCTGG AATAGAACTG TAGTGGATAA TGCACTTGAT 
TTCATTTTTT GTATGAAAGA AGTATTCAGA TAGCCTGCTA CAGCAAAATC 
TTTCTTGCTT TATCCTGCAT ATCTCTTAGC ATATCTAACA TTATTGGACT 
TGCCATTGAC TGTTTCTCAT ACTCGCAAAA ACAAAGGAAG TCAAATCATT 
ACAATCTTTG AAATGCCCCT TTCCAGTATA CTAGATAAGT CAAAATACGC 
TCAATCATCC TTCCGACTTT CCCAGTAAAA CCCTGCTGAC GCAATACTGT 
TTTAATTGGT CGGCATCATT GACAAAAGGA AAGATAACAG ATAGATTGGG 



TCACCTTCCT 

GTAATTTCGA 

ACTTGGGGAA 

TTTTCTCCTT 

TCGATTTCAT 

ATCATGGGTG 

ATGCTCAATC 

ACCCAACTGA 

AATCTTGTTC 

TTCCATATCA 

CATTCCAACT 

TGACGGTAAT 

AATACCTTGT 

GTTGGTTTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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TAAACTTCTG 


TAACGACATG 


TGCTTCAGCC 


TGAAATTCAT 


CCAAACACGC 


CAGTAAACCC 


900 


CTACTTCCTC 


TATAGCCAAA 


CAAGGGATGC 


CCTTCCTCAA 


AAAACTCTTT 


AGTCCCCACT 


960 


AAACAATTGG 


CTTCTGTATT 


CGTTAATTCA 


CTAAAACCAT 


ACCAAACTTC 


CTTACCTAAG 


1020 


TAAAAGGAGC 


AAATAGTATC 


AAGATAATCT 


TTCACAAATT 


CCTGACAACT 


TTGTAATAGT 


1080 


ATATTTTGAT 


TGAGCTCTCT 


CAATAAGTAT 


TCCCCACCAA-TCATGCCGAC 


GTGGTGAAAT 


1140 


AGTTGAGGAT 


AAATTTTTTC 


AAGAATTTTT 


TCGCCACTAA 


GGGCAAGTTG 


ATTTCTCATC 


1200 


ATTCACCTTC 


CAATTCATCT 


AAGAAGTCTT 


GTCCAGTTCT 


GGAAATCCTA 


ATAATTCAGA 


1260 


CTTAACCTTC 


AAGACTAATG 


GCCATGCATT 


TTCTTCTGTA 


ATCTCTTGAA 


TATCCATCCA 


1320 


AATATATCCA 


AGTGAATCAT 


TCGCACCATC 


AGACACAGCT 


TCCGAAATCG 


TAACTTGAGG 


1380 


TGCACTCTCA 


TTCATTTCAA 


CATCATACAA 


GGCTATGACA 


TGGTGAACCA 


TAAAATTTTT 


1440 


TAACTCTTCC 


CTGACGAAAA 


CATCCTAGAT 


TCGAGGATTA 


GAGTAGCTTC 


TAACAGTAAA 


1500 


TCCCGTCTCT 


TCCATAACTT 


CTCTAGTCAG 


CGTTTCCGTC 


ACTCCTTCAC 


CAAGTTGCTG 


1S60 


ACTGCCTCCA 


GGTAGATCAT 


ACCGATGTTG 


ATAAGGGCCT 


CTCGTTTTTT 


CAATGCAAAG 


1620 


TAACTTTCCA 


TTTTCAAAGC 


AAACACAGTA 


GACCCCAAAG 


TCATTTTTGA 


TTTCCATCCA 


1680 


ACTCCTCCTA 


CTTCAAAGAC 


CAGCCACCAT 


CTATTGTCAA 


GATTTGTCCT 


TGCATGGCGC 


1740 


TCGCTTTTCC 


ACTTGCTAAA 


AAAAGACTAA 


GCTCTGCTAT 


TTCCTCTGGC 


TCAATCCAGC 


1800 


GCTTGATTGG 


GGTTTCACTA 


GCCACCCACT 


CAGCCAAACC 


ACCTGCTTCA 


AAATCCGCAG 


1860 


CGCTCATAGC 


TGTCTTGACT 


GCTCCTGGAC 


CGATACCAAA 


GACCTGAATC 


CCAGCTTCAG 


1920 


CATAGTCTAG 


AGCCAACTGC 


TTGCTGAAGC 


CAGCCAACGC 


ATGCTTGGAT 


GAAGTATAGG 


1980 


CGTGACCACC 


TCCACCTGCT 


AGGCTAGAAG 


CAATGGAACA 


CATATTGATG 


ATGATTCCCT 


2040 


TTTTATTTTC 


CAGCATTTGT 


GTCAAATAAT 


ACCCAGTCAA 


CTCTACTGGA 


ATAATGTAGT 


2100 


TGATTTCAAA 


AATCTCTTGA 


ATGTCCTGCG 


CCGTTTGTTC 


CAACAGTGGT 


TTGTAATCAT 


2160 


CCAAAACTCC 


AGCAGTATTA 


CACAAAACAT 


CCACCTGAGG 


GCACCAGTCA 


AAAATAGGTT 


2220 


CCAAGTCCAA 


GGTCAAATCT 


CTCTGTAAAA 


AGCGAAAATC 


ACCCTCTAAG 


AGTGGCTTTT 


2280 


CACCTTGGTC 


AACTCCATAA 


ACTTGATAGC 


CCTTCTCTAA 


AAAGAGGCGA 


GCTTGAGCCA 


2340 


ATCCGATCCC 


TGAACTCACT 


CCTGTAATGA 


GTACACGTTT 


AGTCATGCAC 


TTCTACCCAA 


2400 


TCCGTTGCCA 


AAACATCACA 


AACTGTCGGG 


CTCCACATGG 


AAAAACCTTC 


TCCTTCGCCA 


2460 


GAAACGTTGA 


TTAGGAAATA 


AGGTGTCATT 


TCAAGTGCAA 


GCCCATTTTG 


CTCGATGGTA 


2520 


TCAAAGAGTT 


GGACATAGTT 


TTCCGCACCT 


CCCCAACCAG 


TTCGTACATA 


TTTTCTCTTA 


2580 


GCCTTTAACC 


CAGGCAGGAT 


CTCTTCAAAT 


GTCATGTTTT 


TCTCCTTTAA 


TTCTACATTC 


2640 
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TTCATTTAAT 


TATAGCAAAA 


AACCGCTTTA 


TACCGCTTTT 


TGAATGTGAC 


TTATTCAAAC 


2700 


CTGCTACTAC 


TTACGGCAAA 


TTATTCCCTG 


CAGCAAGATA 


AATTTCATAC 


CATTCTTTTC 


2760 


TTGTTAAGCT 


AAAGTTTGCC 


GCTCGGCTAA CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 


2820 


CTACGACTGC 


CTGCATTTTT 


GCTGGATAAC 


GCAATATCCA 


AGAAATGGCA 




2800 


AGGTTACTCC 


ATATTTAATA 


GCTAAACGAT CAAGTACTTG 


ATTTAAAGCT 




2940 


CATTTCCAAC 


AAAATTCCCT 


TTAAAATACC 


CGAATTGTAA 


GACAGACCAT 


GCTTGAATGA 


1 AAA 


CCACATCGTC 


TAATTGGCAA 


TATTCAAAAA TGCTGCCATC 


TCGCATAGCT 


GCTTGACTAT 


JUDU 


CTTC CATATT 


AACATGAAAA 


GCTGATTCAA ATCCTGGACT AAAAGCCGCA 


CTCAATTGTA 


J 1«U 


GCTGATTAAC 


AGCTAACGGC 


TGCTTGACAT 


CTTTTTTAAG 


CAACTCCATC 


ATCATAGGAT 


1 1 OA 


TTTGATTAGA 


AACTCCAAAA 


TCTCGAACTT 


TACCTTGTTT 


ATAAAGGAGA 


TTAAAGGCTT 


3240 


CTGCTACTTG 


GTCAGATTCC 


ATCAAAGCAT 


CTGGTCGATG 


AAGGACCAAG 


CTATCTAGAT 


3300 


GATCAATCTT 


CAATCTTTGC 


AAAATACCGT 


CTACTGATTT 


TATAATATAC 


TCCTTAGAAA 


3360 


AATCAAAATA 


GGTAAATTCT 


TCAATGCGAA 


TGCCACATTT 


GCACTGAATC 


CACATCTTTT 


3420 


CTCTTAAATC 


TGGACCATTT 


TTTAGGACAA 


GACCTAACAG 


TTCTTCACAA 


CGACCACGAC 


3480 


CAT AAATATC 


AGCCAAGTCG 


AAGCCATTGA 


TTCCAACAGA 


AAGTGCTGTT 


TCTACAAGCT 


3540 


CTTCAACTTC 


TTTTACACAT 


TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 


3600 


ATTCTTTGTC 


ATCTTGACCA 


AGAGTTATGT 


ATCTCATCAA 


ATTTTTCTCC 


TTTAATTTCT 


3660 


AACATTCTTC 


CCTTCATTAT 


AACAAAAAAC 


CGCTTTGCAA 


CGACTTTTTG 


ACTATACTTC 


3720 


ACTCCATTTT 


ATCTTCTTAA 


ACCCACGGAA 


CAAGACAAAG 


ATTCCAATAA 


AGAGGACAGC 


J7HO 


TAAAGGAATA 


ACTTTTGTAA 


GGAAAACATT 


TGAAATTCCC 


ATCCACTCAT 


AATAACGGAG 




CAGAGAACCC 


ACCACAAGAT 


CGGCAATAAT 


CATACTGACA 


AATGGACGAA 


AGACCGCTTC 


J7Uv 


TTTCCAATTC 


CAAATACCGA 


TAACTAGCCA 


AATCGTAAAG 


ACAGACAAAC 


TATCCCAGGG 


3960 


AGCCGGAATA 


TAAAAGGCTC 


CTTCTTGTAT 


GAAGCTTGCC 


ATTCCTACAT 


ATCCTAAAAC 


4020 


AACTAGAAGA 


ACTATAGTCC 


CAACAACAAT 


GTAACTCCCA 


ATTTTCATTT 


TAGGAGAATC 


4080 


TTGGACTAAA 




AAATTGTGGC 


CACAAGTCCA 


AATCCAATCA 


GAAAAATAAG 


4140 


AACTTGCCCT 


AAAAATGTGA 


GCAAATTGAC 


TGTTAAGAGA 


GGACCTTTAG 


AAAAATCACT 


4200 


TAGTACTTGA 


TAATAACGTA 


ATACCGCCAG 


GACAAGAATT 


GGCGTCAAAA 


GGGACTCTTT 


4260 


GATAGAACTG 


CGAGGTGCTC 


CCTTGAGAAT 


CTCTTTCATT 


ATTTTTTTAG 


GATTCTTACC 


4320 


TAGATAATCC 


TCTGCACTCA 


TGCCATCTCG 


TTCTGCTTCT 


GAGAAATCTA 


GCATCATCAA 


4380 



WO 98/18931 



PCT/US97/19S88 



724 



ATAGATCTGC 


TCTCTGAGAT 


AGTCTTCATC 


ATAGAGAAAT 


CCAGCAAGAT 


TAAAACTTTC 


4440 


CCACAACTCC 


TCAAAATACT 


TTTGATTCTC 


CTCAGAAAAC 


TCATGTAGCA 


AAGCGCTTG T 


4500 


TTCTTCGTAA 


TACTTCATTT 


TCTTCATGGT 


TTAACCCCCA 


TTCTTAATCC 


CTTCTACTTT 


4560 


TTGACTCAAA 


TCGTCCCATT 


GTTGCCAAAA 


GACTGAGACA 


CGCTCTTCTC 


CTTCTTTCAT 


4620 


TAATCAAAAA 


TACTTCCGAT 


CTGGACCATC 


TGGCGACGGG 


CGCATGTCGC 


CTCTTATCCA 


4680 


TTGATTTTTT 


TCTAACTTTT 


GCAACAAAGG 


ATAAATAGTT 


CCTGGAACGA 


TAGTATCAAA 


4740 


TCCAGCCTCT 


CGCAAAGTCT 


GAACCAACTC 


ATAACCATAC 


CGCTCTTTTT 


GACCAATCAT 


4800 


ATCCAAGACA 


CAACCTTCAA 


GAACACCTTT 


TAATAGCTGA 


GTTTCTTTCA 


TCACTTCTCC 


4860 


CTTCTAATCT 


ATTTTGTAAT 


AC CT ACT ACT 


GACTTCACCT 


ATAGTATATC 


ACTTCTACAC 


4920 


TACTTTCTAA 


AGCATAATAG 


TTAATACTCT 


TCGAAAATCT 


CTTCAAACCA 


CGTCAGCGTC 


4980 


GCCCTACCGT 


ATGTATGGTT 


ACTGACTTCG 


TCAGTTTCAT 


CTACAACCTC 


AAAAACATGT 


5040 


TTTGAGCTGA 


CTTCCTCAGT 


TTCATCTACA 


ACCTCAAAAC 


AGTGTTTTGA 


GCTGACTTCG 


5100 


TCAGTTTCAT 


CTACAACCTC 


AAAACACTGT 


TTTGAGCTGA 


CTTCGTCAGT 


TTCATCTACA 


5160 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


TCAGTTTCGT 


CTACAACCTC 


AAAACAGTGT 


5220 


TTTGAGCAAC 


CTGCGGCTAG 


CTTCCTAGTT 


TGCTCT TTG A 


TTTTCATTGA 


CTATAAATAA 


5280 


AAAAACAGAA 


CTAGCCTGAA 


CTACTCCTGT 


CTACTTTTAC 


CCAATCACAC 


TTCCATTTGG 


5340 


TACAGCTGGA 


TCAACTGTGA 


GAAGGGTTAA 


TTTGCCATCA 


TCTTCAGCTC 


AGAGAATCAT 


5400 


ACCCTGGCTG 


ACATATTTTT 


TCATCATTTT 


ACGTGGTTYG 


AGGTTAGCAA 


CGATTTCAAC 


5460 


TTTCTTGCCG 


ACCAATTCTT 


GTTCATTTGG 


ATAGTATTTT 


GCAATTCCTG 


AAAGAATCTG 


SS20 


ACGATCTTCT 


CCATCACCAG 


CATCCAAGCG 


GAATTGAAGC 


AACTTATCTG 


AACCTTCTAC 


5580 


TTTAGACACT 


TCTTTGACTT 


CTGCGACACG 


GATTTCAACC 


TTCTCAAAGT 


CTTCAAACTT 


5640 


GATTTCATCC 


TTGTTTAGTT 


TGAGCTCAAC 


TTCGTCCGGA 


TTCCATTCTT 


TTTCGACTCC 


5700 


TGGTTTATTG 


CCTTCCATTT 


GTTCCTTGAT 


ATAGGCGATT 


TCTTCTTCCA 


TATTTAGACG 


5760 


TGGAAAGATA 


GGTGTTCCTT 


TGGCAACTAC 


AGTCACATCT 


GCTGGGAAGT 


CAGCCAAACT 


5820 


CAAGTTTTCA 


AGACTAGAAA 


CTTCTTCCAA 


ACCAAGTTGA 


GTCAAAACTG 


CACGACTAGT 


5880 


TTCCATCATA 


AATGGTTCAA 


TCAAGTGAGC 


AACTACACCA 


ATCCTGGCTC 


CCAAGTGGCT 


5940 


CATGACACTT 


GCCAATTGGT 


CACGAAGAGC 


TTCATCCTTG 


GCCAAGACCC 


ATGGTGCGGT 


6000 


CTCATCGATG 


TATTTATTGG 


TACGAGAGAT 


CAGAGTCCAG 


ACTGCTTCAA 


GCGCACGTGG 


6060 


ATAGTCAACT 


GCTTCCATGT 


GTGTATGGAA 


GTCTGCGATT 


GATTGTwCTG 


CAACCTCAGC 


6120 


AAGAACATGA 


TCATATTCAG 


TCACACCTTC 


TACATAGGCA 


GGGATTTGTC 


CATCAAAGTA 


6180 
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CTTATTAATC 


ATGGAAACCG 


IAV-VKjI Innu 


GAGGTTCCCA 


AGGTCATTAG 


CCAATTCATA 


6240 


CTTGATACGG 


CCGACATAGT 




AAAGGTTCCG 


TCTGAACCAA 


CTGGAAGGTT 


6300 


ACGGATGAGG 


tAGTAACGAA 


IfTWUA IV. 1 flu 


TCCATAACGC 


TCTACCAACA 


TTTCAGGGTA 


6360 


AACGACATTC 


CCTTTTGACT 


TAGACATm 




ATGACAAACC 


AACCATGGGC 


6420 


AATCAAACGA 


TCAGGTAATT 


I AALn I LUnn 


CATCATAAGA 


AGGATTGGCC 


AGTAGATAGA 


6480 


GTGGAAGCGA 


AGGATATCTT 


TTCCTAWCAT 


ATCfZAAGACT 


GTTCTCATT*^^ 


AGAACTTGTC 


6S40 


AAACTTACCA 


TGTTCGTCTT 


GAGCGTAGCC 


AA\*Al*w 4 v» 1 v. 


GCATAGTTAA 


GAAGGGCATC 


6600 


AATCCAAACG 


TAGACAACGT 


GTTTTGGATT 


TTlTflGGACA 


GGCACTCCCC 


ATGTAAAGGT 


6660 


TGTACGAGAT 


ACCCCCAAAT 


CTTCCAAGCC 


TwO^ AwV»A Iw 


AAGTTGCGTA 


GCATTTCATT 


6720 


AAGCCGACCA 


TCTGGCGTGA 


TAAATTCAGG 


ATGAK.1 i 




CCAAACX3GTC 


6780 


TTGGTATTTC 


CTAAGGCGAA 


GGAAGTATGA 


TTCTTCWiAA 


A^fTCATTCAA 


CCTCATGACC 


6840 


TCATGGAGCA 


ATACCACCAG 


TCACATTTCC 


AOCTTCATCA 






6900 


GCTTTCTGTA 


AAGAATTCTT 


CGTCTGATAC 


TGAATACUAA 


LLAunul Al i 


CACCCAAGTA 


6960 


GATATCATCT 


TGAGCAAGTA 


AGCGTTCAAA 


GACTTGTGC.W 


At—AAl. llill 


CATGGTAGTC 


7020 


ATCAGTTGTA 


CGGATAAATT 


TATCCTAtGA 


CAT A 1 *- A W» 1 


1ATTCCCACA 


GTTCTTTAAC 


7080 


TCCAACCGCC 


ATTCCATCAA 


CATAGGCTTG 


AuwTVi i AA 1 A 


WWAWW * A W ft * 


CCGCTTTCTG 


7140 


CTGGATTTTC 


TGACCATGTT 


CATCAAGACC 




#W\ A n*«n A WW * 


AGCCCAT C AG 


7200 


GCGTTTGTAA 


CGTGCTAGGA 


CATCACATGC 


GATAVai lulu 


TAGCCAGAAC 


CGATATGAAG 


7260 


TTTCCCAGAT 


GGATAGTAAA 


TCGGCGTTGT 


f\J\ I n A AAAAA 


TTTTTTTCAC 

A A A A * * • W« 


ACATAATTTT 


7320 


TCCTTTCCAG 


GCAAATGAAA 


CCTGTTTTTC 


^•AJVCACTTCA 
1 nnUnVi a * v>« 


TTATATCACA 


TTTTTAATGA 


7380 


ATTTCAATAG 


GGAAATCCAT 


AtnAAnnUln 


CATAGACGAG 


TGTCCATCTT 


GTTGATCTCA 


7440 


TTCATAACGA AGGGCTTCAA 


TTGGATCAAG 


TTTCGATGCC 




wuiAunL i^v. 


7500 


AAAAATCATA 


CCAACACTAG 


CCGAAACTGC 


AAGACTAAAT 


AGGGCGACTG 


CGATTGATAC 


7560 


TCCAACTTCT 


ATACCTTCTA 


TTAAACCTTG 


CAGTAACAAA 


CCTGCTAAGq 


CAGTTAAACC 


7620 


ACTTGCAATT 


GTCAACCCAA 


TTAAGCCACC 


TAACAAGGTC 


AAAATCATGG 


ATTCAATCAA 


7680 


AAACTGAATT 


AAAATATTGG 


CACGTGTTGC 


ACCCAAAGCC 


TTACGAAGAC 


CAATCTCACG 


7740 


AGTGCGCTCT 


GTCACCGAAA 


CCAGCATGAT 


GTTCATGACA 


CCAGTTCCTC 


CAACAAAGAG 


7800 


AGAAATCCCT 


GCGATGGAAC 


TAATAATCGT 


CCTCATAAAA 


CTAAACGATT 


GTTGAATTTC 


7860 


TGCAAATACA 


ACGGACTCAT 


CTGCCACCTG 


GTATTCTCCC 


TGTTGTAAGC 


CTGCAAGCTC 


7920 
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TGTCATTTTT 


CGTGCCAGTT 


CTGGACCCAG 


ACTTGGGGTT 


AAACTGGTAT 


CATTCACTCG 


7980 


AAAGACAATA 


TTAGCTATTT 


CATCTACATT 


AAAATTCGCA 


GCAAGGGAGA 


TATTGGTAGT 


8040 


AATAGGCAAC 


CCACCAAACC 


CATATATTTT 


TGATCTTTTA GCCTCCGGAC TAGTATAAAC 


8100 


CCCAATGACC 


CGGTAACTAA 


ATCCATTGAC 


TTCTACAACC 


TTGTTAATAG 


CCTCTTGAGG 


8160 


AGATTCAAAT 


AAACTAATGG 


ACAATTCCTC ATCTAGCAAA ATGACACTTG 


CAAACTCTTT 


8220 


GAAATCTTGC 


TCTCTCAGAC 


TACGACCTGC 


AATAATTTCA TTCTTAACAG 


CGTCCATGTA 


8280 


AGTTCTGTTT 


CCACCTGTCA 


AATTAGCATT 


CTCAACCTTT 


TTATCTTGAT 


AGGTCAAGAT 


8340 


CG C ATT CGTT 


GAATTGGTTA 


CATAGTAACT 


ATCCACTCCC 


TTCAGTTTAG 


CTGCCTCTTG 


8400 


GACCCAGGAT 


TCTTGCGGTT 


TTGGCGGTTC 


AACAGGAACT 


TCCTCTTCCT 


TTCCAGAAAC 


8460 


CGTAAAAGCT 


GATTGTTTCT 


GACTAAAAGA 


CCCGTCTTTA 


CTTTTTTTAG 


GAGAGAAAAA 


8520 


GACGCTAATA 


TTTTTCTGAG 


ATTTAGTCAT 


ATCTTTATTG 


ACTTGACGAG 


ATAGGGAATC 


8560 


ACCCAAAGCC 


ATAATCACAA 


CAACTGATGA 


AACACCGATA 


ATAATCCCAA 


TCATAGTAAG 


8640 


CAAAGAACGC 


ATCTTCTGAG 


CCATGATAGA 


TGAAAAGGCA 


AATTTCAGAT 


TCTGCATCTT 


8700 


AGTTTTCCTC 


CTTTCCTAAC 


TGAGCACTGT 


CAGACGAAAT 


GACCCCATCC 


CGAATGACAA 


8760 


TCTGACGTTT 


GGCATAGGCA 


GCAATCTCAC 


GCTCATGCGT 


TACCATGATA 


ATGGTTTTTC 


8820 


CTTCTTTATT 


CAAATCAACC 


AATAATTGCA 


TAATTTGGTT 


ACCTGTTTTG 


GTATCCAAGG 


6680 


CTCCTCTCGG 


TTCATCCGCT 


AGGATAATAG 


AAGGATTGTT 


TACCAAGGCA 


CCCGCAATGG 


8940 


CTACACGTTG 


CTTTTGACCA 


CCAGATAATT 


CTGAAGGTAA 


ATGGTGACTA 


CGTTCTGTCA 


9000 


ATTCAACCTT 


GTCTAAATAT 


TCCTCAGCCA 


ACTTGCGACC 


TTTTGAAGAC 


CAAACTCCTG 


9060 


CGTAAATCAA 


CGGCAATTCT 


ACATTTTGCA 


GAGCATTGAG 


CTTCGATAGA AGAAAGAACT 


9120 


GCTGAAAGAC 


AAAACCGATT 


TGTTGGTTAC 


GGACCTTACC 


TAGTTCTTTT 


TCACCAAGCC 


9180 


CAGCCACTTC 


TTGACCTTCA 


AGATAATATT 


CTCCACTGGT 


TGGTGTATCC 


AACATGCCAA 


9240 


TCGTATTCAT 


CAGAGTGGAC 


TTACCAGACC 


CAGATGGTCC 


CATGATGGCT 


ACAAATTCAC 


9300 


CCTCATTCAC 


TTCTAGATTG ATATTTTTGA GAACCTCCAG TTCTTGGTCA 


CCATTACGGT 


9360 


AACTTCTGAA 


GATATTTTTT 


ACACTAATTA 


GTTGCTTCAT 


CAGCCTTCAC 


CTCTTTTCCT 


9420 


TCTTCCAAGG 


AAGATCTTGG 


ATTACTGATG 


ACCTTAGCAC 


CGTTCGTTAA 


ACCAGAAGTG 


9480 


ATTTCTTGAT 


TTTCTGCGTC 


AGCATTTCCC 


AATGAAACCT 


CAACTTTTTT 


AGCCTTTTGT 


9540 


TGTTCATCCA 


CAATCCAGAC 


ATAATTTTTA 


CTATCATCCA 


TTACTAGACT 


GCTAACAGGA 


9600 


ACAAGAATAG 


CCTTAGTTTT 


GCTTTTAACC 


TCAATGTTGA 


CAGAAAAACC 


TTGTTTCAAA 


9660 


TCACCAACCT 


CGCCTGTCAC 


ATCAATAGTA 


TAAGGGTATT 


TAGAACCTGT 


ATTATTCCCG 


9720 
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CCTCCTCGAC TACCTCCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 9780 

CCACTCCATT TTTTATCAGG ATACACTTTA GAAGTAAACC TTACTTCTTG ACCTACAGAA 9840 

ACGTTCGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 

ATATGAACCA TAACTTGACT CGCCCCTGTT GGAGATTTAG AAACATTGCT ATTCACTTCG 9960 

ACCACAGTTC CCTCTAGGGT ACTGACAACA GTTGTTGCAT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCCCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 10140 

ACTCGCCCTG GTAACTGTGG AGCCGGACCT GAAGCGGCTT CATTTCCTGC TTGATTGAGT 10200 

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 10260 

GCTTCT G AAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 10320 

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 10380 

GCTCTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT CGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGAGAAGGTT GTCTAAAGAG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

CCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCCGAATTT 10680 

TGCTTTTCGG 10690 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs 
. (B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGC TAAGCGGCCA 60 

GGGTCCGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 120 

TACTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTCTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC CTTATCCCCT GATTCTGTGG ATAACCGTAT 300 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC CCAGCGACTC 360 



SR1R931A2 i > 



WO 98/18931 



; PCT/US97/19588 



728 



AGTGACCGAG 


GAAGCGCAAG 


AGCGCCCAAT 


ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


420 


GATTCATTAA 


TGCAGCTGGC 


ACGACAGGTT 


TCCCGACTGG 


AAAGCGGGCA 


CTGAGCGCAA 


480 


CGCAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


GCCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


540 


GGCTCGTATG 


TTGTGTGGAA 


TTCTGAGCGG 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


600 


CaTGATTACG 


AATTCGAGCT 


CGGTACCCGG 


AAAATCCAGA 


AAATGCTTGA 


AAAAAATCCT 


660 


AGAAGATGGT 


ATAATACTAA 


ATTGTAAGGG 


TTATCACATA 


TAACTCAAAA 


AAAGAAAGAA 


720 


CAAAAGGAGA 


CTCAAACTAT 


GGCTTCTAAA 


GATTTCCACG 


TAGTGGCAGA 


AACAGGTATT 


780 


CACGCACGTC 


CAGCAACATT 


GTTGGTACAA 


ACTGCTAGCA 


AATTTGCTTC 


AGATATCACT 


840 


CTTGAGTACA 


AAGGTAAATC 


AGTTAACCTT 


AAATCAATTA 


TGGGTGTTAT 


GAGTCTTGGT 


900 


GTTGGCCAAG 


GTGCTGACGT 


AACTATCTCA 


GCTGAAGGTG 


CAGATGCAGA 


TGACGCTATC 


960 


GCTGCAATCT 


CAGAAACAAT 


GGAAAAAGAA 


GGATTGGCAT 


AAGGGAAATG 


ACAGAAATGC 


X020 


TTAAAGGAAT 


CGCAGCATCT 


GACGGTGTTG 


CAGTTGCAAA 


AGCATATCTA 


CTCGTTCAGC 


1080 


CGGATTTGTC 


ATTTGAGACT 


ATTACAGTCG 


AAGATACAAA 


CGCAGAAGAA 


GCTCGCCTTG 


1140 


ATGCCGCTCT 


ACAGGCATCA 


CAAGACGAGC 


TTTCTGTTAT 


TCGCGAGAAA 


CCAGTAGGTA 


1200 


CGCTCGGTGA 


AGAAGCAGCT 


CAAGTTTTTG 


ATGCTCACTT 


AATGGTTCTT 


GCTGACCCAG 


1260 


AAATGATCAG 


CCAAATCAAG 


GAAACTATCC 


GTGCGAAGAA 


AGTGAATGCA 


GAAGCAGGTC 


1320 


TGAAAGAAGT 


TACAGATATG 


TTTATCACTA 


TCTTTGAAGG 


CATGGAAGAC 


AACCCATACA 


1380 


TGCAAGAACG 


CGCAGcGGAT 


WTCCGCGACG 


TGACAAAACG 


TCTATTCGCA 


AACCTTCTTG 


1440 


GTAAAAAATT 


GCCAAACCCA 


GCTTCTATCA 


ATGAAGAAGT 


GATTGTGATT 


GCGCATGACT 


1S00 


TGACTCCTTC 


AGATACAGCT 


CAATTGGACA 


AAAACTTTGT 


AAAACCTTTT 


GTAACCAACA 


1S60 


TTGGTGGACG 


TACAAGCCAC 


TCAGCTATCA 


TGGCACCTAC 


ACTTGAAATT 


GCTGCTGTAT 


1620 


TAGGTACAAA 


TAACATCACT 


GAAATCGTTA 


AAGACGGTGA 


CATCCTTGCT 


GTTAACGGGA 


1680 


TCACTGGAGA 


AGTGATTATC 


AACCCAACAG 


ATGAACAACC 


GGCAGAATTT 


AAAGCAGCTG 


1740 


GTGAAGCCTA 


TGCGAAACAA 


AAAGCTGAAT 


GGGCACTTTT 


CAAAGATGCT 


CAAACAGTGA 


1800 


CTGCTGACGG 


TAAACACTTC 


GAGTTGGCTG 


CTAATATCGG 


TACTCCAAAA 


GACGTTGAAG 


1860 


GTGTTAACAA 


CAACGCTCCA 


GAAGCTGTTG 


GACTTTACCC 


TACAGAGTTC 


TTGTACATGG 


1920 


ATTCTCAACA 


CTTCCCAACT 


GAAGATGAGC 


AGTATGAAGC 


ATACAAGGCT 


GTTCTTGAAG 


1980 


GAATGAACGG 


TAAACCTGTT 




CAATGGATAT 


CGGTGGAGAT 


AAGGAACTTC 


2040 


CTTACTTCGA 


TATGCCTCAC 


GAAATGAACC 


CATTCCTTGG 


ATTCCGTGCT 


C TTCGT ATCT 


2100 


CTATCTCTGA 


GACTGGAGAT 


GCTATGTTCC 


GCACACAAAT 


CCGTGCTCTT 


CTTCGTGCGT 


2160 
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CTGTTCACGG 


TCAATTGCGT 


ATCATGTTCC 


CAATGGTTGC 


GCTCTTGAAA 


GAATTCCGTG 


2220 


CAGCGAAAGC 


AGTCTTTGAT 


GAAGAAAAAG 


CAAACCTTCT 


TGCTGAAGCT 


GTTGCAGTTG 


22B0 


CGGATAACAT 


CCAAGTTGCT 


ATCATGATCG 


AGATTCCTGC 


AGCGGCTATG 


CTTGCAGACC 


2340 


AATTTGCTAA AGAAGTTGAC TTCTTCTCAA 


TTGCTACAAA 


CGACTTGATC 


CAATATACAA 


2400 


TGGCAGCAGA 


CCGTATGAAC 


r.AArAAGTTT 


CATACCTTTA 


CCAACCATAC 


AACCCATCAA 


2460 


TCCTACCCTT 


GATTAACAAT 


U J 1 ^«/W*\* 


CAGCTCACGC 


TGAAGGTAAA 


TGGGCTGGTA 


2520 


TGTGTGGTCA GATGGCTGGT 




^ 1U 1 * V» w/*«— « 


TCTTGTCGGA 


ATGCGCTTGC 


2380 


ATGAGTTCTC 


TATGTCAGCA 


& f AfV'TVST Af* 


TTCGTACACG 


CAGCTTGATG 


AAGAAACTCG 


2640 


ACACAGCTAA GATGGAAGAG 






AGAATGCTCA 


ACAATGGAAG 


2700 


AAGTTCTTGA ACTTCAAAAA 




A 111 I V*A 1 A ** 


ATCGAAAACT 


CCCTGCAACT 


2760 


CAGTTACAGG 


GATi"ri"rnu 


ATAi 1 1 IAAA 


AAUAAl 1 lit 


AACAAAATCT 


TTCTTATAGA 


2820 


AAGTCCAACC 


TTGAAAAAGT 


AuTuu 1^.AV»*% 


ALAAAAAniA 


CTTAAATGGT 


TCATAAAATT 


2880 


CTTGACAAGT 


TCGATATTTA 


ViUAu 1 nAnW 1 


AllAALLAVi 1 


TAAGTAATAG 


AGAGGAGTTT 


2940 


CTGCAATTTA 


GAAATGAATT 




Aln I *\ 


GAAAGAGAGT 


TTCGATGAAA 


3000 


ATTAATAACA 


AATACCTTGT 




111 i 


TTTAACTCTT 


TGTTCTTACG 


3060 


AGTTGGGACT 


GTATCAACCT 


AuAnwuu 1 I #\ 


A(mAAAA1AA 


1 w«« * » • * * 


TATATAGATG 


3120 


GAAAACAAGC 


GACGCAAAAA 






TGAGGTTAGC 


AAGCGTGAAG 


3180 


GAATCAATGC 


TGAGCAAATC 


GTCATCAAGA 


TAACACACCA 


AGGCTATGTC 


ACTTCACATG 


3240 


GCGACCACTA 


TCATTATTAC 


AATGGTAAGG 


TTrPTPATCA 


CGCTATCATC 


AGTGAAGAAT 


3300 


TACTCATCAA 


AGATCCAAAC 


TATAAGCTAA 


AAGATGAGGA 


TATTGTTAAT 


GAGGTCAAGG 


3360 


GTGGATATGT 


TATCAAGGTA 


GATGGAAAAT 


ACTATGTTTA 


CCTTAAGGAT 


CCTGCCCACG 


3420 


CGGATAACCT 


CCGTACAAAA 


GAGGAAATCA 


« Jl__ 1. _ h k ft m )L 

ATCGACAAAA 


ACAACnbUi 1 


ACTCAACATC 


3480 


GTGAAGGTGG 


AACTCCAAGA 


AACGATGGTG 


CTGTTGCCTT 


GGCACGTTCG 


CAAGGACGCT 


3S40 


ATACTACAGA 


TGATGGTTAT 


ATCTTTAATG 


CTTCTGATAT 


CATAGAGGAT 


ACTGGTGATG 


3600 


CTTATATCGT 


TCCTCATGGA 


GATCATTACC 


ATTACATTCC 


TAAGAATGAG 


TTATCAGCTA 


3660 


GCGAGTTGGC 


TGCTGCAGAA 


GCCTTCCTAT 


CTGCTCGAGG 


AAATCTGTCA 


AATTCAAGAA 


3720 


CCTATCGCCG 


ACAAAATAGC 


GATAACACTT 


CAAGAACAAA 


CTGGGTACCT 


TCTGTAAGCA 


3780 


ATCCAGGAAC 


TACAAATACT 


AACACAAGCA 


ACAACAGCAA 


CACTAACAGT 


CAAGCAAGTC 


3840 


AAAGTAATGA 


CATTGATAGT 


CTCTTGAAAC 


AGCTCTACAA 


ACTGCCTTTG 


AGTCAACGAC 


3900 
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ATGTAGAATC 


TGATGGCCTT 


GTCTTTGATC 


CAGCACAAAT 


CACAAGTCGA 


ACAGCTAGAG 


3960 


GTGTTGCAGT 


GCCACACGGA 


GATCATTACC 


ACTTCATCCC 


TTACTCTCAA 


ATGTCTGAAT 


4020 


TGGAAGAACG 


AATCCCTCGT 


ATTATTCCCC 


TTCGTTATCG 


TTCAAACCAT 


TGGGTACCAG 


4080 


ATTCAAGGCC ACAACAACCA ACTCCACAAC 


CGACTCCGGA 


ACCTAGTCCA 


GGCCCGCAAC 


4140 


CTCCACCAAA 


TCTTAAAATA 


GACTCAAATT 


CTTCTTTGGT 


TACTCAGCTG 


GTACGAAAAG 


4200 


TTGGGGAAGG 


ATATGTATTC 


GAAGAAAAGG 


CCATCTCTCG 


TTATGTCTTT GCGAAAGATT 


4260 


TACCATCTGA 


AACTGTTAAA 


AATCTTCAAA 


GCAAGTTATC 


AAAACAAGAG 


AGTGTTTCAC 


4320 


ACACTTTAAC 


TGCTAAAAAA 


GAAAATGTTG 


CTCCTCGTGA 


CCAAGAATTT 


TATGATAAAG 


4380 


CATATAATCT 


GTTAACTCAG 


GCTCATAAAG 


CCTTGTTTGA 


AAATAAGGGT 


CGTAATTCTG 


4440 


ATTTCCAAGC 


CTTAGACAAA 


TTATTAGAAC 


GCTTGAATGA 


TGAATCGACT 


AATAAAGAAA 


4500 


AATTGGTACA TGATTTATTG CCATTCCTAG CACCAATTAC 


CCATCCAGAG 


CGACTTGGCA 


4560 


AACCAAATTC 


TCAAATTGAG 


TATACTGAAG 


ACGAACTTCG 


TATTGCTCAA 


TTAGCTGATA 


4620 


AGTATACAAC 


GTCAGATGGT 


TACATTTTTG 


ATGAACATGA 


TATAATCACT 


GATGAAGGAG 


4680 


ATGCATATGT 


AACGCCT CAT 


ATGGGCCATA 


GTCACTGGAT 


TGGAAAAGAT 


AGCCTTTCTG 


4740 


ATAAGGAAAA 


AGTTGCAGCT 


CAAGCCTATA 


CTAAACAAAA 


AGGTATCCTA 


CCTCCATCTC 


4800 


CAGACGCAGA 


TGTTAAAGCA 


AATCCAACTG 


GAGATAGTGC 


ACCAGCTATT 


TACAATCGTG 


4860 


TGAAAGGGGA 


AAAACGAATT 


CCACTCGTTC 


GACTTCCATA 


TATGGTTGAG 


CATACAGTTG 


4920 


AGGTTAAAAA 


CGGTAATTTG 


ATTATTCCTC 


ATAAGGATCA 


TTACCATAAT 


ATTAAATTTG 


4980 


CTTGGTTTGA 


TGATCACACA 


TACAAAGCTC 


CAAATGGCTA 


TACCTTGGAA GATTTGTTTG 


5040 


CGACGATTAA 


GTACTACGTA 


GAACACCCTG 


ACGAACGTCC 


ACATTCTAAT 


GATGGATGGG 


5100 


GCAATGCCAG 


TGAGCATGTG 


TTAGGCAAGA AAGACCACAG TGAAGATCCA 


AATAAGAACT 


5160 


TCAAAGCGGA 


TGAAGAGCCA 


GTAGAGGAAA CACCTCCTGA 


GCCAGAAGTC 


CCTCAAGTAG 


5220 


AGACTGAAAA 


AGTAGAAGCC 


CAACTCAAAG 


AAGCAGAAGT 


TTTGCTTGCG 


AAAGTAACGG 


S280 


ATTCTAGTCT 


GAAAGCCAAT 


GCAACAGAAA 


CTCTAGCTGG 


TTTACGAAAT 


AATTTGACTC 


5340 


TTCAAATTAT 


GGATAACAAT 


AGTATCATGG 


CAGAAGCAGA 


AAAATTACTT 


GCGTTGTTAA 


5400 


AAGGAAGTAA 


TCCTTCATCT 


GTAAGTAAGG AAAAAATAAA 


CTAATGAAAA 


ATGAAAGTCT 


54 60 


CGATAAAGAG 


GCTTTCATTT 


TTATTATGTA TATATCTAAA 


ATTCTTGACA AGCAATATTA 


5520 


AAAAGACTAA 


ACTATTAACT 


AGTTAATTAA 


CCGGTTTATT 


ACTTTATAGT 


CAATCAAATA 


S580 


TACTTAAGAA 


AAGAGGAAAG 


AATGAAAATT 


AATAAAAAAT 


ATCTAGCAGG 


TTCAGTGGCA 


5640 


GTCCTTGCCC 


TAAGTGTTTG TTCCTATGAA CTTGGTCGTC 


ACCAAGCTCG 


TCAGGTTAAG 


5700 
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AAAGAGTCTA 


ATCGACTTIcC TTATATAGAT GGTGATCAGG 


CTCGTCAAAA 

^ 4> UVJ • \*rwm mm mm m 


CGCAGAAAAC 


5760 


TTGACACCAG 


ATGAAGTCAG 


TAAGAGGGAG 


GGGATCAACG 


r*£°G AACAAAT 


CGTCATCAAG 


5820 


ATTACGGATC 


AACGTTATGT 


GACCTCTCAT 


GGAGACCATT 


ATf ATT ACT A 


TAATGGCAAG 


S680 


GTCCCTTATG 


ATGCCATCAT 


CAGTGAAGAG 


CTCCTCATGA 


A An ATfTfiAA 

AAOA i^aVaOnA 


TTATCAGTW3 


5940 


AAGGATTCAG 


ArATTCTCAA 


TGAAATCAAG 


GGTGGTTATG 




AGATGGAAAA 


6000 


TACTATGTTT 




TGCAGCTCAT 


GCGGATAATA 




An A AH AO ATT 




AAACGTCAGA 


AGfACCAACA 


CAGTCATAAT 


CACGGGGGTG 






til 50 


GTTGCAGCCA 




ACGCTATACA 


ACGGATGATG 


GTTATATCTT 




6180 


GATATCATTG 




TGATCCTTAT 


ATCGTTCCTC 


ft f*ff*r*f* ft f*f* ft 
ACGGCCAVt-A 






ATTCCTAAGA 




AGCTAGC GAG 


TTAGCTGCTG 


CAGAAOCCTA 




DJUU 


AACCAGGGAT 


CTCGTCCTTC 


1 1 l_nA\J 1 1 V> L 


AGTTATAATC 


CAAATCCAGC 




D J OU 


TTGTCAGAGA 






CC AACTTATC 


ATCAAAATCA 


AGGGGAAAAC 


D42U 


ATTTCAAGCC 


tttTACGTGA 


ATTCTATGCT 


AAACCCTTAT 


CAGAACGCCA 


TGTGGAATCT 


04 Oil 


GATGGCCTTA 


1 L I 1 LonUUV. 


AGCGC AAATC 


ACAACTCGAA 


CCGCCAGAGG 


lb I AOL 1 vj 1 v. 


O jIU 


CCTCATGGTA 




WA 1 i A V» V» *»» * 


TATGAACAAA 


TGTCTGAATT 


GGAAAAACGA 


BQUU 


ATTGCTCGTA 




TPGTTATCGT 


TCAAACCATT 


GGGTACCAGA 


TTvAAOACCA 


cccn 

ODOU 


GAACAACCAA 




GACTCCGCAA 


CCTACTCCAA 


GTCCGCAACW 




6720 


CCTCAACCAG 




TCCAATTGAT 


GAGAAATTGG 


TCAAAGAAGC 


TO l is. uA/wt 


6780 


GTAGGCGATG 


GTTATGTCTT 


TGAGGAGAAT 


GGAGTTTCTC 


CTTATATCCv. 




6840 


CTTTCACCAG 


AAACAGCAGC 


AGGCATTGAT 


ACCAAACTGG 


r*r*b. ft m** Ann A 


A Ai^^n^^^P A^Pl^^n 


6900 


CATAAGCTAG 


GAGCTAAGAA 


AACTGACCTC 


CCATCTAGTG 




TT A PAATAAC 


6960 


GCTTATGACT 


TACTAGCAAG 


AATTCACCAA 


GATTTACTTG 


ATAATAAAGG 


TCGACAAGTT 


7020 


GATTTTGAGG 


CTTTGGATAA 


CCTGTTGGAA 


CGACTCAAGG 


ATGTCyCAAG 


TGATAAAGTC 


7080 


AACTTAGTGG 


ATGATATTCT 


TGCCTTCTTA 


GCTCCCATTC 


GTCATCCAGA 


ACGTTTAGGA 


7140 


AAACCAAATG 


CGCAAATTAC 


CTACACTCAT 


GATGAGATTC 


AAGTAGCCAA 


GTTGGCAGGC 


7200 


AAGTACACAA 


CAGAAGACGG 


TT AT AT CTTT 


GATCCTCGTG 


ATATAACCAG 


TGATGAGGGG 


7260 


GATGCCTATG 


TAACTCCACA 


TATGACCCAT 


AGCCACTGGA 


TTAAAAAAGA 


TACTTTGTCT 


7320 


GAAGCTGAGA 


GAGCGGCAGC 


CCACCCTTAT 


GCTAAAGAGA 


AAGGTTTGAC 


CCCTCCTTCG 


7380 


ACAGACCATC 


AGGATTCAGG 


AAATACTGAG 


GCAAAAGGAG 


CAGAAGCTAT 


CTACAACCGC 


7440 
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GTCAAAGCAC CTAAGAAGGT GCCACTTGAT CCTATGCCTT ACAATCTTCA ATATACTCTA 7500 

CAACTCAAAA ACGCTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7S60 

CAGTGCTTTG ACCAACCCCT TTATCAGGCA CCTAAGCCGT ATACTCTTGA GCATCTTTTG 7620 

CCGACTGTCA AGTACTATCT CCAACATCCA AACGAACGTC CGCATTCAGA TAATGCTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AACACAGTAA ACCTGATCAA 7740 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTCATGA AAAAGAGAAT 7800 

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACACTA CTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CCTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 8195 
(2) INFORMATION FOR SBQ ID NO: 95: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND E ONES S : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



TTTACTAAAA 


GGAAAAAAGA 


ACTGATTTCT 


CAGTCCTTCA 


TTAATCTTAT 


TCCACACTAA 


60 


ATAGGTATGG 


GTAAACAGGT 


TCTTGACCTT 


GGTGAATCTC 


GACTTCAACG 


TCTTCGAATT 


120 


CTTCTACGAT 


TTCTTGAGCG 


ATTTCATTGG 


CAAGTTCTTC 


GCTTCCGTCT 


TCACCTACAT 


180 


AGAACGTTAC 


GATTTCACTG 


TCTTCATCCA 


ACATATGTTT 


CAACCTTTCA 


GTCAATG1 IT 


240 


GGTGCATATC 


AGGGTTTGAC 


ACAAGAATTT 


TTCCATCCAC 


CATACCTAAA 


TTATCGTTTT 


300 


CATGGATTTC 


TAAGCCATCG 


ATCGTTGTAT 


CACGCACGGC 


TGTTCTGACG 


CTTCCGCTAA 


360 


CGACATCGCT 


AAGAGCAGCT 


GTCATACGCT 




TTCAATGGAC 


TTGCTTGGAT 


420 


CAAAGGCAAG 


AAGACTTGTC 


ATACCTTGAG 


GAAGAGTGCG 


AGCCTCTACC 


ACTACCGCTG 


480 


GTTGCTCCAA 


AACTTCTGCC 


CCAGATTGAG 


CTGCCATGAA 


GATGTTCTTG 


TTGTTTGGCA 


540 


AGAAGATGAT 


GTTACGGGCA 


TTAACCTGTT 


CAACAGCCTT 


GATAAAGTCT 


TCTGTTCAAG 


600 


GGTTCATGGT 


TTGACCGCCT 


TCGATAACAT 


AATCCACGCC 


TTGAGAACAG 


AAGATATCTG 


660 
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CTAGACCTTT 


ACCAGCCACC 


ACAGCAATCA 


AAGCATACTC 


TTTTTCTTCA 


GCCGACTTGA 


720 




AGCTTCTTTC 


TCAACCTGTG 




GTTACGCATA 


TTGTCAACTT 


780 


TAACTTCACT 


cttcctgttg 


TTACCTTGAC 


CAAGCTACCA 


TATTTGAGAC 


CTTCTTGCAT 


AACAAGTCCT 


GGATCTTCTG 


840 


TATGAACATC 


GACTTTGACA 


ATTTCATCAT 


CGTTAACAAC 


AAGGAGAGAA 


TCTCCAAGCT 


900 


CATCCAAGTA 


GTTACGGAAT 


TCATCGTAGT 


CAAAATCTTT 


AGCATAGCTT 


GGACCTTGCT 


960 


TAAGAGCTAC 


CATGATTTCA 


GTACACTAAC 


CAAACGTGAT 


GTCCTCAGTC 


CCTACGTGAC 


1020 


CAGCTACAGA 


CTTATGATGC 


TCTACATTGA 


TCATCTCACT 


CATGTTGGCA 


GGAGTCGCTA 


1080 


CAAACTCCTC 


AGATGCAATA 


TATTCGCCAG 


TAAGGGCTGA 


AAGGAAACCT 


TCGTAGATGA 


1140 


AGACCAATCC 


TTCACCACCT 


GAGTCCACAA 


CGCCAACTTC 


TTTCAATACT 


GGAAGCATGT 


1200 


CTGGTGTTTT 


AGCTAGAGCT 


GTTTTAGCAC 


CTTCCAAGGC 


TGCGCGCATG 


ACTTCAACAG 


1260 


CGTCATCTGT 


TTCCTCAGCT 


TTTTTCTTAG 


CACCGATAGC 


AGCTCCACGA 


CAAACTCTTA 


1320 


AAATCGTTCC 


TTCAACAGCT 


TTCATCACTG 
CCTGTTAACT 


CCTTATAGGC 


AACTTCCACA 
CTTGATAGCT 


CCTGATTGGA 
TGCCAAAATC 


1380 
1440 


AGGCCAGAGC 


CAACTCTTGA 


CGTCTTTATC 




CACGGAAAAG 


CTGAGACGTA 


ATCACTCCTG 


AGTTCCCACG 


CGCACCCATC 


AAAAGCCCTT 


1500 


TGGCAAGAAT 


GCTCGCTACT 


TCTCCAACTG 


TAGAAGCTGG 


CTT GTCTGC A 


ACTTCTTTAG 


1560 


CACCATTTTC 


AATGGTCATT 


CCCATATTTG 


TCCCACTATC 


TCCATCTGGA 


ACTGGAAAGA 


1620 


CGTTTAATGA 


ATTCACATAT 


TCAGCTTGCT 


TATTCAAGCG 


AGTTGATGCA 


GCCTGCACCA 


1680 


TTTCTTGAAA 


TAAGCTAGTA 


GTAATTTTTG 


ACACGGTTAT 


TCTCCTACAA 


CTTTGATATT 


1740 


TTGAATGTAG 


ACATTTACAG 


TCTGAGCAGT 


AATTCCAAGC 


TGGTTTTCCA 


AGCTAAAGGC 


1800 


AACACGCTCT 


TGAATGTTTT 


TTGACACTTC 


ACTAATCTTT 


GTTCCGTAGC 


TTAACACGGT 


1860 


ATATACATCA 


ACTGCAATAC 


TGCCATCTTC 


GGCTGCCTTT 


ACGACGACAC 


CTTTAGAATA 


1920 


ATTTTCCTTA 


CCTACCAGGG 


CTTGGAAATT 


ATCTTTGAGG 


GCATTTTTAC 


TACCCATACC 


1980 


GACCACACCA 


GAAATCTCAG 


TTGC 








2004 


12) INFORMATION FOR SEQ ID NO: 96: 









U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11915' base * pairs 
<B) TYPE : nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



owcrwin. -wr» 001001149 i «. 
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CCGGGTTGGG CTCTTCCCCC ATTAAAGCCG CACCACAGCT CGCTTCAGAA CCTCGTGAGA 60 

CAGTTCGGTC CCTATCCCTC GCGCGCCTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 120 

GAGGACCAGA GTGGACTTAC CCCTCCTCTA CCAGTTGTCT TCCCAAAGGC ATCGCTGGGT 180 

AGCTATGTAG GGAAGGGATA AACGCTGAAA GCATCTAAGT CTCAAACCCA CCTCAAGATG 240 

AGATTTCCCA TGATTATATA TCAGTAAGAG CCCTGAGAGA TGATCAGGTA GATAGGTTAG 300 

AACTGGAAGT GTGGCGACAC ATCTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 360 

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTC AATAGATATT CAATTTTGAG 420 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGCAGAT ACACCTGTAC CCATGCCGAA 480 

CACAGAAGTT AAGCCCTAGA ACGCCGGAAC TAGTTGGGGG TTCCCCCCTG TGAGATAGGG 540 

AACTCGCTTA GCTCTAGGGA GTTTAGCTCA GCTGGGACAG CATCTGCCTT ACAAGCAGAG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTTACCGGGT GTACTTTACT GGTAAAACTA 660 

CAGCCTTCCA AGCTGTTCTC GCGAGTTCGA TTCTCCTCAC CCGCTTTGAA CTTTGTTCTT 720 

TGTACCAACT TTTTGACTTG GGCGCGTAGC TCAGGTGCTT AGAGCCCACG CCTGATAAGC 780 

GTGAGGTCGG TGCTTCCAGT CCACTCGTGC CCATACTGTT TAGTCCATTA CTAGGGGATT 840 

GGAATATTAT CTGTTCACTA AGAGGACACG GCCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAG TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

CTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 1020 

CCTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGCT 1080 

AGCTCAGTTG GTAGAGCAAT GCATTGAAGC TCCATGTGTC GGCGGTTCGA TTCCCTCTCC 1140 

CGCCATTTAT ATATTTTGGA AGGGTAGCGA AGAGGCTAAA CGCGGCGGAC TGTAAATCCG 1200 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACCGCCAT AGTTTAAAGG 1260 

TAGAACTAAG CTCTCCAAAA CCTTCAGTGT GGGTTCAATP CCTACTGCCC GTCTTAATAG 1320 

AATTATGGCG GGTGTCGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 1380 

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGTATAG CCAACCGGTA AGCCAAGGGA 1440 

CTTTGACTCC CTCATGCGTT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

CCGGAATTGG CACACGCCCT GGACTCAAAA TCCAGTCTCC GCAAGGACGT GCCGCTTCGA 1560 

CCCCGGCCGC CCCTATAGTA TAGTGTTAGG AAC G TT G TTA TTCTTCGTTC CTTTTTTATA 1620 

TTATTTTTGG TATAATTATA GTTATTCAAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 1680 

TATCTCTTCT TCTATCGATT TATTAAAACA TCGGTATTTG AAAAATATTA AAGAAAATCC 1740 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTTAGAAG GGGATGCTAC 1800 
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AGATGTTGAA 


GTTATGAAGG 


m MAfn ft hmmi^ ft 

ATCTATTTCA 


TTATTTAGTT 


TCTACTTTGG 


ATCTCACCGT 


1 B£A 
iOOU 


AGCAAAGGTA 


GATGATTTTG 


GCAATCTGAT 


CCAGTTAGTA 


GATCCGATAA 


GTCAGGATGC 


1920 


TATTTTATTT 


GAAGTTTCCT 


ATACAACGAT 


TGAGTTTGCA 


TTTCGTAAGG 


ft ft m ftM* 

CTGAAACGAT 


1980 


TCAAG AGGTC 


GAAAATCGTT 


TCAATAATTA 


TATGAATGTA 


ATTCAGAGAA 


AGTTAGCTGA 


2040 


ATCAAATCAT 


GCTATTGTTG 


GCTGTGGTAT 


CCATCCCAAC 


TGGGATAAAA 


mjvw*fe^*m ft«M*h#% 

ATGAGAATTG 


2100 


TCCAGTGGCT 


TATCCACGCT 


ATCAGATGTT 


GATGGATTAT 


TTGAATTTCA 


GTAGAAATAT 


2160 


TATTAAATCA 


GATTTACATC 


ATTTCCCTGA 


ATATCCTACT 


TTTATCTGTG 


GGAGCCAGGT 


2220 


TCAGCTGGAT 


ATTTCAAAAA 


CCAACTACTT 


ACGGGTGATT 


AATGCTTTTA 


CTCAAATTGA 


2280 


ACCGGCTAAG 


GCTTATTT AT 


TTGCAAACTC 


TGAATTTTCG 


GGTGCGGATT 


GGGATACGAA 


2340 


AATTTCAAGC 


GATATTTTCT 


GGGAAGAATC 


TATGCATCGT 


ATCTATCCAG 


AGAATGTTGG 


2400 


GGTCAATGCT 


AGACTCCTTA 


ATGATGAAAC 


TGATTTriTT 


GACTATCTAA 


ATCATTCTGC 


2460 


GATTTTTACT 


GCGGAACGTG 


ATGGGCAGAC 


CTATTATTTT 


TATCCTATTC 


AGGCTGGGGA 


2520 


CTATTTGGCT 


ACGTCCGAAA 


TCCAAGCATT 


TGCTCTGAAT 


GGGGATGAGG 


TTATTATTTA 


2S80 


CCCCCAAGAG 


AAGGATTTTG 


AAACTCATCG 


TAGTTACCAG 


TACCAAGATT 


TAACCACTCC 


2640 


ACGAACAGTT 


GAGTTTCGTA 


GTGTGTGTAC 


ACAGCCACTT 


GATAGGACTT 


TTGCTTCTGC 


2700 


AGCTTTTCAC 


TTGGGATTAT 


TGCTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


TAGAAACAGC 


2760 


ACCTTTCTTT 


AAAGTATTTG 


GTTATGATTA 


CAAGTCTTTA 


AGGAGACAAT 


TTTCTAAGAA 


2820 


AAATCTTACA 


GATGAGGAAG 


AAACTACGAT 


TATTGAATTT 


TCCAAAGACT 


TACTCCTACT 


2880 


AGCTGAGGAG 


GGACTAGTGG 


TGAGAAATAA 


GGAAGAAATG 


ACCTATTTAC 


AGCCTTTGAG 


2940 


AGAAGAATTG 


AGCCTATAAT 


TTCTCTTATA 


AAGGGAGAAT 


TiTCTGAAAA 


ft ft ft ft Mt 

ATCATGATAT 


3000 


AATGGACGAG 


ACTATAGATA 


AAGGATAGAG 


AGTAATGACA 


TTACTTTATC 


ft ft OT^^ ft ft ^+0+^+++ 

AATCAACGCG 


3060 


TGATGCCAAC 


AATACAGTAA 


CTGCCAGCCA 


AGCAATTTTG 


CAAGCTTTGG 


CGACGGACGG 


3120 




ACACCGGATA 


CTTATCCAAA 


CGTAGATTTG 


AACTTTGACA 


AATTGAAAGA 


3180 


TCCTTCTTAC 


CAGGAACTTG 


CTAAGCTACT 


TTTGTCAGCA 


TTTTTAGATG 


ACTTTACAGT 


3240 


TGAGGAGTTG G ACT ACTGTA 


TCAACAATGC 


CTACGATAGC 


AAATTTGATA 


CTCCACCTAT 


3300 


TGCACCATTA 


GTGAAATTAG 


ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGCCCT 


TGTCTATTTT 


GCCATACTTT 


ATGACGACTG 


CTCCTAAGAA 


3420 


ACATGGTTTG 


GAGAACAAGA 


TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATGGCG 


GGGTTTGCGA 


ATGTGCCTCG 


TACTGAGATT 


ATCGTCTTTT 


ATCCAAAGGA 


3540 
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TGGTGTCAGC 


AAGATTCAAG 


ACTTACAAAT 


GACCACTCAG 


ACTGGCGACA 


ATACTCATGT 


3600 


TATTGCTATT 


GATGGTAACT 


TTGACCATGC 


GCAAACAAAT 


GTGAAGCACA 


TGTTTAACGA 


3660 


CGTGGCTCTT 


CGTGAAAAAT 


TGACTACCAA 


CAACTTGCAA 


TTTTCATCAG 


CTAACTCTAT 


3720 


GAACATTGCT 


CGTCTGGTGC 


CACAAATTGT 


TTATTATGTT 


TATGCTTACG 


CTCAATTGGT 


3780 


TAAGACTGGT 


GAAATTGTAG 


CTGGTGAAAA 


GGTTAACTTC 


ACAGTACCAA 


CAGGAAACTT 


3840 


TCCAAATATC 


TTGGCTGCCT 


TTTATGCCAA 


ACAAATTGGT 


TTGCCACTTG 


CTAAATTAAT 


3900 


CTGTGCTTCA 


AATGACAACA 


ATGTTTTGAC 


AGACTTCTTT 


AAAACACGTG 


TCTATGACAA 


3960 


AAAACGTGAG 


TTTAAGGTAA 


CAACCAGCCC 


ATCTATGGAT 


ATCTTGCTAT 


CTTCAAACTT 


4020 


GGAGCGCTTG 


ATTTTCCATC 


TTTTGGGAAA 


TAATGCTGAA 


AAGACAACTG 


AACTTATGAA 


4080 


TGCCTTGAAC 


ACGCAAGGAC 


AATATAAGTT 


CACAGACTTT 


GATGCAGAGA 


TTTTGGACCT 


4140 


CTTTGCACCT 


GAATATGCGA 


CTGAGGAAGA 


AACGGCAGCA 


GAGATCAAGC 


GTCTTTGTGA 


4200 


GTTAGATTCT 


TATATCGAGG 


ACCCTCATAC 


AGCTGTTGCT 


TCACCAGTTT 


ATAAAAAATA 


4260 


CCAATCGGCC 


ACTGGAGATG 


TAACTAAGAC 


AGTGATTGCT 


TCAACAGCTA 


GTCCATACAA 


4320 


GTTCCCAGTA 


GTTGCAGTAG 


AAGCTGTAAC 


TGGAAAAGCA 


GGTTTAACAG 


ACTTTGAAGC 


4380 


CTTGGCTCAA 


TTACATGAAA 


TCTCACGCGT 


TGCAGTGCCA 


CCAGCACTTG 


ATGGGCTTGA 


4440 


AATAGCTCCA 


ATTCCTCACA 


AGACAACAGT 


GGCAGCTGCT 


GACATGCAAG 


CAGCGCTTGA 


4500 


GGCTTATTTA 


GGACTTTAAG 


ACAGAGGGAG 


CAAACTCGGT 


TGGGAAACCA 


ACTGAGTTTC 


4560 


TTTTCATCAG 


GAGGAGAGAT 


TGTTTAAGAA 


AAATAAAGAC 


ATTCTTAATA 


TTGCATTGCC 


4620 


AGCTATGGGT 


GAAAACTTTT 


TGCAGATGCT 


AATGGGAATG 


GTGGACAGTT 


ATTTGGTTGC 


4680 


TCATTTAGGA 


TTGATAGCTA 


TTTCAGCGGT 


TTCAGTAGCT 


GGTAATATTA 


TCACCATTTA 


4740 


TCAGGCGATT 


TTCATCCCTC 


TGGGACCTGC 


TATTTCCAGT 


GTTATTTCAA 


AAAGCATAGG 


4800 


GCAGAAAGAC 


CAGTCGAAGT 


TGCCCTATCA 


TGTGACTGAG 


GCCTTGAAGA 


TTACCTTACT 


4860 


ATTAAGTTTC 


CTTTTAGGAT 


TTTTGTCCAT 


CTTCGCTCGG 


AAAGAGATGA 


TAGGACTT**T 


4920 


GGGGACGGAG 


AGGGATGTAG 


CTGAGAGTGG 


TGCACTGTAT 


CTATCTTTCG 


TAGGCGGATC 


4980 


GATTGTTCTC 


TTAGGTTTAA 


TGACTAGTCT 


AGGAGCCTTG 


ATTCGTCCAA 


CGCATAATCC 


S040 


ACGTCTCCCT 






CAATGCCTTG 


/ATATTCTTT 


TTTCAAGTCT 


5100 


AGCTATTTTT 


GTTCTGGATA 


TGGGG AT AG C 


TGGTGTTGCT 


TGGGGGACAA 


TTGTGTCTCG 


5160 


TTTGGTTGCT 


CTTGTGATTT 


TCTGGTCACA 


ATTAAAACTG 


CCTTATCGGA 


AGCCAACTTT 


5220 


TGGTTTAGAT 


AAGGAACTGT 


TGACCTTGGC 


TTTACCAGCA 


GCTGGAGAGC 


GACTTATGAT 


5280 


GAGGGCTGGA 


GATGTAGTGA 


TCATTGCCTT 


GGTCCTTTCT 


TTTGGGACGG 


AGGCAGTTGC 


5340 
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TGGCAATGCA 


ATCGGAGAAG 


TCTTGACCCA 


GTTTAACTAT 


ATGCCTGCCT 


TTGGCGTCGC 




TACGGCAACG 


CTCATGCTGT 


TGGCCCGAGC 


AGTTGGAGAG 


GATGATTGGA 
ATGTTGCCCC 


TGTCCTTTAG 


3320 


TAGTTTGAGT 


AAACAAACCT 


TTTGGCTTTC 


TCTGTTCCTC 


TATATATGTC 


TTGGGTCTAC 


CATTAACTCA 


TCTCTATACG 


ACTGATTCTC 


TAGCGGTGGA 


c e o A 
3380 


GGCTAGTCTT 


CTAGTGACAC 


TGTTTTCACT 


ACTTGGGACC 


CCTATGACGA 






CATCTATACG 


GCAGTCTGGC 


AGGGATTAGG 


AAATGCACGC 


CTCCCTTTTT 


ATGCGACAAG 


5700 


TATAGGAATG 


TGGTGTATCC GCATTGGGAC AGGATATCTG 


ATGGGGATTG 


TGCTTGGTTG 


5760 


GGGCTTGCCT 


GGTATTTGGG 


CAGGGTCTCT 


CTTCGATAAT 


GGTTTTCGCT 


GGTTATTTCT 


5820 


ACGCTATCGT 


TACCAGCGCT 


ATATGAGCTT 


GAAAGGATAG 


CAAATGCAAA 


AAACAGCTTT 


5880 


TATTTGGGAT 


TTAGACGGGA 


CTTTATTGGA 


CTCTTACGAA 


GCGATTTTAT 


CAGGGATTGA 


5940 


GGAGACTTTT 


GCTCAGTTTT 


CTATTCCTTA 


TGATAAGGAG 


AAGGTGAGAG 


AGTTTATCTT 


6000 


CAAGTATTCC 


GTGCAAGATT 


TGCTTGTGCG 


GGTGGCAGAA 


GATAGAAATC 


TGGATGTTGA 


6060 


CGTGCTAAAT 


CAGGTCCCTG 


CCCAGACTCT 


GGCTGAGAAG 


AATGCTCAGG 


TAGTTTTGAT 


6120 


GCCAGGTGCG 


CGTGAGGTGC 


TAGCTT GGGC 


m & ***** Wk ft ft+^* ft 

AGACGAATCA 


GGAATTCAGC 


ACTTTATATA 


6180 


TACTCATAAG 


GGGAACAACG 


CTTTTACCAT 


TCTCAAGGAC 


TTGGGGGTGG 


AATCCTATTT 


6240 


TACAGAGATT 


TTAACCAGTC 


AGAGTGGCTT 


TGTGCGGAAG 


CCAAGTCCAG 


AAGCGGCTAC 


6300 


CTATCTCCTA 


GATAAGTATC 


AGTTGAATTC 


TCATAATACT 


TATTATATAG 


GGGATCGGAC 


6360 


TCTGGATGTG 


GAATTTGCCC 


AGAATACTGG 


GATTCAAAGC 


ATCAACTTTT 


TAGAGTCTAC 


6420 


TTATGAAGGG 


AATCACAGGA 


TTCAAGCGTT 


AGCAGATATT 


TCCCGTATTT 


TTGAGACTAA 


6480 


GTGATAAAAA 


GATTGTCTCA 


GTTTTGTGAC 


AGACACCTAA 


CAAACTATTT 


CAAGTAACCT 


6540 


AGTTTGTTAC 


AAGGAATAGA 


CAGTTCTGTT 


AAATAGGCCC 


GAGAGGGCTT 


I IUTACA 


££Aft 

OOUU 


TTTTTTGTGT 


TATGATAGAC 


AGGTACTCAT 


TTGAAACGAA 


TTTGAAAGAA 


TGAAGAAAAG 


6660 


AATGTTATTA 


GCGTCAACAG 


TAGCCTTGTC 


ATTTGCCCCA 


GTATTGGCAA 


CTCAAGCAGA 


6720 


AGAAGTTCTT 


TGGACTGCAC 


GTAGTGTTGA 


GCAAATCCAA 


AACGATTTGA 


CTAAAACGGA 


6780 


CAACAAAACA 
GGGTGTAGAT 


AGTTATACCG 
GTCACAGTGC 


TACAGTATGG 
TTGCGAATCT 


TGATACTTTG 
GAACAAAATC 


AGCACCATTG 
ACTAATATGG 


CAGAAGCCTT 
ACTTGATTTT 


6840 
6900 


CCCAGAAACT 


GTTTTGACAA 


CGACTGTCAA 


TGAAGCAGAA 


GAAGTAACAG 


AAGTTGAAAT 


6960 


CCAAACACCT 


CAACCAGACT 


CTAGTGAAGA 


AGTGACAACT 


GCGACAGCAC 


ATTTGACCAC 


7020 


TAATCAAGTG 


ACCCTTGATG 


ATCAAACTGT 


TCAGGTTGCA 


GACCTTTCTC 


AACCAATTGC 


7080 
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AGAAGTTACA 


AAGACAGTGA 


TTGCTTCTGA 


AGAACTGGCA 


CCATCTACGG 


GCACTTCTGT 


7140 


CCCAGAGGAG 


CAAACGACCG 


AAACAACTCC 


CCCAGTTGAA 


GAAGCAACTC 


CTCAGGAAAC 


7200 


GACTCCACCT 


GAGAAGCAGG 


AAACACAAGC 


AAGCCCTCAA 


GCTGCATCAG 


CAGTGGAAGT 


7260 


AACTACAACA 


AGTTCAGAAG 


CAAAAGAAGT 


AGCATCATCA 


AATGGAGCTA 


CAGCACCACT 


7320 


TTCTACTTAT 


CAACCAGAAG 


AGACGAAAAT 


AATTTCAACA 


ACTTACGAGG 


CTCCAGCTGC 


7380 


GCCCCATTAT 


GCTGGACTTC 


CAGTAGCAAA 


ATCTGAAAAT 


CCAGGTCTTC 


AACCACAAAC 


7440 


AGCTGCCTTT 


AAAGAAGAAA 


TTGCTAACTT 


CTTTGGCATT 


ACATCCTTTA 


CTGGTTATCC 


7500 


TCCAGGACAC 


AGTGGAGATC 


ACGGAAAAGG 


TTTGGCTATC 


GACTTTATGG 


TACCAGAACG 


7560 


TTCAGAATTA 


GGGGATAAGA 


TTGCGGAATA 


TGCTATTCAA 


AATATGGCCA 


GCCCTGCCAT 


7620 


TAGTTACATC 


ATCTGGAAAC 


AACGTTTCTA 


TGCTCCATTC 


GATAGCAAAT 


ATGGGCCAGC 


7680 


TAACACTTGG 


AACCCAATGC 


CAGACCGTGG 


TAGTGTGACA 


GAAAATCACT 


ATGATCACGT 


7740 


TCACGTTTCA 


ATGAATGGAT 


AAACCCGACT 


TGATAACATC 


ATTTTGACGA 


ATGAGATCPA 


7800 


GCTTTCGTCA 


TGGAAAGCGA 


TTCTCGTTCG 


TTTTTTCTTT 


GTCATACTCT 


TCGAAAATCT 


7660 


CTTCAAACCA 


CGTCAGTTTT 


ATCTGAAACT 


TCAAACCTGT 


GCTTTGAGCA 


ACCTGCGACT 


7920 


AGCTTCCTAG 


TTlttCTTTTT 


GATTTTCATT 


GAGTATCAAT 


TTGAATGGAA 


AATGGAAAGT 


7980 


TATCATCTTG 


TAATGACTTA 


AGCAACATTC 


TTGCAATCTA 


TTTTACTTTA 


TATCACAATT 


8040 


AATTACTCAA 


ATATTGATAA 


ATCAATAAAA 


AGACAGGGGA 


AGAAATCCTA 


GAGATTCAAG 


8100 


ATTTACTGTA 


TCAACTCCGC 


TTGTCTGAGC 


AAGCGAGTAC 




GAAAAAAGCC 


8160 


TTGGGATTAG 


TTTGACACGG 


TATCAGATTT 


TACTGTTTTT 


GCTGGAGCAT 


TCTCCTTCTA 


8220 


ACCAAATGCC 


GGTTCAGGAG 


CGTTTGAAAA 


TTCATCAGGC 


TCCTTTGACA 


CGGCATTTCA 


8280 


AAATTTTGGA 


AACGGAAGGT 


TTGGTGGAGC 


GTCATCGTAA 


TCCTGAAAAT 


CAGCGGGAAG 


8340 


TGTTCGTAGA 


GGCTCCGAAG 


TATCCCAAGC 


AGCAGTTAGT 


GGTGAATCCC 


CCTCTGCAAC 


8400 


ATATCACCCT 


TAAGGAAGAG 


ATAGAAAGTA 


TCTTAACAGA 


GTTTGAGAGA 


ACAGAACTCA 


8460 


GCCGTTTATT 


AAATAAATTG 


GTTTTGGGTA 


TTGAAAATAT 


AGAAATTTAA 


GCACAAATAG 


8520 


ATGTCAATTA 


TTTTAACAAC 


CATCGTTGCT 


TTGGAGCATT 


TTTACATTTT 


TTATTTGGAA 


8S80 


AGTATTGCCA 


CGCAATCAGA 


TGCGACTAGT 


CGTCTATTTA 


ATATGGAAAA 


GGAAGAATTC 


3540 


GCTCATCCGT 


CAGTAAGTTC 


ATTGTTCAAA 


AATCAAGGAA 


TTTATAAGGC 


TCTGCTAGGA 


8700 


GTCTTTCTCT 


TGTATGTCAT 


TTATTTCTCA 


CAGAATTTAG 


AAATTGTGAC 


TATTTTTGTC 


8760 


TTATTTGTCA 


TTGGTGCTGC 


GACTTACGGC 


TCTTTAACAG 


CGCATAAAAA 


AATTATTTTG 


8820 


AAACAAGGTG 


GATCAGCTAT 


TTTCGCCTTG 


ATTAGTATTT 


TACTCTTTAA 


ATACACTTGA 


8880 
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ACGTCGATTC 


TAATCTCGCT 


AATCCTTTTT 


AATCCAGAAT 


AAGGGAAATA 


TGTTATACTT 


8940 


GTTTTTAACA AAAAAGTCTC 


ATTGAATTGG 


TTTTGAGGAG 


TTAGAAATGA 


AAGTATTACT 


9000 


GACAGGTTTT 




GAGGGGAAAA GGGCAATCCA GCTTTGGAGG 


CCATTAAAGG 


9060 


TTTACCAGCT 






CCGTTCGCTA 


GAGGTGCCGA 


CACTTTTTCA 


9120 


CAAATCTGCT 




K &f"±A&f?Af?A1* 


GAATCGTTAT 


CAACCTGACT 


TTGTCCTTTG 


9180 


TATTGGCCAA 


CCTGGTwAA 


GAACTAGTTT 


GACACCTGAA 


CGAGTGACCA 


TTAATCAAGA 


9240 


CGATCCATGC 


ATTTCTGATA 






GACCGTCCCA 


TTCGCCCAGA 


9300 


TGGTCCTTCG 


GCCTACTTTA 


GTAGTTTGtt 


C&TTAAAGCG 


ATGGTTCAAG 


CTATAAAAAA 


9360 


AGAGGGCTTA 


CCGGCCTCTG 


I I iCCAATAC 




* A A \* • W • 


GCCATTTGAT 


9420 


GTATCAGGCT 


CTCTATTTGG 


TAGAAAAGAA 




CTTAAGGCAG 


G'l"ITTATGCA 


9480 


TATTCCTTAT 


ATGATGGAAC 


AGGTGGTGAA 






TGAGTTTAGT 


9540 


GGATATTCGG 


CGAGGGATAG 


AAGCAGCAAT 




ATAfiAACATG 

•T% ft mL3^+^%%m^% ft X* 


CACATCAGGA 


9600 


ACTCAAGTTG 


GTAGGCGGAG 


AAACTCAl iG 


ATAnAAAAAA 


GCTTCAGGGG 


AAAJVACCTTC 


9660 


AAGCTTTTGG 


ACGTTTTCGG 


GCCAATACTG 




f" AT AATTTTA 


GTGCATTGGA 


9720 


TATAAGCTAG 


GAGTGAAAAA 


CTAGCAATGC 


CAAAGGTAAT 


CCAATTGACG 


AAGT ACC AAC 


9780 


GAAGAAGCTG 


TAAATCTAGG 


ACAAAGTviLT 




GCCCTTCATA 


AAGGAACGGC 


9840 


TAGTTTTTAG 


GATTCGTCTT 






TAGACTATAA 


CAGAGAAGAA 


9900 


ATTCCACCTG 


TGAATAviuWA 




GAATATAGAG 


GATATTTCCT 


ACAATGATCA 


9960 


AGATGAGACT 


1W* & & Art 
TVfV.AAw/WUUi 


TAGAGTCCAA 


AGACCATGAG 


GAAACGCT^iJ 


GTTTCAACTG 


10020 


ATGACAGATC 


TAGATTTGGA 


AACTCAGGAT 


CTAGGGTGAC 


GAATTTTTTG 


CCTAAAAAGC 


10080 


TACTATAAAA 


GAGGAGGTAA 


ATCCCAAGTA 


AATTAGGCAT 


ACTCCATAAA 


AAGAGATAGA 


10140 


AACGTTTGAG 


AAGTAGGGTC 


AAAAAGGTTT 


GAGAAAAGCG 


CTCCTCATCA 




10200 


GGCTGTTTTT 


TACACATGGC 


TCCGTTTTAG 


AATCTTTCAT 


GAGTGTCAGT 


GTTGCATAGA 


10260 


CGGAACTGGT 


CAAAAGAATA 


GTCCCCATAA 


AGGAGACTAG 


TAGAGGAAAG 


AGCTAGGTTT 


10320 


GAAGTATTTG 
ATAAGGGATT 


GCCAAGTATG 
AAGAAAACCA 


CTGAAAAATG 
GATAAGATGA 


GCTGTTCTAA 
CCAGCATACT 


AACAGTCCCG 
GGGAAGGATA 


TGGATCCGAG 
TAGAGGAGAA 


10380 
10440 


AGAGACGGGG 


GGTGTCAGCC 


TGAAAATGTT 


TTGACTCCTG 


ACGAATTGTT 


TTTAAATCAA 


10SO0 


TTTTTGGATA 


GTTCATTCTC 


TTATTATACC 


ATAGTTCTTA 


TACATAGTTC 


GTGACAGTTC 


10560 


CTACTTTTTT 


TGATAAAATC 


ATACAGTGTG 


TCCTTGGGCA 


CACTGTATGA 


ACTGGGACTG 


10620 
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TCTTTCCCAC CTTCGCAGCT AAAAAATCTC AGATTCACCA ATCAAATATC CTTTCATTAA 10680 

GAAAGAAAAA CACACAGCAC CTCCTCTGGG AGAAATCATC ACTCCCCACC CTAC CT TTCC 10740 

GACACCTATG TTTATCCCAG TTCGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GCCTTCCCCC 10860 

TGGAGATGAA CTCATTCCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTCCTG G TTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTT G 11160 

GGCTGAGCCT CGTTTGAAGG CTCACCCTCG TCCACATGAC CAAGCTTT G T TTGGAATTCT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCC CCAATCAGCT CATCATCTTG TCAGCATGGA 11280 

TTTCTCAGGC TACTCTATCG GTG G TTTGGC AGTCGGAGAA ACCCATGAAG AGATGAATGC 11340 

CCTCTTGGAC TTTACAACTC AACTGCTCCC TGAAAATAAA CCTCGTTATC TGATCGGTCT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTCGG CTGCATATGT TTGACTtSTGT 11460 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GT1TGGTTGT 11520 

GAAAAATGCC CAGTTTCCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTC ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AACGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA ACCAACTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAA G TTTT 11820 

CTCTTAGGAT nTi ' CTTCT T TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 
(2) INFORMATION FOR SBQ ID NO: 97: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60 

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA CCATCTCAAA AATACGG ACA 120 
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CCATCCTCCA 


TCTTTTTCTG 


GCCTTCCTTG 


ACTCTACCTT 


GCTTGCTATC 


AAGGAGACCT 


180 


TCTGCCCACA 


GATAAACAAT 


TCGGAAATAG 


GTCTCATTTT 


CCTTGTACAA 


ATCCTCTTCG 


240 


ATAACACGTT 


TAAAATAATA 


GGCATTGGTA 


AATTCTTCAC 


ACTCAATACT 


AGCTAAAAAG 


300 


CCATTCAATA 


GTATAGTATG 


AAAAAGGTTT 


CGATTGCCAG 


ACATTTCCAT 


TAGAAAATCA 


360 


GATTTACGTA 


CCATTTCTCG 


TACATATCTA 


CTAAAAAGAG 


AAACAGATAA 


AAATGGAGAA 


420 


CTGACTGAAA 


ATAAATTGAG 


TTCATAGATT 


CCCCAGATCT 


CGCTAGAAAA 


CAAATAATCA 


480 


TGAAGGACTT 


TTCCTTCCTC 


TGCTGTTAAG 


TCTACCCTTT 


CATCTATGCT 


CTTCATATAA 


540 


GACTTGATAA 


TAATGGCATT 


TAGAATATGT 


TTCTGTTTCT 


TGTGAGAATG 


GGCATGCTTT 


600 


TATACTCCCT 


GCGATATAAG 


TCCTCAAGAG 


GTGCTATATT 


CTTTGGTTCC 


AAGACATCTG 


660 


TAATTTCTTT 


TCTCAACTCA 


GAATCTGTAT 


CATACTGGAA 


ACCTCTTGCC 


AGAAAGAGGA 


720 


TCTCCTCCAC 


ACTGGCAGAT 


ATATTTTCCA 


GAGCAAATAG 


AAACTTTTCC 


ACCGAAAGCT 


780 


CACTCTGACC 


TGTTTCAAAA 


CGGG ACAACA 


TAGACGGCGA 


AAATTGTCCT 


CCGGTTGCTT 


840 


GTCTCAGTGA 


GATATTTCTT 


GACTCTCGTA 


ATTGTCTAAA 


GACTTTTCCA 


ATCTGCTCCA 


900 


TAGACTTCCC 


CTTGATTCCG 


TATTTTCTTC 


ATTTTATCAT 


ATTTTTCAGA 


AAATTCATCA 


960 


AAAACTTGCC 


AAATTGTCAG 


AATTATGACA 


AAATAGAGGA 


TATTTATCAC 


CTGGAGGGAC 


1020 


TGCTATGAGA 


GACGATATCA 


AAATCAATGA 


CCGTGCTTTG 


GCCTTGCAAG 


AC CAAATTAT 


1080 


CGAAAAACTA 


GAGAAACTTT 


TTGATACAGA 


TGTGGAATTG 


GATGTTTACA 


ATCTAGGTCT 


1140 


GATTTATGAA 


ATCAATCTGG 


ATGAAACGGG 


GCTCTGCAAG 


ATTGTCATGA 


CCTTCACCGA 


1200 


TACTGCCTGT 


GATTGCGCCG 


AAAGCCTGCC 


TATTGAAATC 


GTGGCACCTC 


TGAAACAAAT 


1260 


CGAGGGTATC 


AAAGATATCA 


AGGTTGAAGT 


TACCTGGTCG 


CCTGCTTGGA 


AAATCACACC 


1320 


AATCACTCGC 


TATGGCCGTA 


TTGCCCTTGG 


ACTACCACCT 


CGTTAAGCAG 


ACCAATCACT 


1380 


TTTAAAGATG 


AAAATCAAAG 


GGCAAACTAG 


AAAACTAGCC 


GCAGGTTGCT 


CAAAACACTG 


1440 


TTTTGAAGTT 


ATGGATAGAA 


CTGACGAAGT 


CAGCTCAAAA 


CACTGTTTTG 


AGGTTGTGGA 


1500 


TAGAACTGAC 


G AAGTCAg CT 


CAAAACACTG 


TTTTGAGGTT 


GTGGATAGAA 


CTGACGAAGT 


1560 


CAGCCCAAAA 


CACTGTTTTG 


AGGTTGTGGA 


TAGAACTGAC 


GAAGTCAGTA 


ACCATACCTA 


1620 


CGGCAAGGCG 


ACCTTGACGT 


GATTTGAAGA 


GATTTTCGAG 


TATGAGTTTA 


TTTTTTACCT 


1680 


GACTTGTCCA 


TATTCCAGAA 


CTCTGTCACG 


GCTCCGCGTG 


AAGCAGATGA 


TACGATGTGG 


1740 


GCATATTTAC 


CGAGGACACC 


ACGGCTGTAA 


AGTGGTGGCA 


AGGTTGTTTC 


TGCCTTGCGT 


1800 


TTTTCAAGTT 


CTTCTTCCGA 


TACGGCCATA 


GAAATTTCTT 


TGGTATCTTG 


GTCAACCGTA 


1860 
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ACGATATCCC CGGTACCGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 
TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 
TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG AT AG C ATCTC AGGCATACCA 
GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CCCCATCAAC GATTTCATCT 
CTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTCG CCCAACGTGA 
CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCCT CAGGAGCAAG GTTCCCGTTC 
AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 
CCTCGACTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 
ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCCC AGGGACACCA 
CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 
GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAACTGACA AGTCAACATT TGCGCCATGG 
GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 
ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG CTTTGAGACC AAGTTCCAAC 
ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 
GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 
TTAG CAGT AT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 
TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA 
ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TACTTCCACC ATAGGCGAAA 
ATAGCTGGGA TATCCATATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 
CCACCCATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 
GCGATGATGT CACGAGATCT TAGAGAGAAA CGCATACCAG CCGTTCCCAT AGCGATCCCG 
TCCGCTACGG TAATGCTTCC AAACTGTACA GGCCAACCGC CTGCACATTT GACACCTTCT 
TTACCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTCTATTTTC CGCCCAAGTC 
GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCACGAAGC 
ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 
AATTCACTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 
AAGAACAGAA TAAAATTCTT CAATTTTCAG AAAATTCTAT ACACATCTGA AATATTTAAA 
ATTAAAAACA ACAAAGCCGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 
GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 
CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 
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ATACTCATTA 


AGAAGATAAT 


CCATTTTCOC 


TACTTGACCG 


AATCTTTCTT 


GAACAULwAT 


3720 


CCGATGAATT 


TTTGTTATTC 


CATCATCAGA 


GAATAATTGA 


CATAAACCAW 


TGCCAATTwv. 


3780 


ACCTATCTGA 


TTGTGGTTTT 


CTACAGTAAA 


TATAU'i'i 


CCACTTAAUV 


i 1UI 1'ITiAT 


3840 


CTCTTCTCGT 


ATCGGTTTGA 


TTCTAAATAA 


ATCTATCACA 


CCTACTGAAT 


AACCTAATTT 




AGACAGTTCA 


TCTGCAACTC 


GAATACTTGG 


AGCAACCATT 


ATGCCAGAAG 


CAACGATTAC 


J7QU 


AAGATCTTCA 


CCATGCCTTA 


ACTCAATGTA 


GCCTTTAGAA 


AAATC YTC TC 


CACCTTGATA 




CACAGGAACT 


GGAGCTTTTC 


TAATTGTTCG 


AATATATTTT 


AGTCCTTTTA 


AGTCTAATGT 


4080 


CTGGTTCAAT 


ATTTCACGAA 


ATTGGATATC 


ATCAGTTGCT 


TCGAAAATGA 


TTGATTTAGG 


4140 


AATTAAACGT 


AACAATCCAA 


TTTCTTCAAA 


TGGCATATGT 


GTTCCACCAT 


TCATCTCTGC 


4200 


CGTTACTCCT 


CCATCTGATC 


CAATCACAGT 


GGCATCCAAT 


TGTGCGTATC 


CAAGAGAAAT 


4260 


AAATAATTGA 


TCAAATACTC 


TTCGTGAAGC 


AAAAGGACCA 


AATGTATGAA 


GATAAGGTCT 


4320 


AAACCCCTGA 


ATAGACAAGC 


CTGCTGCAAG 


GCCGACCATT 


TCTGCTTC C A 


TAATCCCAAC 


4380 


ATTCACATAA 


CGCTCTCCAA 


AGTCCTTTTC 


AAGATTATTA 


GTAGCCATCG 


AACTTGACAA 


4440 


ATCGGCTTCT 


AAGACTACTA 


TATCAGAATC 


ACTTTGATTA 


GCCTCTAAAA 


CGAAGTCTCT 


4500 


ATATACATGC 


CCTAATTCTT 


TCGTACTTCT 


CATCATTCTG 


TTTCCTCCAA 


TTCCTCACTT 


4560 


AATCTTTCTA 


CAACTGAACT 


TAACATTTGT 


TTCTCCTCTA 


CAGTAGGGCG 


AAGATGATGA 


4620 


TTGGATTTCA 


TTTCTTCCAG 


CTCTTGAACC 


CCTTGACCTT 


TAATACTATC 


TAATACAATG 


4680 


CACTTACGTG 


ATCAATTATT 


TGACTGTTTT 


AATTGGACAA 


TCCCTTCATA 


AATTTCTCTA 


4740 


ATATCTGAAC 


CCTTGACCCT 


AATGGATTCA 


AATCCAAATG 


CTGAAAATTT 


TTCTACGAAA 


4 800 


TCACCTGGAT 


TACAAATATC 


CITTGTAAAA 


CCATCTAATT 


GTTTWWITT 


ATCATCAACA 


4860 


AATACAATTA 


AGTTGGATAA 


CTGTTGATGA 


GAAGCAAACT 


GTATAGCCTC 


CCAACATTGT 


4 


CCCTCATTTA 


ACTCACCATC 


TCCAACAATA 


GCGTAACTAT 


AAAAGGGACT 


CTTTCTTATT 


4980 


CTCTCACCAT 


ATGCAAGTCC 


AGTTGCAACA 


CTAATTCCTT 


GTCCTAAACA 


GCCCGTTGTC 


5040 


ATATCTATGC 


CTGGCGTTAG 


ATTTCTATCA 


GGATGAGACG 


GTAATTTGGT 


TCCATTTGTA 


5100 


TTTAAAGAAT 


ATAAGAATTC 


TTTGTCAAAG 


AAACCATTCA 


AATAGAGTGT 


ACTGTATAGA 


5160 


GCTGGTCCTC 


CGTGACCTTT 


TGATAATATG 


AAATAATCTC 


TATCTCCTGC 


TGCAAATATT 


5220 


TCTGGAGTCA 


TTCGCATTAT 


TTCACCATAA 


AGCACCGCTA 


AAACTTCTAC 


GATAGACAGA 


5280 


CTTCCTCCGT 


AATGTCCGAA 


TCCAAGATGA 


TTCAATGTTC 


TAAGAGTATT 


TAATCGGATG 


5340 


TTAGTCGCAA ATTTTCTTAA 


CCCATCTTCT 


CTATTTTTAC 


TTAAAATCAT 


CCCTTATTCC 


5400 
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TCCGTTGCAG ATGCCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 
AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 
GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAACTCC TCCCAAAACT 
GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 
GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 
TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 
ACTAAACCAC CAACAAAACT AG AGAT AAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 
TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 
TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 
CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 
ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 
ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 
ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CACTAAAACC ACTCAACCAA 
TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTCTCAT TTCCAGTTAA TT GAACCAT A 
TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGACTGCTAA TCCTCCTAGT 
AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 
ATATATAGAG TGTGGTGCCC TCTTAAAAAA AT AT ATTT AA ATCGAGTAAA ACGAGCGATT 
AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 
AAAGCTACAG CTACAATTCC TTCATTATTC GGCACAACGC C AG AT AAATG AAAAGCATGC 
TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTCTACAA TTCCTGCACC ACCAGATACA 
ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 
TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGCTGT ACTAACAATA 
TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTC ACAAAi »T 
TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 
CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 
GCATCTGCTC CACCTAAATC ATAATGTTCA ACTTCT A CAT CCGAAACATT CAAATCACTC 
AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 
GT AC CAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAC 
TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 
GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 
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TCTTGTTCTG 


CATTATCCAA TAAATAAATC 


GGTTCTTCCA AAACTAACAT 


TGACATTCCT 


7260 


ATTTCATTCA 


CACCTTCATC 


TGGCCCAGCG 


TGAGGAATTG 


CTACTCCCTT 


CCCTAAATTA 


7320 


ATAAAAGGTC 


CAAACTCTTC 


TACTTTTTGA 


ATCATTGCCT 


CAGGGTAGTT 


CTCAGTTATC 


7380 


TTATCTTGAT 


CCAAAAGCGG 


TTTAGCTGCT 


AAACGAATCG 


CCTCCTTCCA 


TCCTAATTTT 


7440 


TGCGAACTAA 


CCTGATAGGT 


TTCTTTGGTA 


ATAAGTTGTT 


CTAGCACTGG 


TACAATTTCC 


7500 


TTTCTATCAT 


TTTTTTGGTA AAGATAATTC 


TTTAACGCCA ATCTTAATTC 


CAATTCTTGT 


7560 


GTAATAATTC 


CATATCTTTT 


GACAATATTC 


AGGATTTGTT 


CAATCTCAAA 


ATCTCCATAC 


7620 


TCTAAATTCG 


GAAAATCTTT 


TAACACTAGT 


TCTACTACTT 


GTATTCCTTG 


CTCTTCAGTC 


7680 


ATCATAACCG 


AAACTAGATA 


ATTTGGCTTT 


TCTGTCTCCA CCTTTATGGT 


AGAAAAAACC 


7740 


ATATCATAGT 


CACTACTAGC 


TTTCACCTGT 


AAATCATCAA TCTTTGAGGT 


TCCTATAAAC 


7800 


TCAATTTGAG 


GAAATAATGC 


TAATAGATTC 


TCTTTTAACA TCAATGAAGA 


ACTAACACCA 


7860 


TTAGGACAAA 


TGATTGCTGC 


TTTATACCAT 


TTTTGAGGCA 


AACTATCTGC 


TTTCTTTAAA 


7920 


TAACCTCCGA 


AATGGATAAC 


AAAATATGCT 


GTTTCACTAT 


CAGGTATGGG 


ATTGTCAATA 


7980 


GCGTCC AT C A 


AGCGCATCAA 


AGAATCTTTG 


ACTAATTCAA 


ATAAATCAGG 


ATAATGTTCT 


8040 


TTAACATGCA 


ATACATATTC 


ATTTGAACTA 


GCTAGGCCGA 


ACTTTAATCT 


ATAGTAAGCC 


8100 


GGTATAAGGT 


GGCGGCCAAG 


ATTTTCTCTC 


AATCCTTCCC 


TTTGTTTAAA 


ATGTAACAAA 


8160 


CAAATATCTT 


CCATTCTACT 


TATAATAGCC 


TCTGTTAATT 


GATTAAACTA 


AACCCGAGCA 


8220 


ACATCTACTT 


CACCTTCAAA 


CCAACTTGAT 


AATAAAACGG 


TGATATAGCG 


ATAATCATCC 


8280 


TCAGAAAACA 


CCCTATCTAT 


AATTCCCAAA 


TCAACCACTG 


TATCCAATAA 


AATACTGGTT 


8340 


ATATCTTGAA 


TAACAGGAGA 


TACTAATGTC 


TCTGAAAGAC 


ATACTCTTTC 


AACATCCCTT 


8400 


TGATACCTAC 


ACAGAATGAA 


TACTAAACCC 


AAAAGCTAAA 


CTTTTAATTG 


ATTAACAATA 


8460 


GCTACTAGCT 


GTAGCTTCTC 


ATAATAATCT 


TTAACTACCT 


GATCAATCAA 


ATCATAAGTT 


8520 


AATGAATACC 


CCCAACTGGA 


TAAAACATAA 


TCCAAACCCC 


AAATCCCTAT 


GGAGGATTCC 


8580 


AGCAACTCAC 


TAACCATTTG 


AAAAGCTAAG 


CGGTGCTTAT 


TCCACTCTGA 


ACCGTGTAAA 


8640 


GTATAACCTT 


TTCCTCTACT GTACCCTAGC TCCAAATCAT TATCTAACAT 


aatctttctt 


8700 


AATGATTGAA 


TATCAGATAA 


GGTTGTATTC 


TTACTTACTT 


TCAAAAACTC 


TTGGTAATGA 


87 60 


CTATTCGATA 


TAAAATCTAA 


TCGGCAAAAA 


GTGTAAAGAT 


AGATTAAAGC 


TAAGCGACTC 


8820 


CACTTTGGTA 


AAACCAATTC 


ATCCGACTTA 


ATAATATCTG 


TCAAAGACTG 


CTTCGTACGA 


8880 


TTTGATAAAC 


TATAGCGACC 


TTGCTTTTTA 


TCCAGCACTA 


TCCCTTTATT 


AGCTAGATAA 


8940 
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GCCACTAAAT AATCTATTCC TTCTTTCACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT ( 9000 

AATTCATATA ACCATAGCTC ACAATGATCC ATCAAACTCA TCAAAATAAC TACTGCTCTA 9060 

TAATCAAAC 9069 

<2) INFORMATION FOR SEQ ZD NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8654 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATCAAGAA AAATTTGCCC TATCGTTTGT GGCCCTTGCA ACTCTAGCAC 60 

TTCTTGCAGC CTCTGGAGAA CTCAACTCTG GAGCACTCAA CACTCCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAACAATC AG G TTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 300 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCCTAG GACCTGCGAC 3 60 

ATCTGGTGCG ACTCCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

ACCAACTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 

CCAAGATAGC TTCCAAGGAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 540 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATCCT AAAGGGATTG CAAAATCTTT 600 

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 

CTTCCAAGCA CCCCTTACAA AAATGAAAGG GAAAGACTTT GATGCTATCG TTGTTCCTGC 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCCT GGCATGGGAA TTGACAAACC 780 

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCACGCTT CTCAACTACT CTAGAACTTT CACCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA CTACAATGAA GAGCCTTCAA CATTTCCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TCAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA GACCACAACA CACTCAAAAC TGCTTACATC ATGACCATGA ACAATGCTAA 1140 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT CTTGAAATAG GGAATGAGCC 1200 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 1260 
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ACGTCGCCTT GGATTATATA TGTGACTGAC TTCGTCACTC TTATCTACAA CCTCAAACCA 1320 
GTGCTTTGAG CAACCTGCGC CTAGTTTCCT ACTTTGCTCT TTCATTTTCA TTGAGTATAA 1380 
CAACCTATCA AAAAGTGAGG GAAAACCCTC CGAATTATAA ATAGAAAGAG TGAATCTTAT 1440 
CCTCCAACAA CTCCTAAATG GTTTGATTCT AGGTAGTCTT TACCCGCTGT TAGCCCTAGG 1500 

ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC CCCCATGGTG ATATTTATAT 1560 

GATGGGAGCC TTTATCGGTT ATTTCTTGAT CAATTCTTTC CAAATGAATT TCTTTGTAGC 1620 

GCTTATTGTA GCTATGCTAG CGACAGCTAT TCTTGGTGTC GTGATTGAGT TTCTTGCTTA 1600 

CCGACCTTTG CGCCACTCTA CTCCTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 1740 

CCTATTGGAG TATGGAATGG TCTATCTGCT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 1800 

GATTCAAACA GTTCGATATG ATTTGGGACC AATTAGCTTA ACAAATGTGC AGTTAATGAT 1860 

TTTGGCCATT TCCTTGATTT TGATGATTTT GTTACAACTC ATTGTCCAAA AGACTAAGAT 1920 

GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACCCG GCGCAATTGA TGGGGATCAA 1980 

TCTAAACCGT ACGATTAGCT TTACCTTCGC TTTGGCTTCT GCTCTTGCGG CTGCGGCTGG 2040 

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGGGGTTA CTCCAGGTCT 2100 

TAAATCTTTC CTTGCCGCAG TACTTGGTGG TATCCGAATT ATTCCTGCTG CGGCTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT CAGATTTCCG 2220 

TGATGCCATT GTTTATGGAA TCTTCTTGTT GATCTTGATT GTCCGCCCAG CTGGTATCCT 2280 

TGCTAAGAAT GTGAAAGAGA AGGTGTAAAC GATGAAGGAA AATTTAAAAG TTAATATTCT 2340 

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTG G TTT CAGTCGGAGT 2400 

ACTTAATCTA TTCTATGTAC AGATTTTACA ACAAATTGGA ATTAATATTA TTTTGGCTGT 2460 

TGCTCTCAAC TTAATCGTTG G TTTTTCAGG ACAATTTTCA CTTGGTCATG CTGGTTTCAT 2520 

GGCGATTGGT GCCTATCCAG CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2580 

CTTTCGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA CTTGCCTTAC TTGTCGGCAT 2640 

TCCAACCTTG CGCTTGAAGG GGCACTATCT TGCGGTAGCA ACTCTGGGTG TTTCTGAAAT 2700 

TATCCGTATC TTTATCATCA ATCGTGGAAG CCTTACAAAT GGTGCGGCAG CTATCTTAGC 2760 

GATTCCTAAC TTTACAACTT GGCAAATGGT TTACTTCTTT CTCGTCATTA CAACCATTGC 2820 

AACCTTGAAC TTCTTGCGTA GCCCAATTCG TCCTTCAACC CTCTCTCTTC GTCAAGATGA 2880 

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAATCATCG CTT T ' IMTC T T 2940 

TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 3000 
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ACCCAAACAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGCTCG 3060 

ACTCCCTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 3120 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT TATTTACGCT TTCCCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG ACCCTATCAC GTTTCTTTAA 3240 

AAAATCTAAG AAGGAGCAAC AAAACTAATC GCATTACTTG AAGTAAAAC A GTTAACCAAA 3300 

CATTTTGCTC GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3 360 

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCCGT 3420 

GTTTATGAAC CAACCGAGCG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3480 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3540 

GATTTAACAG TTTTAGATAA TGTTTTGATT GCTTTTGGAA ACCATCACAA ACAGCATGTT 3600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3660 

CCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG CAGAGACTCT TGCTAAAAAT 3720 

CTTTCCTACG GACAACAACG TCCTTTGGAA ATT G TTCGTG CCCTTGCTAC GGAACCTAAA 3780 

ATTCTCTTCT TAGATGAACC ACCAGCAGGT ATGAACCCAC ACGAAACACC CGAATTGACT 3840 

G AG TTAATTC CTCCTATCAA AGATCACTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3900 

ATGAATCTGG TCATGGAAGT AACAGAACGT ATCTACCTAC TTGAATATGG CCGTTTAATC 3960 

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 4020 

OGTGAAGCCT AATCTCTATG TTAAAAGTTG AAAATCTTTC TCTCCATTAC GGTATCATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTC 4140 

CCAACGGTGC AGGTAAGACA ACTATTCTTC GCACCTTGTC AC G TTT GG TT CGACCAAGTT 4200 

CAGGAAAGAT TGAATTTTTA CGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTCG 4260 

CAAGTCGTCT TTCACAAGTT CCAGAAGGAC GCCACGTCTT TCCTGGCTTG ACTCTTATGG 4320 

AAAATCTTGA AATGGGAGCT TTCTTAAACA AAAATCGTGA AGAAAATCAA CCTAACTTSA 4380 

AGAAGCTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGCCACTC 4440 

TTTCACGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA ACACCAAAAC 4500 

7TCTTCTTT7 AGA7CAACGA TCAATGGGAC TTCCCCCAAT CTTTATCCAA GAAATTTTTS 4560 

ATATCATTCA AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 4620 

ATAAAGCACT TCCAATCTCT GACCGAGGAT ATCTACTGGA AACAGGGAGA ATCGTCCTAT 4 680 

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAGCATAT CTAGGTGGCT 4740 

AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGACAT TACGAAGTAA TCATCAATAT 4800 



WO 98/18931 



PCT/US97/19588 



AGTCCGGCGG 
AAACCTTAAT 
AGCTCCATTT 
AATGTAAGCG 
AGAAGTCTCA 
ATAACAGTAT 
ATCGAAAATG 
TCTAAAGCAA 
AAAGATGTCA 
ACTTATCTGA 
TACGCAGTTA 
GAAGAAGGGA 
GTTTCTTTGA 
GATGGTAAGG 
GAAAAATTTG 
TTCTAAGAAG 
GGAAAGAAAT 
TTTCAGTAAT 
CTCTTTGACC 
CCTGATTTTT 
AAAAACACTT 
TTGGATTTTT 
CTTGATAAGA 
AACAATGCCA 
GATATGTTGC 
TTGGATTACC 
CCTAATGCCT 
CCATTTGACT 
TTGGATGATA 



AC CT TTTT AG 
TTCTAATAAT 
CAGAGCTCTT 
TTTGATAGAT 
TGGCAGTTAA 
CTCATGCAGC 
ATCAATTAGT 
CAACTCTTTC 
TGATTCGCGA 
TGTTGAAAAA 
TTACTGACCG 
TTCGTGTACG 
TTGTAGAAGA 
TCATTATCGA 
AAGCAAATGG 
GGAAGCCCAA 
GATAAAATAT 
TCGTCAGAAA 
TCGAAGGGCT 
CGAACGATAA 
ACAATACCTT 
TGGCTGAAGA 
TAATGACCAG 
TCTTGGAAAT 
TTCGT ATGT A 
AAGCAGGTGA 
ATGGTCTCCT 
CTGCCAAACG 
CTATTGAAGT 



TCGGTAGATT 
TGAAAAAATC 
TTTGCTAGCT 
TTACAAAAAG 
AGATTTTATG 
AGATTTGATG 
TGGTTTGGTG 
TATCTATGAG 
TGTTGTCACT 
TAAGATTAGT 
TGACGTTTTC 
CTTTGTTACA 
AAATTTGAAT 
AGTGCAAATC 
TATTCAAGTG 
AGGCTTCTTT 
GCTATAATGA 
ATTGACGCAA 
AGAGGAAGAG 
TATTGCGGCC 
CCATAAGATG 
CGACTCAGTC 
CTACGAGATG 
CCATCCAGGT 
TACTCGTTAT 
TGAGGCTGGT 
CAAGTCAGAA 
TCGCCATACC 
GGAAATCCGT 



749 
GAG ATTGC AA 
GAATGAAAAA 
TATTCATACT 
ATTGTATAAT 
ACCCGCAAGG 
AGAGAGCAAG 
ACTGAGGGAA 
ATGAATTATC 
GTCTCAGCCT 
ATTCTCCCTG 
CAAGCCTTTC 
GAAGATGAAG 
ATCTCCCATA 
GATGGATCAA 
GAAGAAATCG 
TTTCATGAAA 
AATAATCTAA 
ATCCTCAAAA 
ATTGCCATCT 
CAAAAAACGT 
GAAGAGTTGC 
CATGATGAAC 
ACTCTACTCT 
TCTGGTGGTA 
GGTAATGCTA 
ATTAAGTCGG 
ATGGCTCTTC 
TCTTTCACAT 
GAAGATGATA 



ACAAATCTGC 
TTTCTTACCT 
TTTCTGAATT 
AGGGATAAGA 
TAGTTTATAT 
GTTTGCACCG 
CCATTGCACA 
TTCTGAATAA 
ATGCTAGTCT 
TCGTAGATAA 
TTGAAATTGC 
TTGGTGTTCT 
CAGTCAATAT 
TTGATTTACC 
CTCGCACTTC 
AGGGGATTAG 
AAAAGGACTA 
ATTAGCTTCT 
TGGAAAACAA 
CGCAACAATT 
AGGATGAAGT 
TGGTAGCCCA 
TGTCAGAACC 
CTGAGGCGCA 
AACGCTTTAA 
TAACTTTATC 
ACCGCTTAGT 
CTGTAGAACT 
TCAAGATGGA 



ATCTACATTG 
TCATTCACAG 
TCGAAAAAGA 
ATAGAAAAGG 
TAGTCCACAT 
TCTGCCTGTT 
AGCAAGTCCA 
GACAAAAGTA 
AGAAGATGCA 
CCATCAAGTA 
ACGTTATGGC 
TGGAAAAATT 
TCCGCGTAAG 
AGCCTTCAAA 
AGCAAAAGTC 
AGCAAAAGAT 
TTTATGGACA 
TTCAGGGGGT 
GATGACAGAA 
AAATGAATTA 
CCAAATTTTA 
GTTAGCCGAA 
TTATGACCAC 
CGACTGGGGT 
AGTGGAAGTG 
ATTTGAAGGG 
GCGAATCTCA 
GATCCCAGAA 
TACCTTCCCT 



4860 

4920 

4980 

S040 

5100 

5160 

5220 

5280 

S340 

5400 

5460 

5520 

5580 

5640 

5700 
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5820 

5880 

5940 
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6060 
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6180 

6240 
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6480 

6540 
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TCAGGTGGTG 


CCGGTGGACA 


AAACGTCAAT 


AAGGTTTCAA 


CAGGTGTACG 


TTTAACCCAC 


,6600 


ATTCCAACTG 


GAATTGTTGT 


CCAATCAACA GTAGATCGTA 


CCCAGTATGG 


AAATAGAGAT 


6660 


CGTGCCATGA 


AGATGTTGCA 


GGCTAAGCTC 


TATCAAATGG 


AGCAAGATAA 


GAAGGCTGCG 


6720 


GAGGTACATT 


CTCTCAAAGG 


TGAGAAAAAG 


GAGATCACTT 


GGGGAAGCCA 


AATCCGTTCT 


6780 


TATGTCTTCA 


CCCCTTATAC 


TATGGTAAAA 


GATCACCCAA 


CTAGCTTTGA 


GGTTGCTCAG 


6840 


GTAGATAAGG 


TTATGGATGG 


GGACCTAGAT 


CGTTTTATCG 


ATGCTTATCT 


CAAGTGGCGA 


6900 


ATTAGCTAAG 


ATAGAAAGGA 


ACTCACATGT 


CAATTATTGA 


AATGAGAGAT 


GTCCTTAAAA 


6960 


AATACGACAA 


CCCAACAACT 


GCTCTACGCG 


CTCTTTCGGT 


TAGCGTTCAA 


CCGGGGGAAT 


7020 


TTGCTTACAT 


CGTAGGACCT 


TCAGGAGCAG 


GGAAGTCAAC 


TTTTATTCGT 


TCTCTGTATC 


7080 


GTGAAGTAAA 


AATCGATAAA 


GGAAGCCTAT 


CAGTTGCTGG 


TTTTAATCTG 


GTTAAGATCA 


7140 


AAAAGAAAGA 


TGTCCCGCTT 


CTACGTCCTA 


GTGTTGGGGT 


TGTCl"rCCAG 


GATTATAAAT 


7200 


TGTTACCAAA 


GAAAACTGTC 


TATGAAAATA 


TTGCTTACGC 


TATGGAAGTA 


ATCGGGGAAA 


7260 


ATCGCCGTAA 


TATCAAAAGA 


CGAGTGATGG 


AAGTTTTGGA 


CTTGGTTGGA 


TTGAAGCATA 


7320 


AGGTTCGTTC 


TTTCCCAAAT 


GAACTCTCAG 


GTGGGGAGCA 


ACAGCGGATT 


ccgattgcgc 


7380 


GTGCAATTGT 


AAATAATCCC 


AAACTATTGA 


TAGCTGATGA 


GCCAACAGGA 


AATCTGGATC 


7440 


CGGATAATTC 


ATGGGAAATT 


ATGAATCTCT 


TGGAACGGAT 


TAACyTACAA 


GGAACAACTA 


7500 


TTTTGATGGC 


GACTCATAAT 


AGCCAGATTG 


TAAATACCTT 


GCGCCACCCT 


GTCATTGCCA 


7560 


TTGAAAATGG 


CCGTGTCGTT 


CGTGACGAAT 


CAAAAGGAGA 


GTATGGATAC 


CATGATTACT 


7620 


AGATTTTTTC 


GCCATTTATT 


TGAAGCCTTA 


AAAAGTTTGA 


AACGAAATGG 


TTGGATGACA 


7680 


GTAGCTGCTG 


TCACTTCAGT 


CATGATTACT 


TTGACCTTGG 


TGGCAATATT 


TGCATCTGTT 


7740 


ATTTTCAATA 


CAGCGAAACT 


AGCTACAGAT 


ATTGAAAATA 


ATGTCCGTGT 


AGTAGTTTAT 


7800 


ATCCGAAAGG 


ATGTGGAAGA 


TAATAGTCAG 


ACAATTGAAA 


AAGAAGGTCA 


AACTGTTACA 


7860 


AATAATGACT 


ACCACAAGCT 


ATATGATTCT 


TTGAAGAACA 


TGTCTACGCT 


TAAAAGTGTT 


7920 


ACCTTTTCAA 


GTAAAGAAGA 


ACAATATGAA 


AAATTAACCG 


AGATAATGGG 


AGATAACTGG 


7980 


AAAATCTTTG 


AAGGAGATGC 


CAATCCTCTC 


TATCATGCCT 


ATATTGTAGA 


GGCAAACACT 


8040 


CCAAATGATG 


TAAAAACT.VT 


AGCCGAAGAT 


GCTAAAAAAA 


TTGAAGGTCrT 


CTCTGAGGTT 


8100 


CAAGATGGCG 


GTGCCAATAC 


AGAAAGACTC 


TTCAAGTTAG 


CTTCATTTAT 


CCGTGTTTGG 


8160 


GGACTAGGGA 


TTGCTGCTTT 


GTTAATTTTT 


ATCGCAGTTT 


TCTTGATTTC 


AAATACCATT 


8220 


CGTATTACCA 


TTATTTCCCG 


CAGTCGCGAA 


ATTCAAATCA 


TGCGCTTGGT 


CGG AG CT AAA 


8280 


AACAGTTATA 


TCCGTGGACC 


GTTCTTGTTA 


GAAGGAGCCT 


TTATCGGTTT 


ATTGGGAGCT 


8340 
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ATCCCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TCTCAACAAA 8400 

T CG TTCCTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTACTCC CTTGATGATT 8460 

GCCCTACTAT TTCTCATTCG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8S20 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTCTAA AATCTCCTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 



CCAGAGAAGA CTTC 

(2) INFORMATION FOR SEQ ID NO: 99: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



TGTCGCGTCA 


AAATCATTAC 


TATGGCTATG 


TATAGCCCTT 


ACTATGACTT 


GGCTAAACAC 


60 


CTTCGCTTTC 


AAATTTCTAG 


GCTCAGGCTG 


AAACAGTCTC 


CCAGGCTGTT 


CACTCCCGAA 


120 


TCCTAAAATC 


GTTCTTGATC 


GCTTTCACAT 


TCTACAACAT 


CTTAGCCGTG 


CTATCACTCC 


180 


TCTGCATGTC 


CAAATCATGA 


ATCAGTTTCA 


TCGAAAATCC 


CATGAATACA 


AGGCTATCAA 


240 


GCGCTACTGG 


AAACTCATTC 


AACAGGATAG 


CCGTAAACTG 


AGTGATAAGC 


GATTTTATCG 


300 


CCCTACTTTT 


CGCATGCACT 


TAACAAATAA 


AGAAATTCTT 


GACAAGATTT 


TAACCTATTC 


360 


AGAAGACTTG 


AAACACCACT 


ATCAGATCTA 


TCAACTCTTA 


CTTTTTCACT 


TTCAGAACAA 


420 


AGACCCTGAG 


AAATTTTTCG 


GACTCATTCA 


GGACAATCTG 


AAGCACGTTC 


ATCCTCTTTT 


480 


TCAGACTGTC 


TTTAAAACCT 


TTCTCAAAGA 


TAAAGAAAAG 


ATTATCAACG 


CCCTTCAACT 


540 


ACACTATTCT 


AATGCCAAAC 


TGGAAGCGAC 


CAATAATCTC 


ATCAAACTTA 


TCAAGCGCAA 


600 


TGCCTTTGGT 


TTTCGAAACT 


TTGAAAACTT 


CAAAAAACGG 


ATTTTTATCG 


CTTTGAACAT 


660 


CAAAAAAGAA 


ACGACGAAAT 


TTGTCCTTTC 


TCGAGCTTAG 


CTGACTTCAA 


CCCACTACAG 


720 


TTGACAAAGA 


GCCTAATTTC 


CATAAAAATT 


GACATGGAAA 


TTATAAAACC 


ATTACTAGTT 


780 
















TAGTCCTTTT 


TGATAACCTC 


CCAATTCGGC 


TTGGTTCGCC 


CAAACATACT 


GACCTGGACG 


840 


GATTTCTACC 


ATAGATGGCT 


TATCAGTCTC 


ATAGTCGTGT 


TGACTTGGAT 


CGTAAACCTT 


900 


CAACACCTTC 


TTACGTTCCA 


ACATTGGATC 


TGGGATTGGT 


ACCCCTGAAA 


GCAAGGCTTG 


960 


AGTATATGGG 


TGAATTGGAT 


TGTTAAACAA 


TTCTTCTGTT 


TCTGCAACCT 


CTACAATAAC 


1020 
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ACCCTTGTAA 


ATAACTGCGA 


TACGATCTGA 


AATAAAGCGA ACAACCGACA 


ACTCATGGGC 


1060 


GATGAAGAGA 


TAGGTCAGGC 


CGAGCTCTTT 


TTGGAATTTT 


TTGAGCAACT 


TCAAGACTTC 


1140 


GGCACGTACA 


GAAACGTCCA 


AGGCTGAAAT 


TGGCTCATCT 


GCAATAACAA 


AGTCTGGTTG 


1200 


CATGACCAAG 


GCACGGGCAA 


TACCGATACG 


TTGACGTTGA 


CCGCCTGAGA 


ATTCATGAGG 


1260 


GTAACGAGTC 


AAGTGCTCAG 


CAAGAAGACC 


TACTTCACGG 


ATAATATTTT 


CAACTTTCTC 


1320 


TTTACGTTCT 


TCTTCATCCT 


TAAATAAACG 


GTGATTGTAA 


AGACCTTCAG 


AAATAATATA 


1380 


ATCAACAGTC 


GCACGTTCAT 


TCAAACTTGC 


GGCAGGGTCT 


TGGAAAATCA 


TCTGGATTCG 


1440 


ACGAATCAAT 


TCCGCAGCTT 


GTTCACGCGA 


TTTCTTACCA 


TTAATCTTTT 


GACCATCAAA 


ISOO 


AATGATATCT 


CCATTACTTG 


TATCATTTAG 


ACCCATGATA 


GCACCACCAA 


TAGTTGTTTT 


1560 


CCCACTACCC 


GACTCACCTA 


CAAGCGAGAA 


ACTTTCTCCC 


TTGTTGATAA 


AGAAGTTAGC 


1620 


ATTTTTAACC 


GCGACAAACT 


TCTTACTTCC 


TTCACCGAAG 


GAAATTTCTA 


AATCTTTGAT 


1680 


TTCTACTAAT 


TTTTCAGACA 


TTTCCTTCCT 


CCTACTCACC 


CAGATGGGCA 


AATCCCATTT 


1740 


TTTCACGGAT 


CTTATCATGG 


AGATTTGCAA 


TCACAGCTGG 


TTTTTCTACT 


TTCGGAGCAT 


1800 


CCTCATGAAG 


AAGCCAAGTT 


TTAGCCCAAT 


GTGTCTCTGA 


TACTGAGAAT 


TGAGGAGCTT 


1860 


TTTGTTCGAA 


GTCAATCTGC 


ATTGCGTAGT 


CAGAACGCAA 


GGCAAAAGCA 


TCCCCTTTCA 


1920 


GCTCAGTATA 


AACTGACGGA 


GGTGTTCCTC 


CGATTGAGTA 


AAGATCCCCT 


TTATCATCAG 


I960 


CAAGCTGAGC 


CAAGCTAGAC 


AAGAGACTCC 


ATGTATATGG 


ATGGCGAGGG 


TCATAGAAGA 


2040 


CTTCCTCAAC 


CGTTCCATAC 


TCAACGATTT 


CTCCTGCATA 


CATAACCGCT 


ACCTTATCCG 


2100 


CAATACTTCC 


CACCACACCA 


AGGTCGTGGG 


TAATAAAGAT 


TGTTGTGAAA 


TGATACTCGT 


2160 


TTTGTAAAGA 


TTTTAGCAAA 


TCAATAATCT 


GACCTTGAAT 


AGTTACATCC 


AAGGCAGTTG 


2220 


TTGGCTCATC 


ACAGATCAAG 


ACATCAGGTC 


CGCAGGCAAG 


GGCAATAGCA 


ATAACGATAC 


2280 


CTTGACGCAT 


TCCTCCAGAA 


TATTGGAATG 


GGTATTCATT 


AAAACGTCTA 


TCTGCGTCTG 


2340 


GAATGCCAAC 


CTTATTCATG 


TAGTCAATGG 


CCAATTCTTT 


CGCTTCTTTA 


GCTGTTTTTC 


2400 


CTTGGTGTTT 


TACAATAACT 


TCTGTAATCT 


GACTACCAAT 


TGTTTTAATG 


GGCTCCAAAC 


2460 


TAGTCATTGG 


GTCCTGGAAG 


ATAGTCGCAA 


TCTTAGCACC 


ACGAATTTGT 


TCCCAATCCT 


2520 


TCTGAGAACA 


TAAACCTGTC 


AAGTCCTGAC 


CACCGTACTC 


AATACTACCT 


TGGCCAATAC 


2580 


CACCATTTTC 


TTCGAGCATA 


CCTGTGAAGC 


TCTTTCTCAA 


AACAGATTTA 


CCTGATCCTG 


2640 


ACTCACCTAC 


CAAGGCTAAT 


ACTTCTCCTT 


CGACTACTTC 


AAGGGAAACG 


CCCCGAATGG 


2700 


CTGTCAATAC 


TTTGTCACGA 


ACCTCAAATT 


CCACCACAAT 


ATCGCGAGCA 


GTCAAAATTA 


2760 


cATmriTC 


TTTTGTCATT 


TCTACTCCTA 


TCTATGTGTA 


CCTGGATCAC 


TAGCATCCGC 


2820 
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TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAACCGTTG TCAATCGAAT 2880 

CCACAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2940 

ACTTGGCACT GTAATCGCTA ATCCAAGACC GAAGAAAGAC AAGAAGGCTT CGTATCAGAT 3000 

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA CATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTCCTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 3120 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATCTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG ATTGAGTAAG TCAAGACAAT 3240 

AACAATCAAA AGAGGTGGCA TGTTTG AC AT GACGTTGTAA ACTTCCATCA TGACACGGTC 3300 

AACTGATTTT GAAATACCCC AAATACCACC GACAAAAACA CCGATAACCA AGTTAATCAC 3360 

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGAGCTCCG AACCAGACAC CGTCAAAGAG 3420 

CGATTTAC CG TTACTGTCAC TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3480 

ACTAAAGTCG TTTACCTTGC TCACATCATT GAAATCAAAC TTAGAAAACA TTGGCTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTCT TTAAACACTG ATTTCCAGTA AGAATATGCT GGCGCATCAA TAGTTTCAGA 3660 

GGCAAAATCC TCACCTTTTA CAAACTGAAA TTTTTCTTTA TCGATTCTAG ACATTATTTG 3720 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG TCATCCAAAT ATCTCCCAAA 3780 

ACACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3840 

TTAGATGCTT TTACAGACTC AATCAACATT TTACCCATAC CTGGGAAGGC CAAGACTGTT 3900 

TCACTAAGGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCAGC 3960 

GGAACCATGG CATTTTTAAA CATGTCTTTC TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4020 

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATCTAACG ACGAATCCAA 4080 

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CT G GTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT ACGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 4200 

CCAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAACCA AGAAGGTCAA AGCCCCTGTT 4260 

GAGAGGCTAT CAATCCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 4320 

AGAGCATAGG CAAGAACCAA ACCAATCAAA CCAGTAATAC CAGAGCTGAC AATCATAGAT 4380 

GGATATTGGT AATTACTTTC AGTCGCTGTA TAAGGATCAT CTTTCCCATA GCTAGCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGTAAATATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTGAACTTGG CCAGTTTTGG TTTCTCCTTG ACCTTGAGTA 4 560 
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ATAACCTGAA 


GAACTCGTGT 


ATTAGCATAG 


754 
GTTGGGTAAG 


AGTCACCTAA 


ATTCAnuTTL 


4620 


ACAAAGTTTT 


GATGAACAAA 


TGGGAACTGA 


CTGTTAAAGT 


ACAAGAGATA 


TTTATGTTTA 


4 AAA 


GTTCCTGAAC 


CGACCAATGA 


CCATCCGATA 


GCTGGATCAT 


TTTCAAAACG 


AAGCTAGCGT 




TTCAAGTCTG 


GATTTTCAGG 


CTCTTGGATT 


TTATTTGTAT 


GGTCAATGTC 


ft *.m^*ft kMVIIft 

AATCAAGTTA 




GCATAGAACT 


GAAAAACACG 


TTCAAAAATT 


GGAATTTCAC 


GAGTAGCATA 


GAATTXJACCA 


f SOU 


CTTTCAGTAA 


ATTCTCCCAA 


AGTCCAACCA 


TGACCTAATT 


GATTGATGTA 


CTTTTCATAA 




ATAGCTTTAT 


TGGTCGCATT 


TGCTTCTACT 


GTTACAGAAG 


AATCCATGCT 


ACTTGCCTTT 


47oU 


TCTTGCAACT 


CTTTAGTATC 


GTAATACTCA 


ATGTAGCCGA 


TACGCTCAAA 


CACAGTATTT 


5040 


TCATAGTTAT 


CACGTTTATC 


AGCCGTTCTC 


GCAATTTTAT 


TATAGTTAGC 


ATCCTGCTTG 


5100 


AAAATCAATT 


TTCGAGGAAC 


CAAGCTATAG 


ATAATCGTGT 


AGGTCAAAGT 


CGTTACTAAG 


5160 


AAAATCGAAA 


CCAATGACCG 


CAAAACACGC 


ATAAAAATAT 


ATTTTTTCAT 


ATTATTTCCT 


5220 


TTAAAAATCC 


CAAAAGAACC 


TTCTCCTCAT 


GGAGAGAAAG 


TTCTATTAGA 


AATTATTTAC 


5280 


TTCACATGAC 


TTGCCAATTC 


TTTTTGAGCT 


TTCTCATTTG 


ATTCAGCTTT 




5340 


CATTTTTCAC GAGCTTTTTC 


ATACTCTTCC 


TTAGTCACCA 


CTTTATCTTG 


TGATTTCAAA 


5400 


TATTTGAAGT 


AAACATCTGA 


CCCCTTAGAG 


CCTGTTTGCG 


CAGAAGCTCC 


AGTAAATGGA 


5460 


ACAATTCGTG 


AAAGCACTGG 


TCCTGCACCA 


GAAGAAGCCA 


TAGCAGGAAT 


AAAGAGTGAA 


5520 


CTATCTGTCA 


ACCATGCTTG 


AGCCGCTGCA 


TATTTTTCAT 


AACGGACATT 


CAAGTCGCTT 


5580 


GTCTCTCTGG 


CAGCTTCATC 


AACTAATTTA 


TCCTATTCTT 


TCAAACCAAC 


TTCAACTACT 


5640 


GAAGGGCTAT TTGCATTATC 


AAATCCTAAA 


TATGTTTTTG 


TAGTTTCACT 


GCTAUTTGTT 


5700 


TTTAAAATAT 


CCAGGTAAGT 


AGATGGGTCT 


TGATAGTCTG 


GCCCCCATGA 


AACTCCTCCT 


5760 


GATACATCCC 


AATCCTCAGA 


TGAAGCATTG 


GCAGCATAGT 


AAGTAATATT 


AAGGAATTCA 


5820 


TCACTTGTCA 


TTTGTTGAAT 


ATCAACAACG 


ACATTTTCAA 


CACCAAGAAC 


TGTTTCTACA 


5BBO 




AGGACTGAAT 


ACGAGATATG 




ATGCTTGGTC 


TACTGGAACG 


5940 


GATTGTTTAA 


TAGTTTTTTG 


TCCAGATGAA 


TAGGAAACTG 


AACGCCGTCT 


GCTTCTAAAG 


CTTTCTTAGC 


TTTCGCAAAC 


6000 


TCTCCCTTGG 


CCTTCTCAGC 


ATTGAATAAA 


CCATCCTGCC 


CATCAGCTAA 


ATTCACACCT 


6060 


TTCCACTCAT 


CACCATAAGC 


AGGAAGTTGA 


CCAGCGACTA 


AATCACCAAA 


GGTCTTCTCA 


6120 


CCAGCTGAAA 


CAAACTCTGG 


TTTTACAAAT 


AAATTACGAA 


CTGCTAAAGC 


TGCTCCATCT 


6180 


TTACCATTGA 


TTTGAGCTGA 


GTAAGCTGAG 


CGATCAAGAG 


CAAAATTCAA 


GGCTTGACGG 


6240 


AAATCTTTGT 


TAAGCAATGC 


CTTCTTAGTA 


GCTACTTTCT 


CTGAATCTGT 


AGTTTTAGAA 


6300 


GTATACTTCT 


AACTTTGGCG 


ATCAATATTC 


ACACCCAGAC 


CAGCAATCCC 


AGAGCCTGAT 


6360 
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TGTGTGTAAT 


AGATATTGTC 


CTTGTATTCT 


TCTGCAACCT 


TAGAATAGTT 


CuAGCTGGTA 


£ A *>ft 
©4 <U 


GCGTAAAGAC 


GGGCATAACT 


ATAAGCTCCA 


CTAGTGAACT 


TACGCTCTAG 


CGAC a vJvJ J l\IA 


o*ou 


TCTGATCCAT 


CATAGTAAGC 


TAGATTGATA 


GTATCTAGGT 


CGACAin iv. 


i nATwWUAA 




TATTCCTCAT 


TTTTTACAAA 


CTCTACAGAA 


GATTTTGCAG 


TCAACCCTTT 


CAACAAGAAT 


OOUU 


GGACCATTAT 


AAAGCAAGGA 


TGTCGGATCT 


GTTGGTTTAG 


CAAAATCGCT 


TCCTTTTGAT 


6660 


GTTTCGAATT 


CTTCATTCAG 


AGGCCAGAAA 


ATAGAATAGG 


TCAACTTAGA 


GTTCCAGAAC 


6720 


CCTTCACCCT 


GGTTCAAAGT 


GTATTGTAAC 


GTATAATCAT 


CAACCGCCTT 


GACACCAACT 


6780 


GTTGAAAAAT 


CTGTTGAAGT 


TCCTGATACA 


TAATCTGCCA 


AGCCTTTAAC 


CGAATTTTCA 


6840 


GCTAAATACA 


TAGCTTCTGA 


TTTTTTATCT 


GCTGCGTGTT 


TTAAACCCTT 


CACGAAATCT 


6900 


TTAGCCGTCA 


CCTCTGCATA 


TTCTTCTCCA 


TCAGAGGTAA 


ACCATTTAAC 


CCCTTTACGA 


6960 


ATCTTATAAG 


TGTAGGTCAA 


ACCATCCTTA 


GAGACTTCCC 


AATCCTCTGC 


AACTGCAGGA 


7020 


GCAAGATTAC 


CGTAATTATC 


GTTAGTGAAT 


AAACCATCAA 


TCCCATTTGA 


AGTCACTACT 


7080 


GTTGTACTAT 


TTTTACTTGA 


AATCAGGTAG 


TCCAAGGTTT 


CTGGGTCTGC 


TCTATAAACA 


7140 


TAGCCATAAG 


CTTTAGGGGC 


TGATGAATCA 


GATGATTTTG 


AAGAACTCCA 


TGCTCCAAGT 


7200 


ACACCTGCTG 


CTAATAAAAC 


AAGACCTGCT 


GTAGCAAATA 


CACGATTTTT 


TTTCATTTTC 


7260 


TACTCCTCTG 


TTTATGTGAA 


TTATAGATTG 


ACAACCATTA 


TAT CACATT A 


TCCATTAAAA 


7320 


ATCAAACAAA 


TTTTCAGAAT 


ATTTAGGCTT 


GTTGGCACAA 


ATTTTTCATT 


TTTTTTGAAT 


7380 


ATATGATTCA 


AATTGTCGTT 


CGAAGTGTCA 


AAGACTACAG 


TGAAAATAGG 


AAATTTGACG 


7440 


CAGAAACTTT 


GGAGTTTAGG 


AAGACATACA 


GTAAAATGAA 


ATACGGACGG 


AACAATGTGA 


7500 


TTTTGGAATT 


CAAATTAAAT 


TATAACAATA 


TTGTAGAACT 


ATCATTCTAG 


TATTCAAGAT 


7560 


TCAGTTTACT 


ATGTCTTTTC 


ACACCAACCT 


TATCCCGAAT 


TCAATTACTT 


TTGTGATTTA 


7621/ 


CATATATAGA 


TTAAGACTAT 


CTTTTATACT 


TTAAAATTTC 


TCGCTACCTT 


ATCCACTATA 


7680 


TGCTCCTCGC 


TATCACCTTT 


CTATTCATAG 


CCTACGATTT 


CACTATTGCT 


TT CTCTG AC A 


7740 


ATTCTTATTT 


CCTGCGTCAG 


ACTTAAAACG 


ATCTATCCCC 


AGACCATTTT 


AATCCGCTAC 


7800 


CTCACGATAG 


TCAGGCTTGG 


GGAGCGCTAT 


TGTATTCACC 


GGTAGTGGAG 


CCCTACAGAG 


7860 


GACTTACACC 


TCAGATCCAC 


GACATGCCCA 


TCGTATAAAA 


AATCT CCT AC 


CCAAGGTAGA 


7920 


AGATTTCAAA 


CTTATAAAAC 


TTAATCCGTC 


ATGTCCGATA 


CCAACATTCG 


ATGCTCCAAT 


7980 


GGAATACTGC 


ACATAACTAG 


CAAGAAAATA 


AAGCCTGACT 


GAATCCAGAA 


GAGAGCCAAG 


8040 


TCAAAAATTC 


CCTGCACAGC 


AACCACTGTA 


AGGAAAGATA 


GATAAAGGCC 


CATAATCGGA 


8100 
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CGTTTCCCCG 


ACTCCTGACT 


CATATCCATC 


ATCAAGCGAA 


CAGGAGCAAC 


AGAACACAAA 


8160 


ACTAATAAAA 


TACTCCCCAC 


AATTCCGTAA 


CTCAGAATCG 


TATCAATATA 


AAGACTGTGG 


8220 


GCATGTTCAT 


GATAAGGAGC 


ATGTATCCGA 


GGATAAGACT 


TCATATAGCT 


CAATGGCCCT 


8280 


TCACCCCAAA 


AAGGATTTTG 


CTTAAACAAG 


GCCATCCCAG 


CATCCCAGAT 


AGAAATGCGT 


8340 


TCTTCCATAG 


AAGAGTCTAA 


AGTACCCATT 


CGAACTCCCA 


AATCACTAGA 


AAAGAGGAAA 


8400 


CTCAAACCAA 


TCGCCAAGAC 


CCCAATACTA AGCCAAAAGG 


CCTTCCAGTT 


TTTAATAGTC 


8460 


CTAAAGAGAT 


AGATAATTGC 


TCCAGCGATA 


ATAGCAGGAA 


AGGCAGTTCG 


ATTTTGAGTA 


8520 


AACTTCAAAC 


CAAAGAGATT 


AACAAAGCCT 


GCAATCACAC 


AGAATACTTT 


CAACCAATTC 


8580 


AACTTGGTCC 


TTGTAAACAG 


ATAGAAAGCA 


ATCATAATAC 


AGAAACAACA 


AATAATTCCA 


8640 


TAATAATTAC 


GATTAAAGAA 


GGTCACTTCT 


GCCCGGTTCT 


GATGCCACAC 


CTGCATATTG 


8700 


GGTGAAAGAA 


AAGCATAGTT 


AAATTTCTTC 


ACAATTTGGA 


AATGTTCTAA 


ACTGGCAAAA 


8760 


GCAGCTGACA 


AGACACTACC 


AAACAAGACA AACTCCAAAA TCAATCGAAA 


GAATTTATGG 


8820 


GATAAAATCG 


ACTGATAGTG 


CAAAAAGAAA 


ATAGTAAATA 


GAAACATTCC 


TACTGAACCC 


8880 


ACAAGACCCA 


TCCAATTTTG 


TGCAAGAATG 


GATATAACAG 


TACTATAGCT 


AAGAAAAAGA 


8940 


AGCAGCATCG 


GATGCTCCCC 


CATTTTCTGA 


AGAATACTTT 


TCATGTCTCC 


TGTAAAAATC 


9000 


AAACTGATAA 


TATATAAACA 


GACTACAACT 


ACAAAAAGAT 


AAAAGGGTAA 


AAAGATACTC 


9060 


AGGATAATTC 


CCAATAAAAT 


CAGCTCTTTA 


CTAGACAACC 


CCTTCACCTT 


TTCAATAAAG 


9120 


CCTATTGATT 


TCAAAATGAA TCCTTTCTCT 


CCAAATCAGC 


TGATTCAGAT 


AATAGTAAGC 


9180 


TATCCTATAT 


TCTACCACTT 


TTTTAGCAAT 


TTGAAAACAA 


AGGAAACGTT 


TTCCAAAATA 


9240 


AAAACCCTAT 


TTTATCCACC 


ATATCAAGGC 


TTCAAAATGA 


TACTTCAACT 


CCATTCTCAA 


9300 


TTACCCGATA 


AGTCTGATTT 


TGCAAATCAA 


TTTCTACTAC 


TGCTGTTACG 


GACTTATCTT 


9360 


TATTTTGACG 


TTTGATTACA 


ATGCTGTGAG 


CTGTTGGTGT CTCTATCTCA 


GTAGTCCCTT 


9420 


CTAGATCAAA 


GGCTTCTGAA 


CGGTTACGGA 


AAGAAAATAG 


ATTGAGAAGG 


GCCTTCACAA 


9480 


CAGGTCGTTG 


CACTTCI-XTT 


GCTATTTCCT 


CGTTGCTATA GTAATGACGA TTAATATTTC 


9S40 


GACCTTCTTT 


AGTTTCTTCT 


AATAATTTCA 


AGTCATTCTT 


GCCTGCTAAT 


AGACCCACAT 


9600 


AGTAAATCTG 


AGGAATACCT 


GGGGCAAAAG 


CTTGAATTAG ACGACCGAGA 


AAATACTTGA 


9660 


CATCATCATC 


TCCAAGCGCT 


GAATACTAGG 


TTCAATTGAT TTGGTAGATA 


TCTAAGTTGT 


9720 


TATACTCGGC 


ACTAGAGTAC 


TTACGTTTGA 


CATTGGCTCC 


AACCTTATAG 


AGTTCATTTG 


9780 


AAGCATAGTC 


AATCTCCTCA 


TCGCTCAGGA 


TATCCTTGAC ATCTACTACT 


CCAATCCCAT 


9840 


CATGGGTATC 


TAGCGTCGTA 


AATTGCTTCA 


TCGGGCTCAT 


CTTTAACCAC 


TTAGCCAAAC 


9900 
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GCTCTGTTCT GCAACTGTAA AGAGTATAAA CTGTCACCAT TCGAACAGCA AAATCATAAA 9960 

CATACTAATC ATGCTCTGCT ATTTTAAACT CAATCGAATA GTGTTCATCA ATCTCAGGTA 10020 

AAAGCTCTGT CCCATACTCA CCAGCGATAT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGGTTCCAC AAACAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT CCTCTTACGG ATAAATTCCA 10200 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

CTTCCACTGA ACCATCTTCA AACACAATCT CTTCCTTTGG TGCACCATCC TTACGCTTGT 10320 

AAATTAAATC TACATCAGAC TGTGTCCGAC GGTTTTCTGG CCAAAACTTA TCCCACTTTA 10380 

AAAAGAGAGC TTTAAATTCA CTCGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTT G G 10440 

ATTGACGAGA AATATCATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 1OS00 

CCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTCGCGCAA 10560 

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGCTG AACTCCTCCA ATAGCATCTC 10620 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

ACCTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTCAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATTCATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTC TTO T 1T TCTGCATAGA 10860 

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCACCTTCAC CTTGCCGTAG GTATGGTTAC 10980 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TCAGCAACCT GCCGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TCCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCCTTGGAAG AAGAGATAGA CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AACAGCCAAA GCTTGGTCCG TAGTCCTTCA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220 

AAGCCAGACT CCACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAAGTCAT 11280 

TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 11340 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTCCT GCTTCATCCA. _ > 11400 

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGCTC TGTGGAATCG 114 60 

CATAGCTCAA CTAGAGCAAG ATCAAACCAA AGGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGGA AAGCTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATCAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 11640 
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TAGGGATAAA 


GATCATTACT 


GCAAGTAAAG 


ACAAGACAGT 


GATGACGACA 


GAGTTCCAAT 


11700 


AATAGCCTCC 


AATCCCATCA 


GCTAAGAGAC 


GGCTAAAGTT 


GTCCCATGTG 


AAGTTGGTTG 


11760 


GAAACCCAAA 


GAAATTATCT 


ACAATATCCT 


TAGTGGGTTT 


GAAGGAACTA 


AAGAGGGTAG 


11820 


CAAGGAGCGG 


CACTAAAATC 


AGAACCGATC 


CTAGAATCAA 


TAGAATGTAT 


TTGCCAATCA 


11880 


GGGCTTTTCT 


TTCATCTTGT 


TTCATCATGC 


TTCTCCTCTT 


AAATTTCAAA 


TTTCTTAGAT 


11940 


ACTCTCAATT 


GGATGATCGA 


AATCACTACA 


ATTAAGAAGA 


ACAAGATTAC 


GGCAATGGCA 


12000 


TTGGCATAAC 


CGAATTGGTT 


GTTTTTAAAG 


GCATAGTTAT 


AAACCAAGAG 


CCC AAGTGAG 


12060 


GTTGTGGCAT 


TGTTTGGACC 


ACCACCGGTC 


ATGGCAAAGA 


CTTGCTCAAA 


GGCAGTCAGC 


12120 


CCACCTTTTA 


GCCCTAGGAT 


AAAGACCATA 


GAGACACTTG 


CTAGCAAGTA 


AGGCAATTCA 


12180 


ATGTTCCAGA 


AAACTTCCTT 


GCTAGTCGCA 


CCATCAATCC 


TTGCTGCCTC 


TGTAATCTCA 


12240 


GTTGGAATAG 


ATTGCAAACC 


AGCTAGGAAG 


ATGATGATGG 


GCATAGCCAC 


CCCTTGCCAA 


12300 


AGAAGGACAA 


AGACAGCCGC 


AAAGATTGCT 


CCCCACTTAG 


TCCCTAAAAC 


ACTGGTTTGG 


12360 


AAAAATTCAA 


TATGAAGGGC 


ATTTCCAATC 


CCTGCAAGAC 


CGTAGTTGAA 


GACTTGCTTG 


12420 


AAGATCAAAG 


CCACTGTCAA 


ACCAGATAAA 


ACAGCTGGGA 


AGAAGAACCA 


AGCACGGAAG 


12480 


AAGGTTTGGC 


CTTTGATTTT 


AGAATTCAAG 


ACACGCCCAA 


TCAAGATCCC 


GAGTGCAATC 


12540 


TCACCAACCA 


CCATGGCAAT 


CGCAATGATT 


GCGGTAAAGC 


CAATCCCATT 


CATGAATTTT 


12600 


GGATCCATGA 


AGAGGAGCTT 


AAAGTTCTTT 


AAGCCAACAA 


ATTTGTAGTT 


ATAAGTCAAT 


12660 


CCTGTCCAGT 


TGGTAAAACT 


GTAAAAGGCT 


CCTTGAAACA 


TCGGCACATA 


GAAGAAAATT 


12720 


GCTTGTAACA 


AGAGGGGGAT 


GACCACAAAA 


CCCCATGCCC 


AATATTTTTG 


TAATACTTTT 


12780 


TTCATAGTCT 


CTCTACTCCT 


AATCC A CATC 


CGCTTTCATC 


GGGTTAAAGA 


AGGCATTCAA 


12840 


ATCATTGACC 


ATGCCTTGTT 


TATCACCGCT 


CAAGACATAG 


TTCATGGTCA 


AGGTATGGAA 


12900 


CTCTGCTTCA 


CTCGTCCAGT 


ATTGTTGCAA 


CCAGACCAAG 


TGACGATCCG 


TAAAGGCATA 


12960 


TTCGGTCATA 


CCAGCAAGCG 


GTGAATCTTC 


TCCTCCTTGT 


TTGACCCCTT 


CGATCGCTGT 


13020 


TGGAGATCCG 


TCCACATCGT 


AGTATTTTTG 


CATCACTTCT 


CGACGGCTCA 


TATATTCCAC 


13080 


AAAGGCATTG 




GATGTTTGGT 


GGTGGCTGAG 


ATAGACCATG 


CCAAGTCTCC 


13140 


CGCACCAACG 


GTTAAGCTTT 






ATCATGAAGC 


TCCCAATCTT 


13200 


AAAGTTCGGT 


TTTTGTTCAT 


TAATCCCTGT 


GATCCCCCAA 


GACCCATTTG 


GTCTCATGAG 


13260 


GACATCCCCA 


CGTGCGAACG 


CTCCCATAAC 


ATCGGTATAG 


CCAGCACCTT 


CCCAGTTCTT 


13320 


TTCCTTAGAT 


CCATTGATGC 


GAAGGATGTC 


CATGACCTTG 


ATATCATCTT 


TCATAATCGG 


13380 


ATCCGACAAT 


TTAATCGCAT 


TTGGTTGAGA 


ATAACGAAGG 


TATTGATTTG 


CTTCTTTTCC 


13440 
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TCCACCTCTT 


CCTGTCGCAA AGGCTAATTG ATTGTAACCA 


TTGAGTGTCC AAGCATCTGC 


13500 


ACCTGCAATT 




TTTGTCCTTT 


AGCAACGATA 


TCTTTGACTA ACTGTTCAAA 


13S60 


TTCATCCCAG 




CCTTCAAGCC 


CAGTTCTTCG 


AATTTATCTT TGTTGTAGTA 


13620 


AATTCCATAA 


GCATTAuL I Vj 


TAAAAGGAAC 


GTTGTAAACT 


TTTTCGTTTA CAGCATATTT 


13680 


TTCAGCGTAG 


CCATTTTTCA 


L\iv>u * 1 i 


GTACTCTTTG 


TTGCTCAAAT CTTCAAAAAC 


13740 


ACCTGCTTTT 


GCCCATTCTT 




GGACTGTGGG 


TAAATATTGA CCACATCAGG 


13800 


CACATCTCCT 


GCGAGAACGC 


t» lV* 1L1 IWW 


TACTTCACCA 


GCATTTGGTA CATTGACGAC 


13860 


TTTGACCTTG 


ATCTTAGGGT 


i nvu n w i w 


AAAATCACGA 




13920 


GGTCATTTCT 


TTTTTCTGGT 


TGnAATAU L W 


flATGGTCACT 


GTGCCATCCG CAGATTTACC 


13980 


ATACTTGGAG 


CAAGCGCCGA 






GTACTTGCAA GAACTCCGAT 


14040 


TTTTTTATAC 


CATTCCATTA 


GAAAuLv. 1 ww 


TTTATAAATT 


TATACACCCT TATTGAACTG 


14100 


CACCCCAAAA 


GTTACACAGA 


ATAAATCTAA 


Will * 


CAGTACATAT CATAGTTTTC 


14160 


TAAAAATATA 


CTCTCTACTC 


AAAAAATCTC 


CTTGGCATAA 


RATAACACTT AAGCCCGCAT 


14220 


AC ATT AGTTC 


TGCACCTGAG 


TAAACTTCGC 


CATTTTCCTG 


TAATTTATAT AGTCCCTCTT 


14280 


CATCCAAATC 


TTTTAATTTT 


AAACTTCTTT 


CCATGCTCTC 


TACAACAGAT AAAACGCGAA 


14340 


CCTAGGTTAC 


AATCGTTTGA 


TTTCCGTAAT 


TAAATTGTAC 


AGCTGCTTCA TTGGATACAG 


14400 


TATCAGGATT 


AATTAGTCTA 


TACTCCTCTC 


CTAACTCAAC 


TACTGGTCGT AATTCTTTAT 


14460 


ACAAGTTCAC 


CTGATTAGCA 


ATCCTAGCTT 


TCTCTTCATC 


TGATAAATTT GTCAAATCAA 


14520 


CTTCATAGCC 


CAAATTTCCC 


ATCATTGCTA 


CAAGGCCACG 


TGTTTCTAAT GGTGTCATTC 


14580 


GTCCCATCTG 


ATGATTCGGT 


ACTGCTGACA 


CATCAGCCCC 


CATAGAAATG GTTGGATAGA 


14640 


GATAGGATGA 


ACCGTATTGA ATTGGTAAAC GTGCAATGGC 


ATCAGTATTA TCACTAGCCC 


14700 


AGACTTGTGG 


GAAATAGCGC 


ATCATACCAA 


GATCATTTCG 


TCCACCACCA CCAGAGCAGG 


14760 


ACTCAAAGAG 


AATATGGCTG 


TGCTTCTCTG 


TCAGATAAGA 


AACGAGTTCA TAAAGCCCCA 


14820 


GCATGTACTG 


ATGAGATTCC 


ATCTGTGTCT 


CTAGATAAGT 


TAATCCATTC CCTAGCTTAG 


14880 


TGATATTGCG 


GTTCATATCC 


CATTTAATGT 


AATCAATATC 


ATGATAAAAT AGGAGTTGAT 


14940 


CTAAGACACT 


TTTCAAGTAT 


TCTACTACCT 


GAGGATTGGC 


AAGATTAAGT ACTAATTGAT 


15000 


TCCGAGAATA 


AGTATGCTCA 


TAGCCAGGAA 


CCTGAATAGC 


CCAGTCAGGA TCTTGACGAT 


15060 


ACAAATCACT 


ATCTACAGAA 


ATCATTTCGG 


GTTCTAACCA 


AAGTCCAAAC TGCAAACCTC 


15120 


TTTCATGGAT 


AGCTGAAATC 


AGACTTTCTA GACTTCCACC 


CAGTTTTTCC TCATTAACAA 


15180 
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CCCAATCACC 


TAAAGCACGA 


TTATCATCAA 


AACGATTGCC 


AAACCAACCA 


TCATCTAATA 


15240 


CAAAAAGTTC 


AATGCCAACT 


TTCTTAGCTT 


CATCTGCTAA 


CTCTAACAGT 


TTTTCTCTCT 


15300 


GAAAGTCAAA 


GTAAGTAGCT 


TCCCAGTTAT 


TGATTAGAAT 


TGGACGTTCT 


TTTTTAGAAA 


15360 


ATTCACTTAG 


CATAATGTGC 


TTCAGTACAA 


AATTCTGACT 


TTCATGACTA 


ATACCAGTTA 


15420 


ATCCCTCATC 


TGAATCAGTC 


ACTAAAGCTA 


CCGGTGTTTC 


AftAGTATTCC 


TCAGGAGCTA 


1S480 


ACTTCCAAGA 


AAAGTTTTCT 


GGATTAATGC 


CAATAGCCAC 


CCGAACTTCA 


TTCAATTCAT 


15540 


TTTTTTGAAC 


AAAAGCTTCA 


AACTTGCCAC 


TATACATTAG 


TTGAATAGCA 


AACACATTCC 


15600 


CAGCATCCTC 


TGTGACTCCT 


TGTTCGCATA 


GTAGAAGAGC 


TGGTGTTTGA 


GCATGACCAG 


15660 


AAGCACCTCG 


GTTTGAACTA 


ATCGAAAAGA TTCCTTGTTC 


TACCTGTTGA 


CGTCTAACAG 


1S720 


TCTTTTCACG 


AGCATAAGCA 


CCCTGCAGAG 


TTACTATTTC 


GTAATCTGCA 


GCTGGAAAAT 


15780 


CAGCCATAAA 


AGAAAAATCT 


TTATGGATGA 


CAACTTCCTG 


ATTACTATTA 


TTATCTAATT 


15840 


TACTGTAGCT 


AGCAATAGTC 


GCATCATTAT 


TAAAAGTAGT 


ATAATACAAA 


GTCAGACTAA 


15900 


GTTGAGCCTT 


AGAATCTTCT 


AACATTAAGA 


CAAGACTCTC 


TGTATCCTCC 


ATGCTATGTG 


15960 


GAGAAGGTAA 


GCCCTGTGGA 


CCATTCTGAC 


CTTTTAAAAT 


CTTTGCTTCT 


ACAAATCGAA 


16020 


ACTCTGTTAC 


TTCAGTTACA 


CTATGCTGAA 


CCTGTATGGT 


TGGTTTCCTA 


AAATCTCCTA 


16080 


AGCCATGTTG 


TCCAAAAATC 


TGTCGCTGAG 


TATCTAAACT 


AAAGGTTCGA 


TTAGTAGCCG 


16140 


TTGGATTTCC 


TGAAAAGGCA 


TGGTCTCGTT 


CATAAACACT 


ATTGGAACCT 


TTATAGTTCT 


16200 


TAATAGTCTT 


TCCTAAATGT 


TTCAAAAGTA 


AGTAGCCATT 


TCGATTTTCA 


ATAATCAAAC 


16260 


TTAGATTTTT 


ACTCTCAACA 


TAAAATAGAT 


TATTCTCTAT 


CCTAACTCCC 


ATTTACTTCA 


16320 


CCTCATCACT 


TTATTGATTA 


TATTTTATCA 


CCTGAAATCG 


CTTTCCAAAA 


TAGAAAAATG 


16380 


TCTCAAGAAT 


ATGGTAAAAT 


CTTAGGTAGG 


AGGTAGCACA 


TGTTAGTTTT 


TTCAGAATAC 


16440 


CAGACTGGAA 


CAATCGACCT 


TGCCCTAAGC 


TTTTATGGAT 


ATGAGGAATG 


CACACCTAAT 


16500 


TACTcrrrrG 


GTCCAGCCAT 


TCCTGATACA 


TACGTTCTAC 


ATTACATTAC 


TAAAGGACfcA 


16560 


GGAAAATTTC 


ATTACAAGGG 


TAAAATTGTT 


GATTTAAAAG 


AAGGAGATTT 


CTTTCTATTA 


i6620 


AAACCAGAGG 


AACTAACCTT 


TTATCAAGCA 


GATAGTAAAG 


AACCTTGGGC 


CTACTACTGG 


16680 


TTAGGAATCA 


CrGGAGGGAA 


AGCCCCTCAT 


TATTTTCCTC 


TTTCCCAAAT 


TTCTGATCAA 


16740 


TCCTATCTCA 


TCCAATCTGA 


AACTTGTCAT 


ACCCAGACTA 


CTGCAAAACT 


CATCTCAGAC 


16800 


ATTGTCCGCT 


TCGCTCAGAT 


TACAAAATCA 


AGTGAATTAG 


CTCAACTCCA 


TATCATGGGA 


16860 


CAACTTCATG 


AACTGATGTT 


TCATCTGGGA 


ACTATTGCTC 


CCAATCAGAA 


AAAAAAGAAT 


16920 


ATTTCATCAA 


CCCACCAACT 


CTATCTTGAA 


TGCAAACGAT 


TAATTGATAG 


CCACTATCCT 


16980 



( 
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CAATCACTTA 


CAATTCAAGA 


TTTAGCAAAA 


WJWIW 1A1 Www 


TTCACAGAAG 


CTACTTATCA 


17040 


AGCCTATTCA 


AAGAATTTAA 


TALCTTA 1 w\ 


PCCAJUVGAAT 

^ W ^ ^^*"»Wfw^ ■ 


ACCTACTCTA 


CGTTCGAATG 


17100 


CACCGAGCTA 


GACAACTTCT 


CGAAAATACw 




TCAAGCTAAT TGCATACTCG 


17160 


GTAGGTTTTT 


CAGATCCACT 


CCATTTTTCG 




AACAATACTT 


TAATCAGACT 


17220 


CCAAGTCATA 


CAAGAAAAGA 


ATACTCTCAA 




TAACAAAGGC 


AACATTATGA 


17280 


AATCCTACCA 


AGCTCTCTAC 


CAAATCCTAT 


CTAAAGAAAC 




AGCGCAGAAA 


17340 


AAATCGCAGA 


AAAACTATCC 


CTAAGCCGAA 


CAGCAATTTG 


ft A. A. A Of CATC 


AAGCGACTAG 


17400 


AACAAGAAGG 


CATTGAAATT 


GATAGTATCA 


AAAATAGAGG 




ATGAATGGTG 


17460 


ACCTTATTCT 


TCCAGAGATT 


CTAGAAGAAA 


ATCTTCCAAT 




TTTAAACCCG 


17S20 


AAACAAAATC 


AACACAACTA 


GATGCAAAAG 


AAGCAATTGA 


■■■■op n.f^j^.r^r* a*p 


GAAGCAAATA 


17560 


CCCTCTATCT 


AGCTTCCTAT 


CAAACAGCAG 


GCCGAGGCCG 


11 1 1 UM^u A 




17640 


CACCACAAGG 


TGGTATTTAT 


ATGACACTCC 


ATCTTAAACC 




TATGACAAAT 


17700 


TACCATCCTA 


CACACTACTT 


GTAGCTCGAG CTGTCTACAA 




i&( *t _rr aactt 


17760 


TAATAGATCT 


CGACATAAAA 


TGGGTCAATG 


ATATCTATCT 


AAACAATCAT 


AAAATTGGAC 


17820 


GAATCCTTAC 


TGAAGCAATG 


ACCTCTCTAG 


AAACTGGCTT 


ACTCACAGAT 


ATCATTATTG 


17880 


GAGTAGGTAT 


CAATTTCACT 


ATTAAAGACT 


TCCCTCAGGA 


ATTAAAAGAA 


AAAGCTGCCA 


17940 


GCTTATTTAA 


ACCTACAGCT 


CCTATAACAA 


GGAATGAATT 


GATCATAGAA 


ATCTGGCGTG 


16000 


CTTTCTTCGA 


AACACCAGCA 


GAAGAGCTAT 


TATACCTATA 


CAAAAAACAG 


TCATTCATTC 


18060 


TAGGAAAAGA 


ACTCACTTTC 


ACACTAGAGC 


AAAAAGACTA 


CAAGGGACTT 


r*fT A A Aft Af*A 


18120 


TCTCAGAAAA 


TGGAAAACTT 


TTAGTTCAAT 


GTGATAACGG 


AAAAGAAATC 


TGGCTAAATA 


18180 


GTGGCCAAAT 


TTCTCTCAAT 


AGTTGGAAGT 


AAAATAACAC 


AATTATAATA 


TAAACGATAT 


18240 


AAAAATAACT 


TCAGATTAGT 


AATTCAATTA 


AGTTTTACGG 


ATCTGAAGTT 


TTATTGGCTC 


18300 


TAAAAATAAA 


AAAGAGAGTT 


ACAGACTCTC 


ATTAAAACGG 


AGAATAAGGG 


ATTCGAACCC 


18360 


TTGCGCCAGT 


TACCCGACCT 


AACCATTTAG 


CAAACCGTCC TCTTCAGCCT 


CTTGACTAAT 


18420 


TCTCCAATTA 


ATGGGCACGA 


GTGGACTCGA 


ACCACCGACC 


TCACGCTTAT 


CAGGCGTGCG 


18460 


CTCTAACCAC 


CTGAGCTACG 


CGCCCAAGTT 


AAAAAACTTG 


GTAATTTGAA 


CAAAGTTCAA 


18540 


AGCGGGTGAC 


GAGAATCGAA 


CTCGCGACAA 


CAGCTTGGAA 


GGCTGTAGTT 


TTACCACTAA 


18600 


ACTACACCCG 


CATAAATACT 


ATCAATAAAA 


TGGCGCGAGA 


CGGAATCGAA 


CCGCCCACAC 


18660 


ATGGAGCTTC 


AATCCATTGC 


TCTACCAACT 


GAGCTACCGA 


GCCTTATTGC 


GGGAGCAGGA 


18720 
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TTTCAACCTA CGACCTTCCG GTTATGAGCC CCACGAGCTA 
TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC 
GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC 
TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG 
AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA 
AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT 
CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT 
TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCCCC 
TATCCAGTTG ACCTAAGGGT GCTCCATATT ATGCCGAGGA 
TCCTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC 
TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT 
ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG 
CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA 
ATATCTTAAT GCGCGTTAAG CGACTTGAAC CCCCACGCCG 
CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC 
TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC 
GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



CCGAGCTGCT 
ACGCTTTTAC 
TCCAATATTC 
AGCTCTAACC 
CCCCCGACCT 
CGGGAAGACA 
GAGCTACTTC 
TGATTCGTAG 
CCGGAATCCA 
AGTTCCGCCA 
TGGCAAGGTG 
CCGGCTACAT 
GCTAAGCCGG 
TTAAGCGCCA 
CCGTACTGGG 
TAACGAGTCT 



CCATCCCGCG 
ACGCCTGACG 
AAATGGACCT 
AGCTGAGCTA 
CCCGGGTATG 
GGATTCGAAC 
CCGAGTTAAA 
TCAGGTACTC 
ACCGGTACGA 
CCCCGGCCTC 
CTGTTCTACC 
GACTTGAACA 
CTCATTTGTT 
GATCCTAAAT 
CTCGAACCAG 
AAAATAACTT 



18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19718 



(xi) SEQUENCE DESCRIPTION ; SEQ ID NO: 100: 
CCCTGGAAAA GTCTGGATAG TCAATCCTCT TCACACAATG ACCTGAAAGA ACCCTGAGAA 
TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 
AGACGGAATA TTGAGGAGAA AAATCCTGAG CCTACCACTT GGAGT'GGAA AGAGCTGACT 
GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 
AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 
CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 
TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 



60 
120 
180 
240 
300 
360 
420 
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TGGATGCCCG 


TGAGTGGGTT 


CGTGTCCTCA 


AGGAAACGGG CTTCAAAAAG TTGATTTTGG 


480 


TGCTCAAGCA 


CCACGATGGC 


TTTGTCCTTT 


ATCCGACAGC TCACACAGAT TATTCGGTTA 


S40 


AGGTCAGTCC TTCGACGAGA GGAAAGGGCG 


ACTTGCTCCT 


TGAAGTATCC 


CAAGCTGCCA 


600 


CAGAGTTTGA 


TATGGATATG 


GGGGTCTACC TCTCACCGTG GGATCCCCAT 


AGTCCCCTCT 


660 


ATCATGTGGA 


CCGAGAAGCG 


GACTACAATG 


CCTATTATCT 


GGCTCAGTTG 


AAGGAAATCT 


720 


TATCAAATCC 


TAACTATGGG 


AATGCTGGTA AGTTCGCTGA CCTTTGGATC 


GATGGTGCCA 


780 


GAGGAGAGGG 


CGCGCAAAAG 


GTTAATTATG 


AATTTGAAAA 


ATGGTTTGAA 


ACCATTCGTG 


840 


ACCTGCAGGG 


CGATTGCTTG 


ATTTTTTCAA 


CAGAAGGCAC 


CAGTATCCCC 


TGGATTGGCA 


900 


ATGAACGAGG 


GTATGCAGGT 


GATCCACTGT 


GCCAAAAGGT 


GAATCCTGAT 


AAACTAGGAA 


960 


CAGAAGCAGA 


GCTGAACTAT 


CTTCAGCACG 


GGGATCCCTC 


GGGCACGATT 


TTTXCAATCG 


1020 


GAGAGGCAGA 


TGTTTCCATC 


CCTCCAGGCT 


GGTTCTACCA 


TGAGGATCAG 


GATCCTAAGT 


1080 


CTCTCGAGGA 


GTTGGTCGAA 


ATCTACTTTC 


ACTCAGTAGG 


GCGAGGAACT 


CCACTCTTCC 


1140 


TTAATATTCC 


GCCGAATCAA 


GCTGGGCTCT 


TTGATGCAAA 


GGATATTGAA 




1200 


AATTTGCGAC 


CTATCGCAAT 


GAGCTCTATA 


AAGAAGATTT 


GGCTCTGGGA 


GCTGAGGTAT 


1260 


CTGGTCCAGC 


TCTTTCCGCA 


GACTTTGCTT 


GTCGCCATTT 


G AC AG ACGGC 


CTTGAGACCA 


1320 


GCTCTTGGGC 


AAGCGATGCA 


GACTTGCCCA 


TCCAGTTAGA 


ACTCGACTTA 


GCTTCTCCTA 


1380 


AAACrTTTGA 


TGTAATTGAG 


TTAAGAGAAG 


ATTTGAACCT 


AGGGCAACGA 


ATCGCTGCTT 


1440 


TTCATGTGCA 


AGTAGAGGTG 


GATGGTGTCT 


GGCAGGAGTT 


TGCTTCGGGT 


CATACTGTTG 


1500 


GTTACAAACG 


TCTCTTACGA 


GGAGCAGTTG 


TTGAGGCACA GAAGATACGT 


GTAGTCATTA 


1S60 


CAGAATCACA 


GGCTTTGCCT 


TTGTTGACCA 


AGATTTCCCT 


TTATAAAACT 




1620 


CAAAAAAAGA 


AGTTGTTCAG 


GAACTACCAT 


TTGCAGAAAA 


AACCCTAGCT 


GTCGCAAAGG 


1680 


GAGAAAATGC 


CTATTTTACA 


GTTAAGCGCA 


CAGAATGTAG 


TGGTCCTTTA 


GAACCTAAGA 


1740 


TTTCGATTCA 


ACCGGGGACA 


GGTGTCCATG 


GTGTCGCCTA 


TCAGGATGAG 


ATTCAAGTCC 


1800 




AACTGGTGAG 


ACTGAAAAAA 


GTCTGACGCT 


ACCAACCTTG 


TATTTCGCAG 


1860 


GAGATAAAAC 


CTTGGATTTC 


TATCTGAACC 


TAACGGTGGA 


TGGTCAGCTT 


GTGGATCAAC 


1920 


TTCAAGTCCA 


AGTTTCATAA 


AAGAAGAACC 


TTTGCGCGAT 


GCAAAGGTTC 


TTTTGGTTAT 


1980 


TAGTGACTTG 


GTAACCAGCT 


GAGGGTGAAA 


GTTACTTGTT 


CAGCTTTTAA 


GAGGTCTTGG 


2040 


TGTTGAATAG 


TTGATACGAG 


TGTTTTGTCC 


AGTCGGCATT 


CTTTGACAAA 


CTTAAAATGG 


2100 


TTGTGGTTTT 


GTTTACTATG 


GATATCCAGC 


CATTTATCTT 


CTTTAGCGAG 


GTAGACTCGT 


2160 
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AGATGGTCAA 


AGAGAGGGAT 


TCCGAGCTCA 


TAGCTTGGTT 


TTCCTGGACA 


GGTTGGATAA 


2220 


AATCCGAGAG 


CTGACCAGAT 


GTACCAAGCA 


GAGAGACTAC 


cattutcttc 


ATCTCCAGGA 


2280 


TAGGCTTCCC 


AACTTGGGTG 


AAAAGCTTTC 


TGACGGAGCG 


TCTTGATAAG 


AAGGGCAGTG 


2340 


TAGTCAGGGT 


AATCGCTGTA 


ACGGAAGAGA 


TAAGGAATGT 


GGAAACTAGG 


CTGGTTGGAA 


2400 


ATGCCTATTT 


GTCCAAAAGG 


AGCAGTAGCC 


ATCTCGCTCA 


TTTCGTGAAT TTCGTAACCA 


2460 


TAGCCTCTTG 


TTTCAAAGAG 


GGGAGCATCT 


TGACAGGCTT 


TCAAAAGATA GTTGCTAAAG 


-2520 


GTTTCTTTTC 


CACCCATCAG 


TTGGATTAAG 


CCAGGGATGT 


CCTGGAGAAC 


CCCTAAAGTA 


2530 


GCTTGAATGG 


CAGAGCATTC 


AGCGTAGTCT 


CCCCCCCAAC 


TATAAGGAGA 


GAAGTCAGGG 


2640 


TGAAAGTTTC 


CTTGATTGTC 


TCGTGCTCCC 


ATGTAACCTG 


TCTCAGCCTC 


AAATAGCTGG 


2700 


CGGTAATTTT 


GTGAAGCAGC 


CTTGTAGGTT 


TCACCGATTT 


CTATGTTCTC 


TAGTTTTTTG 


2760 


GCACAGCTGG 


CGATACAAAA 


GTCACTATAG 


GCATAGTCTA 


GAGTATGGCT 


AACACTTTCG 


2820 


TGGTGGTCGG 


TAGACAGGTA 


ACCTAGTTCT 


TGGTATTGGG 


CTAGTCCGTG 


GCGGCCATTG 


2880 


ATGCCGAGAG 


GGTCGGCTTT 


GCTGGCTCTT 


TCGAGCATGG 


CTTCGAAGAG 


TTCTCCTTCT 


2940 


AGGTCGGGGG 


TCATGTCCTT 


GCAGGCGCTA 


TCTGCGATAA 


TACCGTCTAA 


AAGTGTACCT 


3000 


GGCATCATAC 


CCCGTTCATC 


TGGAGCCAGC 


CATTTTGGAA 


GGAAACCAGT 


ATCCCGGTAC 


3060 


CTATTGAGGA 


AACCTTCTAA 


AAAGCGTTGA 


TAGTGCTCCG 


GTATGATAAG 


GGCAAAGAGG 


3120 


GGGAAGGTGG 


TGCGGAAGGT 


ATCCCAGAAA 


CCATTGTTGC 


TAAAGAGGAC 


ACCAGGCTTG 


3180 


ACAGTACCAG 


TAGCCACATC 


CATGTGGATG 


GCTTCCCCTG 


ATTCATTAAT 


CTCATAAAAA 


3240 


GTCTGTGGGA 


AGAGGAAGAG 


TCTGTAGAGG 


CAGTGGTCAA 


AGAAGGTTCG 


GTCAGCCTCT 


3300 


CCTGTCTCTA 


TAATGTCAAA 


ACGATGGAGG 


AGATTTTCCC 


AATCCACTTG 


GCCACTTCAT 


3360 


TTACAGCTAT 


CAAAATCTTC 


TTGAGGTAGA 


TTGATTAGAG 


CTTGAGAAGG 


AGAGATGAAA 


3420 


GAAGTGGCTA 


GTTGCATCTC 


GGTTTGACTA 


CTTGCTAAGT 


CAATTCGCCA 


GTCTCCAGCT 


3480 


TCTTGGCTGA 


TAGCAAGAAT 


ATCCGTGTTC 


ATTTGCAGGG 


CAGTGAACAT 


CCTTAGCGAA 


3540 


TTTTTGTTAG 


TTTCAGTTTT 


ACCTTCTTGT 


CGCAGGGCAA 


GAGTCCGCTT 


ATCTACTTGC 


3600 


TCTACTGTCA 


CTTCATCTGC 


TGCGTGAAGA 


TAGAGGGAGA 


GGGCTTTGCC 


TTGCTTTTGA 


3660 


TTCAAACGAA 


TAGAAGCACC 


ATAGCAAGTC 


GGTGTGAGCT 


GGGTTTCAAT 


CKSATAACCC 


3720 


AGAGAAAAGA 


GCTTCAAATA 


GTGAGGCTGG 


AAGCAAGCTT 


TATCTATATC 


ATAACAAGAC 


3780 


TGGCGGTGAA 


AGAGGCTGTC 


TCCCCCCAGT 


TGACTGGTGA 


CACGTGTCAG 


AAGGAGCCAA 


3840 


GAGTAGTCCC 


CAATCCAAGG 


ACTGGCCTGG 


TGAGTTAATC 


GAATCCCCTG 


AAAGATAGGC 


3900 


AGATGTGGAT 


CAAAAAACCA 


AGATCCATCC 


TGGTCACTGG 


TCTGCGGCAC 


AAAGTAATTC 


3960 
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ATCCCAAAAG GCACGCCTGT CTATCGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 
CTACTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 
GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



CTGGTTCAAT 


TATTATTCAC 


TCTAAGTAGT 


CATATGTTCT 


TTATTTATGT 


GAGTTTTTAC 


60 


CTTTTAAACG 


ATCTTGTTAG 


ATGGGAGAAG 


GTTTTAAAAG 


TGACAGATGA 


TAATACAAGA 


120 


AAAGTTCGTT 


T ATT ACT AG C 


CTTTTTTAGC 


ATTGTCATAG 


GCTACATCCT 


GACTTCTTTC 


180 


TTTATTACCC 


TGTATCATTT 


GTGGCAAGAA 


GCGCTTAGAG 


GATTATTATG 


AAATCAAGAG 


240 


TAAAGGAAAC 


GAGTATGCAT 


AAAATTGTGG 


TTCAAGGTGG 


CCATAATCGT 


CTGGTAGGAA 


300 


GCGTGACGAT 


CGAGGGAGCA 


AAAAATGCAC 


TCTTACCCTT 


CTTGGCACCG 


ACTATTCTAC 


360 


CAACTGAAGG 


AAAGACCGTC 


TTGCAGAATG 


TTCCGATTTT 


GTCGGATGTC 


TTTATTATGA 


420 


ATCAGGTAGT 


TGGTGGTTTG 


AATGCCAACG 


TTGACTTTGA 


TGAGGAAGCT 


CATCTTGTCA 


480 


AGGTGGATGC 


TACTGGCGAC 


ATCACTGAGG 


AACCCCCTTA 


CAAGTATGTC 


ACCAAGATCC 


540 


GCGCCTCCAT 


CGTTGTATTA 


CGGCCAATCC 


TTGCCCGTGT 


GGGTCATGCC 


AAGGTATCCA 


600 


TGCCAGGTGG 


TTCTACGATT 


GGTAGCCGTC 


CTATTGATCT 


TCATTTGAAA 


CGTCTGGAAG 


660 


CTATGGGGGT 


TAAGATTAGT 


CAGACAGCTG 


GTTACATCGA 


AGCCAAGGCA 


GAACGCTTGC 


720 


ATCGTGCTCA 


TATCTATATG 


GACTTTCCAA 




AACCCAGAAC 


TTGATGATGG 


780 


CAGCGACTCT 


GGCTGATGGG 


GTGACACTGA 


TTGAGAATGC 


TCCGCGTGAG 


CCTGAGATTG 


840 


TTGACTTAGC 


CATTCTCCTT 


AATGAAATGG 


GAGCCAAGGT 


CAAAGGTGCT 


GGTACAGAGA 


900 


CTATAACCAT 


TACTCGTGTT 


GAGAAACTTC 


ATGCTACGAC 


TCACAATGTA 


GTCCAAGACC 


960 


CTATCGAAGC 


AGGAACCTTT 


ATGGTAGCTG 


CTGCCATGAC 


TGGTGGTGAT 


GTCTTGATTC 


1020 


GAGACGCTGT 


CTGGGAGCAC 


AACCGTCCCT 


TGATTGCCAA 


GTTACTTGAA 


ATGGGTGTTG 


1080 


AAGTAATTGA 


AGAAGACCAA 


GGAATTCGTG 


TTCGTTCTCA 


ACTAGAAAAT 


CTAAAAGCTG 


1140 


TTCATGTGAA 


AACCTTGCCC 


CACCCAGGAT 


TTCCAACAGA 


TATGCAGGCT 


CAATTTACAG 


1200 
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CCTTGATGAC AGTTCCAAAA CGCGAATCAA CCATGGTGGA CACAGTTTTC GAAAATCGTT 

TCCAACACCT AGAAGAGATG CGCCGCATCC GCTTGCATTC TGAGATTATC CGTGATACAG 

CTCCTATTCT TGGTGCACAC CCTTTGCAGG GAGCAGAAGT TCTTTCAACT GACCTTCGTG 

CCAGTGCGGC CTTGATTTTG ACAGGTTTCC TAGCACAGGG AGAAACTGTG CTCGCTAAAT 

TGCTTCACTT GGATAGAGCT TACTACGGTT TCCATGAGAA GTTGGCGCAG CTACGTCCTA 

ACATTCAGCC GATTGAGGCA AGTGATGAAG ATCAATAAGA AATCAAGCTA CCTACTCAAC 

CGTTTACTTT T A GT CATC AT AGTACTGATT TTAGGTACTC TGCCTCTAGC AATCGGTTTA 

ATGCTACGTT ATGGAATCTT GGCCAAGGCT CAACATCCAT GGGCTATCCT GTCTCCACCA 

AAATGGCAGG AATTGATTCA TAAATTTACA GCAAATTAGC CTCGAGAACC AGCCTTTTTC 

TAAAGATAAC GAGAAATATG AACAAAAAAA CAAGACAGAC ACTAATCGGA CTGCTACTCT 

TATTGCTTTT GTCTACAGGG AGCTATTATA TCAACCAGAT CCCGTCCGCA CCTAATAGTC 

CCAAAACCAA TCTTAGTCAG AAAAAACAAC CGTCTGAACC TCCTACTCAA GCATTGGCAG 

AGAGTCTCTT AACAGACGCA CTCAACACTC AAATAAACGG GAGTCTGGAG TGGAATCCCT 

CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA TGCCAAGGTT TCAACTAACC 

CCTACGCTGA CAATAAAACA AAGACAGTGG CCAAGGAAAC TCTTCCAACC GTACCTAATG 

CCCTCTTGTC TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGGC AATGGTTCAA 

CTTCTTGCAC TCCTCCAGGT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 

CAGTCGATAG AGGTCATTTG TTAGGCTATC CCTTAATCCG TGGTTTGGAT CGTTTTGATG 

CCTCAACAAG CAATCCTAAA AACATTGCTC TTCAGACAGC CTGCGCAAAT CAGGCACAAG 

CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAAGCC TTGGACCAAA 

ACAAGCCTGT CCCTTACCGT GTAACCCTTT ACTACGCTTC AAACGAGGAT TTACTTCCCT 

CAGCTTCACA GATTGAACCC AAGTCTTCCG ATGCAGAATT GGAATTCAAT CTTCTAGTTC 

CCAATGTTCA AAAGCCACTT CAACTGGATT ACCGAACTCG AGAACTAACT GTAACTCAGT 

AAAAGATACG CCTACACTCC TATCTCACTT ATGCATGTAC GACTTCTTTT TACTAGTTTA 

ACCAGCACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC TTCCACTTTT 

GGGAGAGAGA TGGAAGTTAC TTTGAGA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY : linear 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2727 
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ixi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


102: 






TTTTTTCTAG 


Mill ns*u ■ v*u 




CTAAAAAATA 


AAAAACAATT 


TTTGAAAATT 


60 


ATGTTAGCAG 


GAATTGCTVC 


AAA1 1 V-V*>\ * 1 


TTATCACTTA 


CAGGTTTACT 


TGTTTTATTG 


120 


TTCACATCGT 


nlivWl luwi 




TTTTTTATCA 


TTAACTTAGC 


TATGATTTTT 


180 


ATTAATTCAA 


i 'i V.L 111 


TCAGTATGAT 


AGTCGTATTA 


TTTTAAGATA 


CTTGAATTCT 


240 


AACAATAATA 


ACTTGAATTT 


»■>/ i ft ft #ft ft «•* ft. »H 

TCAATATATA 


GTTCAACTTT 


TAATAGCATT 


TGTTATTATT 


300 


TATTTTCCTT 


TGAGTCAACT 


ATTACAGTTT 


TTGACACCCA 


ATATTATTGT 


TCGTAGTATA 


360 


GGAGGGGTGG 


TTGTTTCTAT 


ft /«M^/*HIMM*<ll 

ACTGCTTTCT 


ATATTATATA 


TGATAGGAAG 


GACGAAATAT 


420 


GTTCTACGTA 


AATAGTTATG 


TTTTTGCTTA 


TAAAAAAGAA 


GGTATAATGT 


ATTTACGTCG 


480 


TCGGAGTATG 


CGGGAAATAG 


CTATAGAACC 


TCAAATTTCG 


CAAGAATTTA 


TCAACGATCT 


S40 


ATTTAATACT 


tV**f* ft ft ft ft 

TGTAAGGAAC 


TATTAGAGAT 


AGAAGAAGTA 


TTAGGCAGTA 


AACTAACATT 


600 


TGAACTATAA 


ATGAACAAAT 


TTTAATTTCG 


GATGAGATAG 


ATATTGATAG 


TAGATATTCT 


660 


AGAACTAAAG 


G TT ACT ATT C 


GTTATTTTAT 


AATGAAGAGT 


ATAATAAAAT 


ACAGAATAAA 


720 


ACACTATTAG 


it* i\ mint x 

TATTAGGAGC 


AGGAGTCTTA 


GGATCTTATA 


TATCTCTAAC 


TCTAACTATG 


780 


TATGGAGTGA 


WAAAwTTAT 


TGTCGCTGAT 


TACGATATAA 


TAGAACCATC 


AAATTTAAAT 


840 


AGGCAAATTC 


TTTATACAGA 


GTCGGATGTT 


GGTAAGGAGA 


AGATTAATGT 


TCTTTCTGAA 


900 


AAAATACACA 


MV* IA1 AA 1 1L 


AUAlul 1v»AVj 


GTAGTACCTA 


TTTCTATTAA 


AGTTTCTTCA 


960 


GTAGAAGAAT 




1 \» 1 IVAtWUAA 


TATCGGACTA 


TAGATTTTAT 


CGTTAAAGCA 


1020 


ATTGATACGC 


CCATTGATAT 


TATAAAAATT 


GTCAATCAAT 


TTGCTGTATC 


GCATAAGATA 


1080 


TCCTACATAT 


CACGAGGGTT 


TAATGGATGC 


TATCTTATTA 


TTGATAATAT 


AT AT ATC CCT 


1140 


ACCATCGGTT 


CTTGCTTTGG 


TTGTCGGAAT 


ATAAACAAAG 


ATATAAATAA 


GTACACTTTA 


1200 


TCTGATAAGA 


CAAACTGGCC 


GACTACACCA 


GAGATGCCTG 


CTATTTTGGG 


AGGGATAATG 


1260 


ACTAATTTAA 


TAATTAAAAT 


ATTTCTGGGA 


TGTTATAATC 


AAATCCTAAT 


AGATAACGCT 


1320 


TACGTTTATA 


ATATGAGAAA 


TCATCCTCTA 


AGTCAAGAAA AATATGTTCT 


GGAAAACGGA 


1380 


GAATGTCCAA 


TTTGTAAAAA 


AATAATAAAG 


TGAAAGATAA 


CAATATTAGA 


GCGAAAACAT 


1440 


TTATTCGTTC 




TG CT T ATT AT 


CAGGAGGAGT 


AGCTTTTTTA 


TCTGCTATTG 


1500 


GCCAGTTCAC 


TCTTATAGAA 


ACACAATTAA 


TAGTATTGTT 


CTTGGGTATT 


ATTTTTGCTA 


1S60 


TATATTATGC 


TTACTACAAT 


AAAAATATTC 


AAACATCATT 


GGAAAATATA 


GTATGGCTTT 


1620 



RNROnr.lDt <WO 9ft1R931A2 I > 



WO 98/18931 



PCT/US97/19588 



768 

TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAC AACATTTATT CACTTACCAG 1680 

TGGATATTTT TATTGGTATC ATAATATTTT TAATCCTGTG GATATTTATT ATGTTAGGTA 1740 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGT TT AAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTCCTT GAGAATGTAT I860 

C GTATT ATC A ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATCTAGAATA 1980 

TAAACGACGT GCAATGAGTA TGATTGAAGT TACCCATTTA TCAAAAAGTT TTGGTGATAA 2040 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

A CC ATCTGGT TCTCGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAACCG GTGAATTAAA 2220 

GAGAATTGGA TTGGTTAGCG ATACAACTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2340 

GCGAGTAGGA TTATATGATA CTCGCAACAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 2400 

CCAACGAATG CTTTTACCAC GACCTCTTAT CAACAACCCC GCTGTACTCT TTCTCGATGA 2460 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2520 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATCAT ATCAATCAAG CAACTCTTTT 2580 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAC CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATCCCAATCA 2700 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 2760 

TTCAATTCAT TCATGTGAGC CTACTTTAGA AGATATTTTT ATCACATTAA CAGGAGGAAA 2820 

CCTAAATCCT TAAACGGTTT CTGGCTTTGG TATGCTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTGCTTT CACATATTTT TATAAATATC 2940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTTTTTC TTTTTCTTTC GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTCAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGCGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATT AT ATT AC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TTATTGATAG GTTTAACCCC GAAGAGCCAA GTAGTAGCTC 3 300 

AGGTTATCAC TCTTCCTGCT ATGATTTTAG TT G CTTTC1T ACCGATGCTA TCTGGTTTGG 3360 

ATAAGACAGT TGCGAAGATA ACAGATTATA GTTTTATGGG ACTATTTACT AAGTTTTTCA 3420 
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CAAAATGGGA GGAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 3480 

GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG GAAAAAGAAA ATTTCTTAAT 3540 

TGAGTTATTT TAATGATTAT AAACACAAGT GGCAAGGAAA AAATGAACTG ATCTTTTTGA 3600 

CAGCAATTCT ACAGAATACT CTTATTGCTA TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

GCAATAAAAA ATGCAACATT TTTAAATCTC CTCTATAACT GCTTCAAAAA GTGCTTCAAA 3780 

ACCTCTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 3840 

TAAGGATTTT TTCGCTAATT GTTCCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3900 

ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAC GCATCCGCAA AATAAAACAG 3960 

AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTCAGGTCT TCAATAGCAG TCAACATGGA 4080 

TTTAGCTCTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4140 

AGCAGTCCAG AGAGGCAGCT AAGGTTACAC GGTGAAAGGG TGGAGACTAC CCATTTTTCC 4200 

TGGAACCTTG CTGTTGGCAG GTTCCTTTTT TCCTGGCTTC TGTTGGCCAG ACTCTCTCAC 4260 

TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTGC TCTTGATTTT 4320 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAAGA AAGCCTTTAT 4380 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGCGCCTG AGATTCTCTC CTACTTAAAA 4440 

GGTTTGGGTC ATAGTGTCTT TTTGCATCTC AAACTTCATG ACATTCCTAA TACACTCAAG 4S00 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATGTCCA TGCGGCTGGT 4S60 

GGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGCGA GTCAAGCCAA ATTGATCGCT 4620 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CACATGCACG AGTTTCAAAA TATCCAAACC 4680 

ACTCTGCAAG AGTCTCTGAT TCACTATGCC AAGAAGACAG CTCAACCTCC CTTCCATGCT 4740 

GTTGTTTGCT CGGCTCACGA AGTACAAGTC ATCAAGCAGG CTACCAATCC ACATTTTATC 4 800 

TGTCTGACAC CAGGGATTCG TCCACCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4860 

ACACCTGCTG ATGCCTATCA AATCGGCAGT GACTATATCG TAGTGGGACG TCCCATTACC 4 920 

CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4 980 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTGC TAAAGATATC 5040 

GCTAGCCACC TCTTGAAAAT CCAACCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

CCATCTGCTA TCAAGTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA 5160 
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ACTCGTACCC TAATTGAAAA TGCTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTCATAAG 52 80 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGCTC GCGTAGCTCA AGGTCAAAAA ATGCTACTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCCAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGCT 5580 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACG C TTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACCACGCTA CCCACTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 5717 



(2) INFORMATION FOR SEQ ID NO: 103: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 103: 



CCTGGACTTT 


CTAAAATGAA 


ATCTTGCGAC 


CTGGATCAAG 


CCCTTCATCA 


GCATTTTTCA 


60 


GAAGAAGAAT 


TAGCTGGTCA 


CTTTCATGTC 


CTTCTATGGA 


CTTTTTTTAC 


AATGGCATTC 


120 


CTATCACACC 


CAATACCTAT 


CTAAGCGCCT 


CGTTCGTAAA 


CTTTATTGCA 




180 


TAAATTTCCT 


AATTGTTGAA 


CCAATTGCCC 


GTTTTATACT 


AAGTTCTTTT 


CAGAAACCAT 


240 


TTACTGGGGA 


AGAAGTTGAA 


GATTTTCAAG 


ATGATGATGA 


AATCCCAACT 


ATTATCTAAG 


300 


CCAGTTCTGT 


AAACTACTAA 


TATTTGAAAT 


CCACTTCCTT 


TTAGGGTGCA 


ATGGTTATAA 


360 


ATGAATTTTT 


GAGAGGATCA 


GAATGAAAAA 


ACTAGCAACC 


CTTCTTTTAC 


TGTCTACTtST 


420 


AGCCCTAGCT 


GGGTCTAGCA 


GCCTCCAACG 


CAGTCTGCGT 


GGTGATGATT 


ATGTTGATTC 


480 


CAGTCTTGCT 


GCTGAAGAAA 


GTTCCAAAGT 


AGCTCCCCAA 


TCTGCCAAGG 


AGTTAAACGA 


540 


TGCTTTAACA 


AACGAAAACG 


CCAATTTCCC 


ACAACIATCT 


AAGCAAGTTG 


CTGAAGATGA 


600 


AGCCGAAGTG 


ATTTTCCACA 


CAAGCCAAGG 


TGATATTCGC 


ATTAAACTCT 


TCCCTAAACT 


660 


CGCTCCTCTA 


GCGGTTGAAA 


ATTTCCTCAC 


TCACGCCAAA 


GAAGGCTACT 


ATAACGGTAT 


720 


TACCTTCCAC 


CGTGTCATCG 


ATGGCTTTAT 


CGTCCAAACT 


GGAGATCCAA 


AAGGGGACCG 


780 


TACAGGTGGT 


CAGTCCATCT 


GGCATGACAA 


GGATAAGACT 


AAAGACAAAG 


GAACTGGTTT 


840 
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CAAGAACGAG 


ATTACTCCTT 


ATTTCTATAA CATCCGTGGT 


GCTCTTGCTA 


•tywtajvtac 


900 


TGGTCAACCA 


AACACCAATG 


GCAGCCAGTT 


CTTCATCAAC 


w*AAAv7TwTA 




960 


TTCTAAACTC 


CCTACAAGCA 


ACTATCCACA GAAAATTATT 




_. _./»* _. afiftTRC 
AMmvwM iuw 


1020 


AAACCCTAGT 


CTAGATGGCA 




LI 1 I \m\* * wnn 


GTGATTGACG 


GTATGGA1VI 




TGTGGATAAG 


ATTGCTAAGG 


clcaaaaauv 


•IV" & A. & A. AfZAT? 


IV fv r^f*** « fv m 

AAGwwAACTA 


(TTGWIAIwAt. 


1140 


AATCGACAGC 


ATCGAAGTGG 


TGAAAGACTA 




TL.1 lAAAAAC 


LAAAAAAAIA 


120.0 


CAGTATCCAC 


ATTCGGTACT 


GTA1 i -t-TTT 


i» & f^tV^lV* _TT 


CTTAAGTTAA 


MP % ffMt _l K m _t 

ATTATTAAAA 




TCCCATATTT 


GGTCTATCCA 


GCCTTCATAA 


■m. _, j WVH^^^^*!* 


CGTGGCAGAC 


CATAAGGATA 




GATCCCCTAT 


ATTCTTTGAG 


AGCGCGTTTG 


AGCTLnlA-L I 


TTGCATCCAC 


m i_%_i^_p^ k ft ■ A^^^ 

ATCCAAATGG 


1 ISA 


TTGGTCGGCT 


CGTCCAGCAC 


TAAAACGTTG 


TTTTCACGAT 


TCATCAAGAG 


ACAGAAACGA 


1440 


ACCTTGGCTT 


GCTCTCCCCC 


TGATAATACT 


TGAATCTuGW 


TTTCAATATG 


TTTGC/rTGTC 


1500 


AAACCACAAC 


GGGCAAGGGC 


TGCACGGACT 


TCTGtTTTGAT 


TAAGGGCAGG 


AAAGGCATTC 


1S60 


CACACAGCTT 


CAAGAGGAGT 


TTGGCGATTA 


CCGCCTTCTA 


CTTCCTGCTC 


AAAATAACCA 


1620 


ACTTCTAAAT 


AATCTCCACG 


CTCCACTTCC 


CCAGCGAilG 


GCGAGATAAT 


GCCCAAGAGA 


1680 


CTCTTCAAGA 


GAGTTCTTTT 


TCCAATACCA 


TTAGCACCAA 


TAATCCCAAC 


CiTITGATTG 


1740 


CGTTCGAAGG 


TAAGATTTAA 


AGGCTTAGTA 


AwAwnLwu 1 


CGTAACCAAT 


TTGCAAGTTC 


1 AAA 


TTGGCTTGGA 


AGATAAAGCG 


CCCTGGTGTA 


CGAGCTGGTT 


TGAAATCAAA 


CGATGCTTTT 


1DOU 


GGTTTCTCAC 


TT^GGAGTTC 


GATAATAlvL 






ACGAGACATA 


1920 


GCCATATTAC 


GAGTTGCAAC 


ACGGGCTTTA 


TTACGAGCCA 


CAAAGTCCTT 


GAGO!V.^«CA 


179U 


ATCTCTTTCT 


GCTGGCGTTC 


GTAGGCTGCC 


TCTAGCTCAG 


ATTTCTTCAT 


AGCATAAACT 




TCTTGGAACT 


GGTAGTACTC 


ACCAGAGTAA 


CCCGTCAGCT 


GTTGATTTTC 


WAWATGATAw 


7 1 Oft 


ACAATATTAA 


TAACGTCATT 


GAGGAATGGA 


ATATCGTCCG 


AAATGAGAAC 


AAAGGCATTC 


2160 


TCATAGTTTT 


GGAGATAGCG 


CTTGAGCCAA 


TCAATATGCT 


CAGCATCCAA 


GTAGTTGGTC 


2220 


GGCTCGTCCA 


ACAGCAAGAT 


ATCAGGCTTT 


TCAAGGAGAA 


GTTTTCCCAA 


AAGCACCTTG 


2280 


GTTCTTTGCC 


CACCTGACAA 


AGAAGTTACA 


TCCGTATCCA 


TGCCAAAGTC 


CATAACACCA 


2340 




* i'- v \< A 












AGAGCACGCG 


CTACTTCGTC 


AATCTTAGCA 


TCCAAGGTAT 


AGAAATCACG 


ACTCTCCAGA 


2400 


CGGTCTTGAA 


GTTCTCCTAC 


TTCTTCCATG 


AGAGCATCAA 


CATCCGCGCC 


GTCTTCAGCC 


2460 


ATTTTCATAT 


ACAGCTCATT 


GATACGAGCT 


TCAGCTTTGA 


AAACCTCATC 


AAAAGCCGTA 


2520 


CGGAGAACAT 


CACGCACCGA 


CTGTCTTTCA 


GCAAGGACAG 


AGTGCTGATC 


CAAGTAACCA 


2580 
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GCCCTCACAT ATTTGGACCA CTCAACCTTT CCTtcItCTG GCAGCATTTT ACCACTCACG 
ATACTCATAA AGCTTCATTT TCCTTCACCA TTGGCACCGA CCACGCCCAT ATGTTCTCCC 
TTGAGGAGAC GGAACGACAC ATCTTCAAAA ATTCCACCGT CACCAAAACC CTGACTCAGA 
TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 
AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGCTCCAC AAATCATCAA TCTCAAAGAC 
CCGATTCAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAACACTCAC 
AAGAAAACTA AAAAGAAGCA CCTTTTTTGT TTTCCCCAAA TTTGGAAATA GATAAACGAG 
TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTCTAAAA AAATCCAACA 
TAATTGTCCA ATCTCACTCA CTTCCCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 
AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 
TTGAGGACTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 
AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCC CTGCGACTGC CTTCTGGCGC 
TCACCTTTCA TTCTCTTACA GCCCAATTGT CCACAAGCTG CGTCAATATC TCTACCATGC 
TCTTGACGAA CCACACACTT GACCCCTTTT TTCTTAAGCG TATCATACAA AGCCAACACG 
CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 
TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 
ACACCCf CGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG CTTTCTTGTC 
TCAATGTAGT ATTCAATACC AGCAAAGAGT TTTTCAATCG GAAAGGCACG CTTAATCTTC 
ATGATACTTG AACGAACTTC ATTGTTAGGT GCCTGAACAG ACACCGCAAC ATTGACCTGA 
ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTCATC 
TGACCAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 
TTCTTGTAAT TATCAAAGCG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 
TCCTCACCAC GCTCATCAAA GTATTTCTGA ACCACCATGA TTTGCCCTAC GATTTCACCG 
TTATTGAGCT CACGTTGCTT CTTAATCAAA CCAGACGCAC AGAAGGTACA AC CG AT ATT A 
CAGCCGACCT GAGTCGTCAC ACAGACAGAT AAACCATACT GTTGACGCAT CAGTACAGTC 
TCAATTAACA TACCGTCGGG CAATTCAAAO AGATATTTGA CTCTACCATC AGCAGACTCT 
TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AG C AATCAAA 
TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC GTTTACGGTA GAGCCATTCC 
CACATTTGAT CTGCACGGAA TTTCTTTTCT CCCTCCTCCA ATACCCATTC CTGCATGGTT 
TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 



,2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4360 



RNsnonirv <-wo qmwwia2 i > 



WO 98/18931 



PCT/US97/19S88 



773 

GACCAATCAC AAAATGACGT TGTCCCTTCT C G TCTTTCTG ACGACGTCTA TTTTTCTTAT 4440 

CTCCATTCGA CTTTCGTTTA GTTTGACTCG G TTTCTTTCC TTTTCTAGAA CCT GT 'IT C T T 4500 

CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4620 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTACGCGA AGCTCTTCGT 4680 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CGCTGATTAA 4740 

AAGTCGTCAC ATCAAAGCCA TCCTTCATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4800 

TTTCCACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4860 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AATACTTTAT 4920 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATCAAAAAAG TCTCACGATT 4980 

TTGTCACACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA S040 

TTATTTACCA CCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTC AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA C G TTGCGGTA TTTCACGTAC TAAGCATGTT CCCAAACGTC 5160 

CAACCCCAAG ATTGGTTTTT TACCTTCTGA GATTGCTGTG TCTTGGTTTG CTGTTGAAGT 5220 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 52 80 

TCTTGCTGCT CCAGTGAACG CTCCTTCGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5340 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTCT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTCTT GATAACTCCT TGACGGATAT CAGCTGGGAT 54 60 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 5520 

AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 5558 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCC TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC CTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATCCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
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TTACTCTTTC CTCCTTTATG GGCTATTTAA ACGGCAAGAG CAGTTTGATG ATTTTTCATA 240 

AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTCGTGTAGA GGCTATTATG 300 

TACATACGGT AGGCCGTAAT CAGAAAGTGA TAGCTGAATA TATTCAGAAT CAATTACAAG 360 

AAGACAGAGT AGCAGACCAC CTCACGTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 420 

TAAATAAGAG GAAGTAACTA AGCTGCTTTA GCACCTGCTC GGGAAAGTGG TGCGCGAGGA 480 

AGCTATTTCG GTGGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 540 

CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 

TATAAAGGAT GCTAAAATTC CTGTTCTCCC ATTTGGACAA TATCCTAAAT AGTTACAATA 660 

TATGCTCTAT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 720 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTC G TTT AC AAATTCACCT 780 

TATCCCAAAA TTGAACGGCG TCAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTCTCA 840 

GATTTCTATG ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 900 

CCCTTTAGAA AATAATCTCC TCAATTTCAT AGAGTTTGAA AATGAGTGAG ATTTTTTATT 960 

TGCCCTTTGA CAACTGAATA GCCTAAAATG CTACTTTCCT C A TTT G TGGA GCAAATTTGA 1020 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCG ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTGCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TCATGACTTC ATCAGTCATG CCACCTTGAA TGTGTGAGTT 1200 

TCTTAGATAA ACGCAATTAA TCCTCAAAAC G TTC C CCGAA CCTTTTCAGT TCTACAGACG 1260 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTGGAA CGAAAGGAAT 1320 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 1380 

AACAATTGAG CGATACCCGT TTCAAGATCC TTCTAGCTGT TCACCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTCTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTACACGAT CTCCTTATGG TAACTATTCA ATACATGCGA GAATAGAGCA 1560 

CTTATGAACA AATTGCGGCT GATTTTGCCA TTCACCAAAG CAACTTAATC CGTCGGAGTC 1620 

AATGGGTTGA AGCAACTCTT ATTCAAAATG G TTTT ACGAT TTCAAATTCT GCCTTAATTC 1680 

TGTAAAAACA GTAAAATTCC AAGCATTCTA ACGTAAGAGT TITTTTCT T T CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT TGTCTCTATC 1800 

CTAGTAGAGT GGTGATACTA TGAAGATTAG TAAGAGGCAC TTATTAAATT ATTCCATCTT I860 

GATTCCCTAC TTGCTTTTAT CTATTTTGGG CTTGATTGTC CTCTATTCGA CCACCAGTGC 1920 

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGCTT CGAAACCAAG GAATCTTTTG 1980 
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CATTGTTAGT 


TTGATACTGA 


TTGCCTTAAT 


TTATAAATTC 


AGACTAGATT 


TTTTGAGAAA 


2040 


TGAGCGACTA 


ATCATTTTAG 


TTATATTAAT 


AGAAATGCTT 


TTATTGTTCT 


TGGCTCGTTT 


2100 


TATTGGTATT 


TCCGTAAACC 


GGGCATACGC 


TTGGATTTCG 


CTTCCAGGAA 


TAACTATTCA 


2160 


GCCAGCTGAG 


TACTTAAAAA 


TCATTATTAT 


TTGGTATTTA 


GCTCACCGAT 


TCTCCAAACA 


2220 


GCAAGAAGAA 


ATAGCTACTT 


ATGATTTTCA 


AGTTTTGACT 


CAAAATCAAT 


GGCTTCCCCG 


2280 


TGCTTTTAAT 


GATTGGCGAT 


TCGTTCTCCT 


AGTTCTGATT 


GGAAGTTTGG 


CAATTTTCCC 


2340 


TGATTTAGGA 


AATGCGACTA 


TTTTAGTCTT 


GGTTTCCTTG 


ATTATGTATA 


CAGTTAGTGG 


2400 


AATCGCTTAT 


CGCTGGTTTT 


CAACCATTCT 


GGCGCTCGTA 


TCTGCCGCTT 


CTGTCTTTGT 


2460 


CTTGACCACT 


ATCAGCCTAA 


TCGGTGTTGA 


GACCTTTTCA 


AAAATTCCAG 


TATTCGGCTA 


2520 


TGTAGCCAAG 


CGCTTTAGTG 


CCTTTTTTAA 


TCCTTTTGCC 


GATCGTGCTG 


ATGCAGGTCA 


2580 


CCAGTTAGCT 


AATTCTTATT 


TTGCCATGGT 


CAATGGCGGT 


TGGTTTGGTC 


TAGGTCTTGG 


2640 


AAACTCGATT 


GAAAAACGAG 


GTTATTTGCC 


AGAAGCTCAT 


ACAGACTTTG 


TCTTTTCTAT 


2700 


CGTGATTGAA 


GAATTTGGCT 


TTCTTGGTGC 


CAGTCTTATT 


TTAGCTCTCT 


TGTTTTTCAT 


2760 


GATTTTGCGG 


ATTATCTTGG 


TCGGTATCCG 


AGCGGAGAAT 


CCTTTCAATG 


CCATGGTTGC 


2820 


ACTCGGTGTC 


GGAGGGATGA 


TGTTGGTTCA 


GGTATTTGTC 


AATATCCGAC 


GGATTTCGGG 


2880 


CTTGATTCCA 


TCTACAGCAG 


TGACTTTCCC 


CTTCTTATCC 


CAGGGTGGAA 


ATAGTCTTCT 


2940 


AGTCTTATCA 


GTGGCACTAG 




AAATATTGAT 


GCCAGTGAAA 


AACGCGCTAA 


3000 


ATTGTACCGA 


GAATTCGAAA 


ATCAACCAAT 


GAACCTTCTG 


TTGAAGTAGG 


ATAAAGAAAG 


3060 


CATAGTTTAT 


GTCTCTTCAA 


AAATTAGAAA 


ATTATACTAA 


TAAAACTGTT 


GTGCAAGAAC 


3120 


AAGTCTTGAT 


TCTAACAGAA 


TTACTGCAAG 


ATATTACTAA 


AAATATGCTT 


GCCCCAGAGA 


3180 


CCTTTGAAAA 


AATAATACAG 


TTGAAAGAAT 


TATCAACGCA 


GGAAGATTAT 


CAAGGTCTAA 


3240 


ACCGTCTAGT 


GACTAGCTTA 


TCAAATGATG 


AAATGGTCTA 


TATTTCACGC 


TATTTCTCTA 


3 300 


TCTTGCCTCT 


TTTGATTAAT 


ATTTCAGAGG 


ATGTGGATTT 


AGCTTATGAA 


ATCAATCATC 


3360 


AAAATAATAT 


TGATCAGGAC 


TATTTAGGTA 


AATTATCTAC 


AAC G ATT AAA 


TTGCTAGCAG 


3420 


AAAAGGAAAA 


TGCCCTTGAG 


ATCCT AGAAC ^ ACTTGAATCT 


TGTCCCTGTT 


TTGACAGCCC 


3480 


ATCCAACACA 


AGTGCAACGC 


AAAAGTATGT 


TGGATTTAAC 


AAATCATATT 


CATAGTCTTT 


3540 


TCCCTAAATA 


CCGTGATCTT 


AAGTTGGGGT 


TGATCAATAA 


AGATAAATGG 


TACAATGATT 


3600 


TGCGTCGTTA 


CATCGAAATT 


ATCATGCAGA 


CAGACATGAT 


TCGTGAGAAA 


AAATTAAAAG 


3660 


TGACTAACGA 


AATCACGAAT 


GCTATGGAAT 


ATTATAACAG 


CTCCTTTTTG 


AAAGCTGTAC 


3720 
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CTCATTTGAC 


GACGGACTAT 


AAGCGCTTAG 


CGCAAGCGCA 


TGGTCTGAAT 


TTAAAACAGC 


3780 


CTAAACCAAT 


CACCATGGGT 


ATGTGGATAG 


GTGGTGACCG 


TGATGGAAAT 


CCATTTGTTA 


3840 


CAGCAAAGAC 


CTTGAAGCAG 


TCTGCACTCA 


CTCAGTGTGA 


AGTCATCATG 


AACTACTATG 


3900 


ATAAAAAGAT 


TTACCAACTT 


TATCGTGAAT 


TTTCTCTTTC 


AACTAGCATT 


GTCAACGTCA 


3960 


CCAAGCAAGT 


CAGAGAAATG 


GCTCGTCAAT 


CCAAGGATAA 


CTCGATTTAC 


CGCGAAAAAG 


4020 


AG CTTTACCC 


TCGTCCCTTG 


TTTGATATTC 


AATCAAAAAT 


TCAGGCAACT 


AAAACCTATC 


4080 


TGATTGAGGA 


TGAAGAAGTT 


GGGACTCGTT 


ATGAAACCGC 


CAATCATTTC 


TACAAGGATT 


4140 


TGATTGCCAT 


TCGAGATTCT 


CTACTAGAAA 


ATAAGGGCGA 


CTCCTTGATT 


TCAGCTGATT 


4200 


TTGTGGAATT 


ATTCCAGGCA 


GTAGAGATAT 


TTGGTTTTTA 


CTTAGCATCA 


ATTGATATGC 


4260 


GACAAGACTC 


TAGCGTCTAT 


GAAGCCTGTG 


TGGCAGAACT 


CTTGAAATCA 


GCAGGAATTC 


4320 


ATTCTCGTTA 


TAGCGAGTTG 


AGCGAACAAG 


AAAAGTGTGA 


CCTTCTCTTG 


AAAGAATTAG 


4380 


AAGAAGATCC 


CCGAATTCTT 


TCTGCCACTC 


ACGCAGAAAA 


ATCAGAATTA 


TTAGCAAAAG 


4440 


AATTAGCTAT 


TTTTAAGACG 


GCTCGTGTTT 


TGAAAGATAA 


GTTGGGAGAT 


GATGTCATCC 


4500 


CTCAGACCAT 


CATTTCACAT 


GCAACCAGCC 


TTTCTGATAT 


GCTAGAATTA 


GCTATTCTGT 


4560 


TAAAAGAAGT 


AGGACTGGTG 


GATACGGAAA 


GGGCGCGTGT 


TCAGATTGTT 


CCCCTTTTTG 


4620 


AAACAATTGA 


AG A CTTGG AT 


CATTCAGAGG 


AAACAATGAG 


AAAATATCTT 


TCTCTT AGCC 


4680 


TTGCCAAAAA 


ATGGATTGAC 


TCACGAAATA 


ACTACCAAGA 


AATCATGCTT 


GGCTACTCTG 


4740 


ACAGTAATAA 


AGATGGCGGT 


TACTTGTCAT 


CATCTTGGAC 


CCTCTACAAG 


GCTCAACAAC 


4800 


AATTGACTGC 


TATTGGAGAT 


GAATTTGGCG 


TTAAGGTTAC 


CTTCTTCCAT 


GGTCGTGGTG 


4860 


GTACTGTCGG 


TCGTGGTGGT 


GGGCCAACCT 


ATGAAGCCAT 


TACATCTCAA 


CCGCTCAACT 


4920 


CTATCAAGGA 


TCCTATCCCC 


TTGACGGAGC 


AGGCTGAAGT 


AATTGGGAAT 


AAATACGGTA 


4980 


ACAAAGACGC 


CGCTTACTAT 


AACCTTGAAA 


TGCTAGTATC 


GGCAGCTATT 


AACCGTATGA 


5040 


TTACTCAGAA 


GAAGAGCGAT 


ACCAATACCC 


CAAATCGTTA 


TGAAACCATT 


ATGGATCAAG 


5100 


TAGTGGACCG 


TAGTTACCAT 


ATCTACCGTG 


ATTTGGTCTT 


TGGTAATCAC 


CATTTCTATG 


5160 


ATTATTTCTT 


CGAGTCAAGT 


CCAATCAAGG 


CTATTTCAAG 


TTTTAATATT 


GGTTCTCGTC 


5220 


CAGCCGCTCG 


TAAGACTATT 


ACTGAAATCG 


GTGGTTTGCG 


TGCCATCCCT 


TGGGTATTCT 


5280 


CATGGTCACA 


GAGTCGTGTT 


ATGTTCCCTG 


GATGGTACGG 


GGTTGGTTCA 


AGCTTCAAGG 


5340 


AATTTATCAA 


TAAAAATCCA 


GAGAATATTG 


CTATCTTACG 


AGATATGTAC 


CAAAATTGGC 


5400 


CTTTCTTCCA 


ATCGCTTCTT 


TCAAATGTTG 


ATATGGTTTT 


GTCAAAATCA 


AATATGAATA 


5460 


TTGCTTTTGA 


ATATGCTAAA 


CTTTGTGAAG 


ACGAGCAAGT 


TAAGGCCATC 


TATGAGACTA 


5520 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGCC TATTGAAGGA CATGACGAAC 5580 

TCTTAGCTGA CAATCCATAT CTAAAACCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 5640 

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTCGACAA TTGTCCAGTG 5700 

ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTC CGTAATTCAG S760 

GTTGATAATT TTCAAGACTG AATCCTAAAA CTGAATATCA AAAAAATTCT AATAGACTAT 5820 

TGACAAGTAG TTTAAAAATG AT AT AATTT A ACCATTCAGA AAAGTAATCA TACAAACTTT 5880 

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTCAATG 594 0 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT CCTACCCCGC ATCGACGTCC 6060 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TCATCGAGCT TAGTATGGAA AGAAAACGAT 6120 

GGCGTCCCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAC TCTTAGTAGC AGGAAGCTTG 6240 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 6300 

AAGATTCCTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6360 

CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 64 20 

TCAGAAGGTG ACCAAACTAA GGTTGCGACA ATGACTAAAC AATTGGTTGC AAATGGGAAT 64 80 

GACCTTGTGG TTGGTATCGC AACACCAGCA CCCCAAGCCT TGGCTAGTGC AACAAAAGAC 6540 

CTACCGGTTA TCATGGCCGC T ATT AC AG AC CCAATTGGTC CTAACTTGCT TAAACATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCACCTCA ACAACAAGTT 6660 

GAACTCATCA AGGCTCTCAC ACCGAATGTG AAAACAATCG CACCTCTTTA CTCAACTAGC 6720 

GAAGACAATT CAAAA 6735 
(2) INFORMATION FOR SEQ ID NO; 105: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CT AG AG CATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 60. 

CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
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CCTAGTTCCT GCTCAAAAGG ACTTGTCTTT TCAAGAACTC ATTGCTACTA AAAACACCAT , 180 

CACTGGTCCC CTCAACGGTA AAAACTATCC GACTGTTGCT GGTACAGGCG TAGATATCTT 240 

TGATGCGGAA GCTCACTTCC TTTCAAATAA AGTCATCGAA ATCCAAGCTG GTGATGAAAA 300 

GAAAGAACTG ACTGCTGAAA CAATCGTCAT CAACACTGGT GCTGTTTCAA ACGTCTTGCC 360 

AATCCCTGGA CTTGCTACAA GCAAAAACAT CTTTGACTCA ACAGGTATCC AAACCTT G GA 420 

CAAATTACCT GAAAAACTTG GAATCCTTGG TGGCGGAAAT ATCGCTCTTG AATTTGCCGG 480 

CCTTTACAAC AAACTTGCAA GCAAGGTCAC AGTCCTAGAT GCCTTGGATA CATTCCTACC 540 

TCGTGCAGAA CCTTCCATCG CAGCTCTTGC TAAACAATAC ATGGAACAAG ATGGCATTGA 600 

ATTGCTTCAA AATATCCATA CTACTGAAAT CAAAAACGAT CGTGACCAAG TGCTTGTCGT 660 

AACTGAAGAC GAAACTTACC CTTTCGACGC CCTTCTCTAC GCAACTGGAC GCAAACCAAA 720 

TGTAGAACCA CTTCAACTTG AAAATACACA TATTGAACTA ACTGAACGTG GTGCTATTAA 780 

AGTAGACAAA CACTGTCAAA CAAACCTTCC TCCTCTCTTT CCAGTTGGAC ATGTCAACGG 840 

TGGCCTTCAA TTTACTTACA TTTCACTTGA TGACTTCCCT G TT C TTTACA GCTACCTTGC 900 

TGGAGATGCC ACCTATACAC TTGAAGACCG TCTCAATGTG CCAAATACTA TGTTCATCAC 960 

ACCTGCACTT TCACAACTTG GTTTGACTGA AACCCAAGCA GCTGATTTGA AACTTCCATA 1020 

CGCTCTTAAG GAAATCCCCG TTGCAGCAAT CCCTCGTGGT CACCTAAATG GAGACCTTCG 1080 

CGGTGCCTTC AAAGCTGTTG TCAATACTGA AACAAAAGAA ATTCTTGGAC CAAGCATCTT 1140 

CTCAGAAGGT TCTCAAGAAA TCATCAACAT CATCACTCTT GCTATGGACA ACAAGATTCC 1200 

TTACACTTAC TTCACAAAAC AAATCTTCAC TCACCCAACC TTCCCTGAGA ACTTGAATGA 1260 

CTTGTTTGCG ATTTAAGTTG AGATTTAATC GTATCGAACA GCCCTCTTT G GGCTGTTTTT 1320 

ACTTCTGCGG AATCTCAAAT CTGTCTTTCT CCTCTTTTAT GATATAATAG AAACATCAAC 1380 

TTAAAAACTA CTTTGGGCCT TCTTGCTGGG CCTTCTTCCC ACTTCGTTTT AAGCCGTCTT 1440 

GGACGTGGAA GTACCCTCCC AGGCAAAGTC GCCCTTCAAT TTGATAAAGA TATTTTACAA 1500 

AACCTAGCTA AGAACTACGA GATTGTCGTT GTCACTGGAA CAAATGGAAA AACCCTGACA 1560 

ACTGCCCTCA CTGTCGGCAT TTTAAAAGAG G TTTATGGTC AAGTTCTAAC CAACCCAAGC 1620 

GGTGCCAACA TGATTACAGG GATTGCAACA ACCTTCCTAA CAGCCAAATC TTCTAAAACT 1680 

GGCAAAAATA TTGCCCTCCT CGAAATTGAC GAAGCCAGTC TATCTCGTAT CTGTGACTAT 1740 

ATCCAGCCTA GTCTTTTTGT CATTACTAAT ATCTTCCGTG ACCAGATGGA CC G TTTCGGT 1800 

CAAATCTATA CTACCTATAA CATGATATTG GATGCCATTC GGAAAGTTCC AACTGCTACT 1860 

GTTCTCCTTA ACGGAGACAG TCCACTTTTC TACAAGCCAA CTATTCCAAA CCCTATAGAG 1920 
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TATTTTGGTT 


TTGACTTGGA 


AAAGGGACCA 


GCCCAACTGG 


CTCACTACAA TACCGAAGGG 


1980 


ATTCTCTGTC 


CTCACTGCCA 


AGGCATCCTC 


AAATATGAGC 


ATAATACCTA 


TGCAAACTTG 


2040 


GGTGCCTATA 


TCTCTGAAGG 


TTGTGGATGT 


AAACGTCCTG 


ATCTCGACTA 


TCGTTTGACA 


2100 


AAACTGGTTG 


AGTTGACCAA 


CAATCGCTCT 


CGCTTTCTCA 


TAGACGGCCA 


AGAATACGGT 


2160 


ATCCAAATCG 


GCGGGCTCTA 


TAATATCTAT 


AACGCCCTAG 


CTGCTGTGGC 


CATCGCCCGT 


2220 


TTCCTAGGTG 


CCGATTCGCA 


ACTCATCAAA 


CAGGGATTTG 


A£AAGAGCCG 


TGCTGTCTTT 


a i QU 


GGACGCCAAG 


AAACCTTTCA 


TATCGGTGAC 


AAGGAATGTA 




GATTAAAAAT 




CCAGTCGGTG 


CAACCCAAGC 


TATCGAAATG 


ATCAAACTAG 




ATTTAGCCTA 


O Jt A A 

44UU 


TCTGTCCTCC 


TTAATGCCAA 


CTATGCAGAT 


GGAATTCACA 








GACTTTGAAC 


AAATCACTGA 


CATGGACATT 


CCTGAAATCA 






2S20 


TCTGAAATCG 


CTCGTCGCCT 


CCGAGTGACT 


GGCTATCCAG 


LlCAuAAAA 1 


f* Af**rR A i Af?G 


2580 


AGTAATCTGG 


AGCAAGTTCT 


CAAGACCATT 


GAGAATCAAG 


ALTuL nAUU\ 


TGCCTATATT 


2640 


CTCGCAACTT 


ATACTGCCAT 


GCTGGAATTT 


CGTGAACTGC 




TCAGATTGTT 


2700 


AGAAACGAGA 


TGAACTAATG 


GTTTATACTT 


CACTTTCCTC 


& & & & r- b.^C-TiC* 




2760 


ATCAGCTCAA 


CATTCCCCAC 


CTCTACGGAA 


ATCTCATGAA 


(p»p *• a. Ct~LflCj[~L 
1 nv. CnLuuVAt 




2820 


CATCCTCATG 


CTCAAGTATG 


TGGCTGAAAA 


ACTGCGAGCC 


LATu * \»At^.V» 




^ a oa 


TTCTCTCCAT 


GATGACTTTG 


ATGAAAATCA 


CTACGACATC 


GCCTTTTTCG 




^ a a a 
2940 


AGACTTTGAA 


CAAAGTATCA 


TTGCAGACGA 


CCTACCTGCT 


AAAAAAuAVin 




1 AAA 


CTACATCCAA 


AACGACGGTG 


TAGTTCTCGC 


TATCTGCGCT 


l i i w w/w\\* 


TATTGGGTCA 


n A£ A 


ATATTATGTT 


GAAGCTTCAG 


GAAAACGTAT 


CGAAGGGCTA 


GGGGTCATCG 


GACACTACAC 


t i on 


GCTCAACCAG 


ACCAATAACC 


GTTTTATCGG 


TGACATCAAG 


ATTCACAATG 


AAGATTTCGA 


linn 


TGAAACCTAC 


TATGGATTTG 


AAAATCACCA 


AGCTCGTACC 


TTCCTCTCTG 


ATGACCAAAA 


3240 


ACCGCTGGGA 


CAGGTTGTCT 


ATGGAAATGG 


AAACAACGAA 


GAAAAGGTCG 


GTGAAGGGGT 


3300 


TCATTATAAG 


AATGTCTTTG 


GTTCCTACTT 


CCACGCGCCT 


ATCCTCTCTC 


GTAATGCCAA 


3360 


TCTGGCTTAT 


CGCCTAGTTA 


CTACTGCCCT 


CAACAAGAAA 


TATGGTCAGG 


ACATCCAACT 


3420 


CCCTCCCTAT 


GAGGACATTC 


TCAGCCAAGA 


AATCGCTGAA 


GAGTACAGTG 


ACGTCAAAAG 


3480 


CAAGGCTGAC 


TTTTCTTAAA 


CAAAGGAAAA 


TGATATCAAA 


GAACTCCGTT 


ATCTTGTCGG 


3540 


AGTTTTTTGT 


CTTTTCTTTT 


ACCCTTCTCC 


CTTGCATTTT 


CTCTCATTTT 


TTGCCAAAAT 


3600 


AGAGGGGTAG 


AAAGAAGGTA 


GCATATGTCT 


AAATTACAAC 


AAATCCTAAC 


ATATCTTGAA 


3660 
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TCACAAAAAC 


TAGACCTCGC 


TGTCGTATCT 


GACCCCCTCA 


CAATCAATTA 


CCTCACTGGT 


3720 


TTTTACAGTC 


ATCCCCATGA 


ACGCCAAATG 


TTCCTCTTTG 


TCCTAGCAGA 


TCAGGAACCT 


3780 


CTCCTCTTPC 


TCCCAGCTCT 


TGAAGTAGAA 


CGTGCAAGTA 


GCACCGTTTC 


CTTCCCAGTA 


3840 


GTGCGCTATG 


TCGATTCTGA 


AAATCCATGG 


CAAAAAATCA 


AACATGCTCT 


TCCACAACTT 


3900 


GACTTCAAAC 


GTGTCGCTGT 


TGAGTTTGAC 


AATCTCATCT 


TGACCAAATA 


CCATGCTTTG 


3960 


AAAACAGTTT 


TTGAGACTGC 


TGAGTTTGAC 


AACCTCACTC 


CTCCTATCCA 


ACGCATGCGC 


4020 


CTCATCAAAT 


CAGCTGATGA 


AGTGCAAAAA 


ATGATGGTTG 


CAGGTCTTTA 


TGCTGACAAG 


4080 


GCTGTTCATG 


TTGGTTTTGA 


CAATATTTCT 


CTTGATAAGA 


CTGAGACAGA 


TATCATCGCA 


4140 


CAAATCGACT 


TTGCCATGAA 


ACGTGAAGCT 


TATGAAATGA 


GCTTTGATAC 


CATGGTCTTG 


4200 


ACTCGTGATA 


ATGCTGCGAA 


TCCACACGGC 


ATTCCAGCAG 


CTAATAAGGT 


TGAAAATGAT 


4260 


CCTCTTCTCC 


TCTTTGACCT 


GGCTGTTCTG 


GTCAATGGCT 


ATGCGTCAGA 


TATGACTCGT 


4320 


ACACTCCCTG 


TCGGCAAACC 


AGACCAATTC 


AAGAAAGATA 


TTTACAACTT 


GACTCTTGAA 


4360 


GCCCAACAAC 


CTGCTCTTGA 


CTTTATCAAG 


CCAGGTGTGA 


CTGCTCATGA 


AGTGGACCGC 


4440 


GCTCCCCGTG 


AGGTCATCGA 


AAAAGCTGGT 


TATGGTGAGT 


ACTTCAACCA 


CCGTCTCGGG 


4S0O 


CATGGTATCG 


GTATGGATGT 


CCATGAATTC 


CCATCTATCA 


TGGAAGGAAA 


CGACATGGTC 


4560 


ATCGAAGAAG 


GCATGTGCTT 


CTCTGTTGAA 


CCAGGTATCT 


ATATCCCTGG 


TAAAGTCGGT 


4620 


GTTCGTATTG 


AAGACTGCGC 


TGTTGTTACC 


AAGGATGGCT 


TCAACCTCTT 


TACAAGCACC 


4680 


AGCAAAGATT 


TGCTTTATTT 


TGATTAAACT 


ATATAGCCCC 


TATGCTTTCC 


TTTCAAAATA 


4740 


TCTAGGGGCT 


ATTTTATTGT 


CATTTTTCTG 


CTATTATGCT 


AAAGAAATTG 


GCTGCAATAA 


4800 


TCTAACCCTA 


AGTGTCTGGA 


ATGATAACGA 


GGGTGCTCTC 


CGCTTTTATC 


AAAGACAAGG 


4860 


GATGAAACCC 


CAAGAAACAA 


CAATGGAAAT 


CATAATTCAT 


TAAGAACTCA 


TCTATCAAAA 


4920 


GATGTTAGAA 


AAACTTCAAT 


TTCACTAGAA 


AATGAGGAAA 


ATCTCCCCAC 


AATAAAACGC 


4980 


ATACTATCAG 


GTATTGTGTA 


CTGACCCCAA 


ACAGTTAGAC 


AATTAATTTA 


TCCGAAGGAT 


5040 


TTAGTTCTGT 


ACTGCACAGG 


ACTAAGTCCT 


TTTAGTTTTA 


CCTTAATTCC 


TTTGTTGTTG 


5100 


TAGTAATCAA 


TATAGTCTAT 


AATGACTTGT 


TCCAATTGGT 


TAAGTGATTT 


AAATGTTTTC 


5160 


TCATAGCCA'jf 


AAAACA11TC 


GGATTTTAAa 


ATGCCAaAGA 


AAGAT1CCAT 


CATACCGTTG 


5220 


TCTTGGCTGT 


TTCCCTTGCG 


TGACATAGAT 


GCTTGAATTC 


CCTTATTCTC 


TAGGAACCGA 


5280 


TGATAAGAAT 


CGTGTTGGTA 


TTGCCAGCCT 


TGGTCACTAT 


GGAGAATCGT 


ATTCTCGTAG 


5340 


TGCTTCTCTT 


TGAATGCCTG 


TTCCAACATT 


GTTTGTACTT 


ATTCTAAATT 


AGGCGAACAA 


5400 


CAAAGATTAA 


AAGCAATAAT 


TTCGCTGTTA 


AAGCCATCTA 


AAACTGGTGA 


TAAGTAAAGC 


5460 
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TTTTGAGTAC TTGCTCGAAT GGCAAATTCA CTCACATCTG TCTACCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGACTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACCATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AACATGGATT GAATTTCAGT TTTAACCTCT 5760 

TGGTCTTTAT CT G TTTTGTC TAGCTGTTTC AACTGATACT AGTAGGTCCA ACGAGCTACT 5820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCACTT AAGAAGTATC GTACGACTTG CGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TACTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTCCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTCTAGAAAA 6240 

AAACACAAAA AGAAAGGAAA CTCACATCAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCGATGGACG TCATTTAACC CAGTATGCTG GTCTTATCTT 6 3 60 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAACT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC CGATTCAGAT ATCCwTCTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
12) INFORMATION FOR SEQ ID NO; 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG CTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 300 
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AGCATTTTCT ATGAATTTTT AAACCATCTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 360 

TAGGAGTTTA AAATCAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 420 

TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 480 

ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 540 

TCCATGCCTA GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGCA AAAAAGCAAG 600 

TATAAGCTCC TATCCTTACA AACTGATAAC AACTACTTTA ACAACTTTAT CAAGATAACC 660 

GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 720 

ATCGATATTT TCCCTATAGA TCG C TTTGAT CATCCTAAGG TCATTGATAC TTGTTATAAA 780 

CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAACGATAGC 840 

CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 900 

TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA CTCGT G AAAA TGGGCAATAT 960 

ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGGAACTCT TCCCAACTGG TACCTTTGAT 1020 

AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 1080 

ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACCCTTC T AC 1140 

AGTCATCAAT TTCACGCTTA TAAATTGGAG GATTAGGATC CAATATTTAG AAAAAAAAGA 1200 

AATTAAAGAA ATTCAACTAC CCCTCCTGGA CTATATTGAT GAGACTTCTA AGAAACATGA 1260 

TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGACCC ATCCGCCACA AAGCTATGAT 1320 

TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCCTGAG GATTATGAGC GTTTACTGAA 1380 

GATTATTGAA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTT C TT G 1440 

GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT CTTATAGAAG ACCATCTTAA 1500 

GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACACA 1560 

CTTGACCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 1620 

AAAATCACGA GCAGTTCACG CTCATAGCAA ACTAAAAGAT TTTCTTAGAT TATCTAGCTG 1680 

CTACGCTCTC CGATTTCTCA ATCCTCGCTA CTTTTACAAC AAAATTGATC AACTAGTCAA 1740 

AAATGCTGTA ACCAACACTC CTCAATATGA AGGACGACTT GGGATCCGTA AGGAAGGGAT 1800 

GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 1860 

TATGTTCCCT CTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATC CCGATTATAT 1920 

GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 1980 

AAACTCATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 2040 

TAGTAGAAAA TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAACG TTTTTGAAAA 2100 
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ACCTTGATAT 


GATGCGTTTT 


ATAATTTTAA 


AGACTTTTTT 


rTATACTACA 


TTCAAATAAG 


2160 


ATGCGAACAA 


ATLAAiTnOA 


AAATTCAAAT 
TGCTATATCT 


TAATTTATAG 


AAATATTTTA 
TTTAAATCCC 

All /%/»n ft wwv 




2220 
2280 


GT AC l\» 1 TCT 


AAATTCAGTC 


TATTTTTCTA 


A a w * w * 


AAGCTACGAC 


TTTCAAGTAC 


CTTAAGCATG 


GCATTAGCTG 


TATTTAGCCC 

A n A V> A AVlV.U\» 


TRTCAACAfiG 


2340 


CGCACCCCGT 


CTTCAATGGC 


TGAACGACGA 


ATTTGCTCAC 


W/\ 1W1 1 W\J A 1» 




2400 


1TTGTTCCTA 


CTGTGTTAAT 


GATAGCTTGA 


ATTCTTCCTT 






2460 


TCCTTATCGT 


CATCACCAAT 


CTTACCAACA GGTTGGGCTT 




Ai. 1 AuUAAAU 




AAGGCTG CTG 


TCCCTTCTGT 




rriTJiArCAA 


TG a 1 1 1 W»AA 


Al.l> ALwAVA. \. 




AAGTTCAAGG 


CTTCTTCTTT 






AGACGACAl l 


ACLAAAAwli 


9 AAA 


GCCAAGTGTA 


CATAAGAAGC 




1 lAi ftwAunw 


Wl~i 1 J IS-LAA 


it ^'wt it r+r* x piv* x 
ACTAGCATCA 


2 vug 


GAACCCATAA 


CTTCACCTGT 






GCAAGCTGTC 


TACCTTAGCT 


2 row 


AGTTTGGTAA 


AGGAGAAGAC 






GGGTGCTTTC 


AGGGTAAAGT 




CCATTTTGGT 


AGCCAAGTTC 


1 nnnL 1 1 




TGAGTTTCGT 


CGCTACTTGA 


2B0Q 


GCCATAGGAA 


TATTGGTTAC 




AATCCAACAG 


TACGGCTGGC 


ACGTGGATTG 


2940 


ACCTCAATAA 


CGTAGACTTT 


TTCATCCTTC 


ATAACAAACT 


GGATGTTCAT 


CATTCCAAW. 


i nnn 
J uuu 


CAGTGAAGAC 


CGATTGCTAA 


GCGTTTGGTG 


TAGTCTGCGA 


TGGTCTCCTG 


AACCTTTTGC 


i aha 


GACAAGGTTT 


GTGCTGGGTA 


AACAGCCATT 


GAGTCACCTG 


AGTGGACACC 


i^o x rr*ww^ 
AOi»AWl*TTWw 


J IaU 


ATATGCTCCA 


TGATACCAGG 


AATGAGTACA 


TTTTTACCAT 




A^V*A l/TIW! 
AlvAAVf i 


1 1 flfl 
J lOU 


CACTCTTGCC 


CAACGATATA 


ACAGTCGACA 


AGAACTGGGT 


GGTCTGGACT 


MVjV_1_ 1 1 AnUA 




GCAGTTCGCA 


TGTAAGAACG 


AAGGTCTTCT 


TCGTTTTCAA 






3300 


CCAACTACAT 


AACATGGOCG 


GACAAGAACT 


GGGAAGCCAA 


f f'lTfyflAfSf* 
I la 1 X w^wnul. 

GAATATCCAA 




3360 
3420 


GCTTCTTCTT 


CATTGGTAGC 


CGTTTCTCCT 


GGTGGCTGTG 


TTCTTTGAGA 


CCTTGCTCGA 


AGACGTCACG 


GTCTTCGGCA 


CGATCTAGGT 


CAGCAACCTG 


TGTACCAAGG 


3460 


ATGGTCACAC 


CTGCTTTTGC 


CAATGGCTCC 


GCAAGGTTGA 


TGGCTGTTTG 


ACCACCGAAC 


3S40 


TGAACCATAA 


CTCCCTTTGG 


TTGTTCCAAG 


TCAATGACGT 


TCATAACATC 


TTCGAATGTC 


3600 


AATGGCTCAA 


AGTAAAGCTT 


ATCTGATACA 


GAGAAGTCTG 


TTGAAACGGT 


CTCTGGGTTT 


3660 


CAGTTCATGA 


TGATAGCTTC 


ATAACCAGCT 


GCCTGGATAG 


CCTTAACAGA 


GTGAACGGTT 


3720 


GCGTAGTCAA 


ACTCAACCCC 


TTGACCGATA 


CGGATTGGAC 


CTGAACCTAG 


GACAAGTACA 


3780 


GATTCTTTAT 


CAGATCTGAT 


AGATTCATTT 


TCCCAACCAT 


AGGTTGAATA 


GAAATATGGC 


3840 
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CTTTCGGAGT 


CGAACTCTGC 


CGCACAAGTG 


TCTACCATCT 


TATAAACTGG 


AACAATCTTG 


3900 


TTTTCCAAGC 


GAAGTTGGCG 


AACTTTATCA 


TCAGTCGTTC 


CCCAGAGTTC 


AGCAATCTTA 


3960 


CGGTCTGAAA 


AACCATTAAG 


TTTGGCTGTT 


TTCAAAACTT 


CTAAATCTTG 


TGGATGAGCA 


4020 


CCCAATTCTT 


GCTCAATTTC 


AAAGATATGC 


AAGAGTTTAT 


CAAGATACAA 


CATATCAATT 


4060 


TTTGTAAGCT 


CTGCAATTTC 


TTCAGGTGTG 


TAGCCACGAC 


GAATCGCTTC 


TGATACGTAG 


4140 


AAGAGACCCT 


CATCTTGGGC 


TTTGACAACC 


TTTTCAATCA 


AGCCATCATC 


AGAAACTGCT 


4200 


GCAACTTCAG 


GTATTTCATT 


GTGGTGCACC 


CCAATTTCAA 


GGGAGCGGCA 


GGCCTTGAGA 


4260 


AGAGATTCCT 


CGATGTTACC 


ACCGATTGCC 


ATGACTTCTC 


CAGTCGCCTT 


CATTTGTGTA 


4320 


CCGAGACGGC 


GTTCACCCTT 


TTCAAACTTG 


TCAAATGGGA 


AACGTGGAAT 


CTTAGCAACT 


4380 


ACGTAGTCAA 


GGGCTGGTTC 


AAACATGGCA 


TAGGTTCAAC 


CTGTAACTGG 


GTTTATAACC 


4440 


TCATCCAACG 


TCAAACCTAC 


TGCAATCTTG 


GCAGCCAACT 


TAGCAATCGG 


ATATCCTGTC 


4500 


GCTTTAGAAG 


CAAGGGCTGA 


CGAACCTGAT 


ACACGAGGGT 


TTACTTCGAT 


AACATAATAC 


4 560 


TTGAAGCTGT 


TAGGATCAAG 


AG CTAGCTGA 


ACATTACATC 


CACCTTCAAT 


CTTGAGGCCA 


4620 


CGAATAATGC 


TCAAGCTCGC 


ATCACGAAGC 


ATTTGGTTTT 


CATAGTCTGA 


CATGGTTTGC 


4680 


GCAGGGGCAA 


ATACAATGGA 


ATCCCCTGTG 


TGAATCCCAA 


CTCGGTCAAA 


GTTTTCCATG 


4740 


TTACAAACAA 


CCAAGGCATT 


GTCAGCTGAC 


TCACGCATCA 


CTTCCTATTC 


AATTTCCTTG 


4800 


AAACCGGCAA 


TCGAACCCTC 


AATCAAACAT 


TGGGTAACAC 


CTGACAATTT 


CAAACCATTT 


4860 


TCAGTCATTT 


CACGCAATTC 


TTTCTCGTTG 


GCACACATAC 


CACCACCAGT 


ACCACCAAGG 


4920 


GTAAAGGCTG 


GACGAACGAT 


GACTGGGTAG 


CCAATTGTCG 


CTGCAAACGC 


AACTCCTTCT 


4980 


TCTACTGTGT 


TAACAATTTC 


AGATTCTCGA 


ATGGCTTCTT 


CAAGCTCTTC 


CATCAATTGT 


5040 


TTAAAGAGGT 


CACGGTCCTC 


CGCTTGGTCA 


ATGGCAGATA 


ATTTGGTACC 


CAGAAGTTCA 


5100 


ACGCCAAGCT 


CGTCTAGGAT 


ACCATTTTTA 


GATAATTCCA 


TGGCCATGTT 


GAGACCTGTC 


5160 


TGACCACCGA 


GTGTTGCTAG 


CAAGGCATCT 


CGACCTTCCT 


TACGAACAAT 


ACGTGTCACA 


5220 


AACTCAAGTG 


TAATCGGTTC 


AATGTAAACC 


TTGTCAGCAA 


TTTCCTTGTC 


CGTCATGATG 


5280 


GTTGCAGGAT 


TTGAGTTAAC 


CAAAACAACC 


TCATAACCTT 


CCTCTTTCAA 


CGACAAGCAA 


5340 


GCCTGACTCC 


CACCGTACTC 


AAACTCAGCA 


GCCTGACCAA 


TAATAATCGG 


ACCAGAACCA 


5400 


ATCACCATAA 


TTTTTTGAAT 


ATCAGTACCT 


TTAGGCATAT 


ATAAGATATT 


AAGGGTGTCA 


5460 


AGCGGACAAA 


GCTAAAATAG 


GAGTTATGAC 


GAAGAACTGT 


CAGTTCTAGG 


AATAACTATC 


5520 


TTTTTAGCAC 


CGTCCGTAGC 


CCGTATTCAG 


TTCAGCAAAT 


ACGGAGCACC 


CTTCTCCTTT 


5580 


CTATTCGTCG 


CCTCTCAGGG 


CGACATTAAA 


TAAGATACAA 


AGGACGAATA 


GAAAGCGATT 


5640 
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GAATTTTACG 


AAATCAAGGA 


AGGATTGACA 


ATCCAAGTTG 


GTTTCTCTAC 


ATTCTGAGCT 


5700 


TTCCGTCCGT 


GTTCAGTTAC 


ATAAATTCTC 


CCACGAGCTT 


TTACTCGTTC TTAGTTTGAT 


5760 


TGTTTAAAAA 


CTTCCATCAT 


CTCGATAAAC 


TCGTCAAATA 


GGTAGCTAGC 


GTCGTGTGGC 


S820 


CCAGGAGCTG 


CATCTGGGTG 


GTATTGAACA 


GAGAAAGCAG 


GTTGGTATCT 


GTGGCGCACA 


5680 


CCTTCCACTG 


ACTTGTCATT 


GATTTCTTCG 


TGGGTAATAA 


TCAAGTGCTC 


TGGCAAATCC 


5940 


TCGCGGCTGA 


CTGCATAACC 


ATGGTTCTGG 


CTGGTGAAGT 


CTACTCCTCC 


TGTTGCGATT 


6000 


TCACGTACCG 


CATGGTTGAA 


TCCACGGTGG 


CCAAACTTCA 


TCTTATAGGT 


CTTAGCCCCG 


6060 


TTTGCCATTG 


CAAAGAGTTC 


GTGTCCCATA 


CAAATACCAA 


AGATTGGAAT 


TTTTCCTTGT 


6120 


ACACCGCGAA 


TCATGTCGAG 


TGCTTGTGGA 


ACGTCTTCTG 


GGTTACCTGG 


ACCATTTGAC 


6180 


AACATAACTC 


CGTCAGGATT 


GAGATGGAGA 


ATTTCTTCAG 


CCGTTGTCGA 


ATAAGGAACA 


6240 


ACTGTCACGT 


TACAGTTGCG 


TTTAGAAAGT 


TCACGTAGGA 


TTGAGTGCTT 


GAGACCAAAG 


6300 


TCCACTAGCA 


CCACCCTCAA 


ACCAACTCCT 


GGAGCTGGAT 


AAGACCTTTT 


AGTAGAAACC 


6360 


TGTTTGATAT 


TGTCTGTCGG 


TAAAACTGTT 


GCTTCGAGCT 


CCTCCGTCAC 


ATGGTCCATA 


6420 


CTCTCCCCAA 


CATGGGTCAA 


CCTTGCACGC 


ATAGTACCAT 


GCTTACGGAT 


AATCTTGGTA 


6480 


AGAGCACGCG 


TATCAATTCC 


TGAAATCCCT 


GGAATTTTCT 


TGGCTTTCAA 


AAATTCATCC 


6540 


AAGCTCATTT 


GGTTGCGCCA 


GTTGCTAGCT 


CTACCCGCTT 


CTTCAAAAAC 


AACGACTCCC 


6600 


TTACAAGTTG 


GAATAATGGA 


TTCATAATCA 


TCACGATTAA 


TACCATAATT 


TCCTACCAAA 


6660 


GGATAAGTAA 


AGGTCAAGAT 


TTGTCCATTA 


TAAGACTGGT 


CTGT AATCG A . 


TTCTTGGTAG 


6720 


CCGGTCATCC 


CTGTATTAAA 


GACGATTTCG 


CCTGTTACAT 


CAATATCTGC 


TCCGAAGGCC 


6780 


TTCCCTTCAA 


AAACTGTGCC 


ATCTTCTAAT 


ACTACAATTC 


TTTTTCTCAT 


ATTTTCACCT 


6840 


CTCGTGGACG 


CTCACTGGCG 


TCTTTTAACG 


TCTTGTGTTT 


TAGTTGGCGT 


TTCTACTCGC 


6900 


TACTACGGAT 


TCTAAGATTG 


CCATTCGAAC 


AAAGACACCA 


TTGGTCATTT 


GTTGGACAAT 


6960 


CCGTGATTTT 


GGTCCTTCAA 


CCAAGTGGTC 


TGCTATTTCT 


ACATCACCAT 


TGATTGGACC 


7020 


TGGGTGCATG 


AGGATTGCTG 


TTTCTTTCAA 


ACGATCGTAA 


CCTTCTTGAG 


TCAACCCATG 


7080 


TTCGGCATGG 


TAGTCTTCTT 


TTGAAAATAC 


AGCTCCACTA 


TCATGGCCTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATGACAT 


CAACCTGATC 


AATGATTTCA 


TCAATGGTTA 


CAAACTGTCC 


7200 


ATACTCTGCA 


AACTCTTGAC 


TTCTCCATTC 


CTCAGGTCCA 


GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA 


TATTGGATTT 


GGCAACGCGT 


GAGTGGTCCA 


ACTCACCTGC 


7320 


AATAGCAACT 


TTAAGACCCT 


CAAAGTGGCC 


AAATTCCTCA 


TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 7440 

CCTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GCATGGCGAA TCACACAGAC 7500 

ATCCACTCCT AAAGCACACA GAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTGCGACTTC 7620 

AAAGGACTTA TGTGTCCGTC TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7680 

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 7740 

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT CATTTTCTGA 7800 

CATCGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTT CTT C AGAACGACTG GTTGGGATAT TTTTTCCAAC 7920 

CTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACCATCG ACTACAACTG CTAAACTCAC 7980 

ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GCACGGATGG TACGACCTGT 8040 

ATAGAGCACA TCATCCACCA AGATAACTTC GCCCTCTGTC ACATCCACAG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CATCATCACC GAAAGGTTTA CTATCCAATT CCACAACAGC 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA TAAAGACACC 8220 

ACGACTTTTA ATACCAGCCA AGACGATCTT ATTC AAATCT TTGTTGCGTT CGATAATCTC 8280 

ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTCTTTT 8340 

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AGACTTGAAA TTTATAGCCA 8400 

AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTCCCT 84 60 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA CAAAGCATGC 8520 

TTAAAATCAA GCAAAAAGTT TCAATCTACC ATCTTACAAA TTGCTAAAAT CATATAATTG 8580 

TGGGTACTGG TCACACTCTG GATTTTTTCG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8640 

GGCCTCATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATGACCC GCTTTTCCAC 8700 

CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 8760 

AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT TGGCTGTCTT 8820 

GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACCTGTC TGAGGGACTT GACCATCAAA 8880 

ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 8940 

CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG CTTGGGGTGT 9000 

TGGAAACGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG TCGTCTGGGC 9060 

TCATAACATG ACCGCAACCA GGAGTTCAAA ATGATTGGTA AAATCAAGAC TAGGCTTGGC 9120 

ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 
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CATTATTCAT 


CTCCGTCAAA 


TAGTCCTTGT 


AAGCCAGCAA 


AAGGACTGTT 




9240 


TTTACTGCTT 


TTTGAGCTTG 


GTATTCTTCC 


TCTGTCATGA 


TTTGCCAGTC 


ATTTCCTGAG 


9300 


ATAAATCCTT 


GACCAGCTTC 


TTCTTCACCC 


GTCAAGACCT 


TGATAGGAAT 


GTTTAGCAGG 


9360 


ATATTCTCTG 


ATACACTCTC 


AGCAAGGTCA 


AGCTCCCCAT 


TTTCGATGGG 


CAAGACCAAG 




TCATCATCTA 


AAACTTCTTG 


ATCTAGCTGG 


TTAGTTGCGC 


CTTCCATGAA 


AACTTCCGTG 




ACTGGATAAG 


ATTCAACTAA 


CTCAACTGGC 


TCCATACTGC 


GACTCGACGC 


AAGAACAATG 


731 U 


GTATAAGATA 


GTTGATAATC 


TAAGAAATAC 


ATACGGTCTT 


CATATTGTAC 


TTTCCCAACT 


9600 


GCAACGATAT 


CTTTTACATC 


TAAAATTTCT 


TGATTACGTG 


CACGCAGGTC 


ATCAACTAAA 


9660 


TCTAACCTTT 


GTTCAAAGTT 


CAAACCTTCA 


G ACTGCTT AC 


GAA'ITTCTTG 


AATATTTAAT 


9720 


TTCATACTTC 


CTCCATAAAG 


ATTTACTCTC 


TTGATTATAC 


CATGAAAAGG 


CTACAAATCA 


9780 


CCACACCAAA 


CTTTGTAATT 


AAAATTCAAA 


ATTTTAACAT 


ATTTACTATG 


ATAGTTTTAT 


9840 


TTTTTAGTCC 


TATACTATAG 


GGAAAGAGTA 


CATCAGATCA 


AGGAGGATGC 


TCACATGGAA 


9900 


GACAAGAAAC 


TCATTCAACT 


CCTATCCAAG 


TTAAATAAAA 


GCTACCAAAA 


CTGTAAACAG 


9960 


GGT ACCGC AG 


ATGATATTCG 


ACTACAAGAG 


CTGCTAAACA 


CTACTATGCA 


AGAGCTCAAA 


10020 


AAAACGGAAC 


AGTTGAACAA 


CAGTATCTTA 


ATTG AT CTTG 


AGAAATTTTA 


CCAACCTACC 


X0080 


AGTCTTCTGA 


TTGGACTGGG 


TAG CCT AAAA 


CTAAACGATC 


AAGCACGCAC 


TGCTTGGCGA 


10140 


AACTATGATA 


AATTCCATTA 


CGATCATGTC 


AAACACGTAC 


TAAGTCTCTA 


TGGACCTGTT 


10200 


TTTGAATTTT 


AGAGCATAGA 


ATTTCCAGTT 


TTCTGTTGAC 


AAAATTTCCT 


TAAAGGTATA 


10260 


ATATAAAGAT 


ACTAATACTC 


GGAGGTAAGG 


GAGACATGAA 


CAACTAAGTC 


TATCAAATAA 


10320 


AGAACCTTTA 


TTTAGTAGAT 


CTTUTTTITG 


TCTCTTTTTG 


TGTGCTCTTT 


TATGCTCTTT 


10380 


TTCTGGCATG 


TTAATAGAGT 




TAGACTTTGG 


GCTCTACTAC 


GTAAACTAGA 


10440 


GCTTTTTGTT 


ATGCACTATG 


AACATTCTAG 


AAAGGGAAAT 


CATATGATAA 


AAATCAATCA 


10500 


TCTAACCATC 


ACACAAAACA 


AAGATTTACG 


AGATCTTCTA 


TCTGACCTAA 


CCATCACCAT 


10560 


CCAAGACGGG 


GAAAAGGTTG 


CTATTATTGG 


TGAAGAAGGA 


AATGGCAAAT 


CAACCTTACT 


10620 


TAAAATTTTA 


ATGGGGGAAG 


CTTTGTCTGA 


TTTCACTATC 


AAGGGAAACA 


TCCAATCTGA 


10680 


CTATCAGTCA 


CTGGCCTACA 


TTCCTCAAAA 


AGTCCCTGAG 


GACCTAAAAA 


AGAAAACTTT 


10740 


ACACGACTAC 


TTCTTTTTAG 


ATTCTATTGA 


TTTAGACTAC 


AGTATCCTCT 


ATCGTTTGGC 


10800 


GGAGGAATTG 


CATTTTGATA 


GCAATCGTTT 


CGCAAGTGAC 


CAAGAGATTG 


GCAATCTATC 


10860 


AGGGGGCGAA 


GCTTTGAAAA 


TTCAGCTTAT 


CCATGAGTTA 


GCCAAACCCT 


TTGAGATTCT 


10920 
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ATTTTTAGAT 


GAACCTTCAA 


> ATGACCTAGA 


. CCTTGAGACA 


GTTGATTGGC 


TAAAAGGCCA 


, 10980 


GATTCAAAAG 


ACCAGGCAAA 


. CCGTTATTTT 


CATTTCCCAT 


CATGAAGACT 


TTCTTTCTGA 


11040 


AACGGCACAC 


ACTATTGTTC 


ACTTGCGACT 


GGTCAAACAC 


CGTAAAGAAG 


CGGAAACGCT 


11X00 


ACTAGACCAT 


TTACACTATG 


ATAGCTATAG 


TGAGCAGAGA 


AAGGCTAATT 


TTGCCAAACA 


11160 


AAGTCAGCAA 


GCTGCTAACA 


ACCAAAGAGC 


CTACGATAAA 


ACCATGCAAA 


AACATCGGAG 


11220 


AGTTAAGCAA 


AATGTAGAAA 


CTGCGCTTCG 


AGCTACCAAA 


GATAGTACTG 


CCGGTCGCCT 


11280 


ATTGGCTAAA 


AAGATGAAAA 


CTGTCCTCTC 


ACAAGAAAAA 


CGCTACGAAA 


AGGCAGCTCA 


11340 


GTCCATGACT 


CAAAAGCCAC 


TTGAAGAGGA 


ACAAATCCAA 


CTTTTCTTTT 


CACACATCCA 


11400 


ACCATTACCA 


GCTTCTAAAG 


TCTTAGTCCA 


ACTGGAAAAA 


GAAAATTTGT 


CCATTGACGA 


11460 


CCGAGTTTTG 


GTTCAAAAAC 


TACAACTAAC 


TGTCCGTGGC 


CAAGAAAAAA 


TCGGTATTAT 


11520 


CGGGCCAAAT 


GGTGTTGCGA 


AATCAACTCT 


GTTAGCCAAC 


TTACAGACAC 


TTCTGAATGA 


11580 


TAAAACAGAG 


ATTTCACTTG 


GTTTTATGCC 


ACAAGATTAC 


CACAAAAAAC 


TGCAATTGGA 


11640 


TTTATCCCCA 


ATAGCCTATC 


TCAGTAAAAC 


TGGGGAAAAA 


GAGGAACTAC 


AGAAAATCCA 


11700 


ATCTCACCTA 


GCTAGTCTCA 


ATTTCAGTTA 


TCCAGAAATG 


CAGCATCAAA 


TTCGCTCCTT 


11760 


ATCTGGCGGA 


CAACAGGGAA 


AACTCCTGCT 


TTTGGATTTA 


GTCCTGCGCA 


AACCAAACTT 


11820 


TCTCCTGCTG 


GATGAACCCA 


CACGAAACTT 


TTCTCCCACT 


TCTCAACCCC 


AAATCAGAAA 


11880 


ACTCTTTGCT 


ACCTATCCAG 


GCGGTCT CAT 


CACTGTTTCG 


CATGACCCTC 


GTTTCTTAAA 


11940 


AGAAGTCTGC 


TCGATCATCT 


ATCGCATGAC 


AGAACACGGT 


TTGAAGCTAG 


TTAATTTAGA 


12000 


AGATTTATAA 


ATTTGCAACA 


TAGCAAAAAT 


CCAGAGACGA 


CCTCTGGATT 


CTTTTACATC 


12060 


TGTTTTAAAC 


GTTCAATCCG 


TTCTGAGATA 


GGTGGGTGGG 


TATAAAACAG 


TTTTTGGAAC 


12120 


CCCCCACCTT 


TCTTAGGATC 


ATTGATATAA 


AGGGCACTGC 


TAGCATCATC 


CACGTGCCGA 


12180 


CTCATACGTT 


TGCTATTGTC 


CAACTTATCT 


AGGGCATTAA 


TCATTCCCTG 


GGGATTGCGA 


12240 


GTCAGCTCGA 


CACTAGATCC 


ATCTGCCAGA 


AATTCCCTCT 


GACGAGAAAT 


AGCGAGCTGA 


12300 


ACCAAGGTTG 


CAGCGAGAGG 


TGCCAGTACA 


ATAGCTAGTA 


GGGAAACCAC 


TAGCATAATG 


12360 


ATTTCAAGAC 


CATTTCCATC 


TCCGTCATCA 


TCACTTCGTC 


TGCGACCTGC 


TCCACCCCAC 


12420 


CACATCATAC 


GACCTGCCAT 


ACTAGAAAGC 


ATGGTGATAG 


CACTAGCAAG 


GGCAACTGCA 


12480 


ATAGTCGAAA 


TACGGATATC 


ATAATTACGA 


ATATGACTGA 


CTTCATGTCC 


CATAACAGCT 


12S40 


TCTAGTTCTT 


CACGATTCAT 


GATAGCTAGT 


AGACCTGAAG 


TCGCAGCAAC 


AGCCGCATTT 


12600 


TGAGGATTAG 


AACCTGTCGC 


AAAGGCATTT 


AAGGCTGGAT 


CATCAATGAT 


GAAAACACGG 


12660 


GGCATAGGAA 


TCTGAGCGAC 


CACAGCCATA 


TCTTCCACTA 


CATGGTAGAG 


GTCTGGTGCC 


12720 
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GTTTGCTCAT 


CCACCTCACG 


CCCTCCATTC 


ATGGACATGA 


CAATCTCTGT 


CGATTGAAAA 


12760 


ATCATAGACA 


AAGCGT AG AT 


AAAGCCGATA 


ATCAGTGCAA 


TAACCAAACC 


ACCAAGTCCA 


12840 


GATCTTATAA 


AGAGATAACC 


AACCGCATAA 


CCAACAAGAG 


CTAAGAGTAG 


GAAAAATACC 


12900 


AGCAACAAAA 


TCCAGGTTTT 


TCGTTTATTG 


CTTGCAATTT 


GATCAAACAA 


CATCTTAGTC 


12960 


ACCTAAACCG 


CTAAAATCAA 


CTTTAGGAAC 


CGACTTTTCC 


TCTTCAGGTG 


TTTGAAGGAA 


13020 


ATCTGCCCCT 


TTAAATCCAA 


ACATTCCAGC 


GATAATATTG 


CTCGGGAAAG 


TTTCTAATTT 


13080 


TACATTCTAG 


TTGCTGACAA 


CACTGTTATA 


GAGTTGACGA 


CAGTAAGAAA 


TTTTATTTTC 


13140 


TGTCTTTGTC 


AACTCCTCTT 


GCAATTTAAC 


AAAGTTAGCA 


CTAGCTTTCA 


AATCTGGATA 


13200 


GCTTTCTGCA 


ACTGCAAAAA 


TACCTGAAAC 


CTGACGAGTG 


AGGGCATCAC 


TGGCTTTCAT 


13260 


AGCTTCTGCT 


GGTGAAGTCG 


CTGCCGCCAC 


TTGGTTACGT 


AGTTCTGCCA 


CCTTTTCAAG 


13320 


GGTAGAACCT 


TCATATTTGG 


CATAACCTTT 


TACAGTCTCA 


ATCAAGTTTG 


GCAAGAGCTC 


13380 


ATTGCGACGT 


TTCAACTGAA 


CATCAATCTG 


ACTCCAAGCC 


TCCTTGGTTT 


CCATACGATT 


13440 


TTTAACCAAA 


CCCTTATAGC 


TAACAATCAC 


AAAAATAACA 


ATAAGAGCCA 


TAACTCCAAG 


13500 


AATAATCCAA 


GTCATAATAT 


AAGTCCTTTC 


TCCTTTTACA 


TTAGTACCAC 


TATATCAAAT 


13560 


TTTCTATGAT 


TCTGGTAAAA 


TAAGATGATA 


CTAAAGAAGG 


AAATAACTAT 


GAAACCAAAA 


13620 


ACATTTTACA 


ACTTGCTTGC 


CGAGCAGAAT 


CTTCCACTTT 


CGGACCAGCA 


AAAAGAACAA 


13680 


TTTGAACGTT 


ATTTTGAGCT 


CTTGGTCGAC 


TGGAATCAGA 


AGATTAATTT 


GACGGCGATT 


13740 


ACGGACAACG 


*AGAAGTTTA 


TCTCAAACAT 


TTTTACGATT 


CGATTGCACC 


CATTCTTCAA 


13800 


GGTTTGATTC 


CCAATGAAAC 


TATCAAACTT 


CTTGATATCG 


GGGCTGGGGC 


AGGATTTCCT 


13860 


AGTCTACCAA 


TGAAAATTCT 


CTATCCCGAC 


TTAGATGTGA 


CCATTATTGA 


TTCACTCAAT 


13920 


AAG CGCATCA 


ACTTCCTACA 


ACTCTTGGCT 


CAAGAACTGG 


ATTTGAACGG 


AGTTCATTTC 


13980 


TACCACGGAC 


GTGCCGAAGA 


TTTTGCCCAA 


GACAAGAACT 


TCCGTGCTCA 


ATATGATTTT 


14040 


GTAACAGCTC 


GTGCGGTTCC 


CCGTATGCAG 


GTCCTATCTG 


AATTCACTAT 


TCCCTACCTT 


14100 


AAGGTTGGTG 


GCAAACTATT 


AGCACTCAAG 


GCTAGCAATG 


CGCCTGAGGA 


ATTATTAGAA 


14160 


CCTAAGAATG 


CCCTCAATCT 


CCTTTTTAGT 


AAGGTCGAAG 


ACAATCTCAG 


TACGCCCTAC 


14220 


CGAATAGAGA 


TCCGCGCTAT 


ATCACAGTGG 


TAGAAAAGAA 


AAAAGAAACA 


CCAAATAAAT 


14280 


ATCCACGTAA 


GGCTGGTATG 


CCAAATAAAC 


GCCCACTTTA 


AATTTTTTAG 


TAAACAAATG 


14340 


TTTACAAAAT 


CAGCCTCGCT 


CTTTTATTTC 


TAGGCTCCGG 


AAAAAATGAT 


TTACAAAATC 


14400 


AGCCTCGCTC 


TTTTATTTCT 


AGGCTCGGGA 


AAAAATGATT 


TACAAAATCA 


TTTTTTTCTG 


14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 14520 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAACTCTT ACTCTCAGGG CCTATTGCTG 14580 

TAAAATAATC GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14640 



TCTTGATTTG TTAC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: €405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



14654 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATCTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTCTTCA AGCTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTG A TTTT G 240 

GAAGAAGTCA TGGCGCTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCCCC ATGCTGAAAG CTTGGGAGTA CATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACACACT ACGTCATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAACC CTTTACACAG AAATGTTCAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGACAAG 600 

ACCATATCGT CTTTAATCGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCACAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACACCGC GTGAATTGCA GTATGCTATC AACGCAATCA 780 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TCATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTCATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCCTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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^ h s*-ft*T**f* 

GGGTGAGATT 


TTCTATGCTG 


rww*f* a. ft AT 




GCAGGCACCC 


AGTTCAAAAA 


1260 


ft ft ft Ik 


UnAAuL 1 1 In 




TCAGATTGAA 


AATGATGTCA 


ACTGTGCAGG 


1320 


TCTTGCTGAG 






GCGACCAAGT 


GTGACACTTT 


GCTTGACCAT 


1380 


TGGAACCCGT 


ATCGGTGGTT 


GCTTGATTAT 


uVjaI nuwnAA 


RTCTTgCATC 

VJ ft \m A A A V 


CTTTTAfiCAA 

VJ A A A A rlw^ IM* 


1440 


TTCAGCCTGT 


0+ ft ft 

GAAGTCGGGT 


ft •»» ft >m^^l>H>»*|> 

AT ATGCA X A 1 




CfTTTTCAAG 
4 • a a 


ACTTGGCTTC 

AVi ft 4 VJ^JN^ A 4 w 


1500 


TACAACAGCT 


TTAGTGAAAT 


ATCTACLlwA 




CAAfiATGTTC 


ATCAGTGGAA 


1560 


TGGCCGTAGA 


ATTTTCAAAG 


AAGCCACTGA 






AJlCCTATTCA 


1620 


CCGTATGGTT 


GACTATCTAG 


GAAAAGGTCT 


GGCAAATATT 




Pf*AAT^CACA 


1680 


AGTGGTTATT 


CTTGCTGGTG 


MH ft MAX* ft fR^^M 

GTATCATGGG 


GCAAGAuOUT 




w 1 nAun 1 VV.VJ 


1740 


TACAGCCTTG 


AAAGAGGCTT 


TGGTACCAAG 


■MIMft ft r^f* ft ft ft. 

TTTAGCAGAA 


AAAACAVpuAT 


^ Art a a •t* , p*Wir* 

1 AunA I 1 I 


lOvu 


CCATCACCAA 


AATACAGCAG 


GGATGTTGGG 


TGCATATTAT 


(»n l 111 nnAin 


TAAAArAATT 


1660 


CTAGTTTGGC 


TCAGCCAAAC 


TAGGATTTTC 


TTACACGTTT 


TTCTCTACUA 


1 AOLLu 1 lljA 


1 420 

IJftU 


GTTTTTTATT 


TTCCCAGTAG 


CTATTAAAGA 


TTTTTTCCTT 


GCTTTCGCGA 


I I LrA 1 A lA.w/ft 


170V 


AAAAGTAGGC 


ATAAATCAAA 


TCGATAAAGA 


AGAGCATAGG 


AAbTJwAuLVi 


UA1 Ai ItVil i 


2040. 


GGAT AT AGG A 


CGGTTGGCTG 


TGGGTGGCTA 


f~> m ftS* A ft ft /T* 

CAAGAAV.AGT 


V. 1 V- 1 V_> 1 « 1 /\*J 




2 100 


CTTTATTGGG 


AACACTTGTA 


« ft ft ft ^ & ^* 

AAGAGTACAG 


TCTTTGCCCC 


OVTCTCWTTA 




2 160 


GACTATCTAA 


AATAGAAGGA 


GTTGAGCCTG 


AAAGTGAGAA 


Gl>l»w/*u 1 f\<, 1 


AuALAAl 1 ft i 


2220 


CATCCATGAT 


CCTGGTTGTC 


ft ft ft ft 

CAGGCAAAGC 






TTCTAGACCA 


2280 


CACCTAGTCG 


CATAAAACGT 


AATTTCATTT 


CACGGGCGAG 


(jAOuLWAwAA 


V» 1CV.V> A VJ 1 ft W 


2340 


CAAAGAAGTA 


GATACGCTCA 


ft TI"«l'TPf A 

bUATL I I V.uA 




nA A • WV* i IWI 


AGTTGGATTT 

*»\J A 4 WWI ■ * A 


2400 


CGTCAATCAA 


GTCTTGTGTT 


1 V* 1 1 UWl. 1 V.A 




ACTTCTGAGG 


ACTCGTTTGG 


2460 


TCAGTGCACT 


GTGCTTGGAG 


ACTTGGTTGG 


CTTGATTTTC 


TGCCTGATGT 


TGGTATTGGA 


2520 


AAATAAATTC 


TCGGTAGCCA 


GTAAAGCCAC 


ACTTTTTACC 


AAAGCGGGTC 


AAAGCAGCTT 


2580 


GAGAAATATG 


TAATTrrrGG 


GTGACTTGTT 


GAGAAGATAA 


ATCATCTGTA 


ATCGTTTCAG 


2640 


CTTGCAAAAA 


ATAGCGAGCG 


Al-n*CTTGTT 


CTAGGTCTGT 


CATTTCTTCA 


AAATGTGAAT 


2700 


CAATGATAGT 


TGCGATATCT 


GGTTTGTCCA 


TAGGGAAAGC 


TCCTTTACAT 


GAGTCATACT 


2760 


GGAAGACTAG 


ATCAGAGAAT 


AGTCACACTT 


CATTATAACA 


CATAATATAA 


GGATAGATAA 


2820 


ATAAAAACGC 


ATCTCTGTTT 


TAAAAACGAA 


AAAATCGAAA 


AAGCTTCTCT 


CTTTTCCATA 


2880 


ATTTTCTACT 


CAAATTGTGG 


TACAATTAAG 


AGTAAGATTT 


TAAGTTAGAA 


ATGACACTGA 


2940 
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TTTCTATGAG AAAATTTAAC AGCCATTCGA TTCCGATTCC CCTTAATTTA TTGTTTTCAA t 3000 
TCGTCATTTT ACTCTTTATG ACCATTATTG CT CC TTTCT T GTATATGCAG GTTTTGAACA 3060 
AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 3120 
CCCGTGGGGA AATTTATGAT GCTAGTGGAA AACCTTTGGT AGAAAATACG TTAAAGCAGG 3180 
TTGTTTCCTT TACGCCTAGC AATAAAATGA CGGCTACAGA CTTAAAAGAA ACAGCTAAAA 3240 

AGTTACTGAC TTATGTCACC ATCAGTTCTC CAAATTTCAC AGAACGCCAG CTGGCGGATT 3300 

ACTATTTGCC TGATCCTGAA ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 3360 

GCTTGGATTC AGATGGCAAT CGTCTATCCG AATCAGAACT GTATAACAAT GCGGTCGATA 3420 

GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 3480 

GTCAGTTAAA TGCTGTTGGA AACTTTGCGA CAGGAACCAT TGCGACAGAT CCTCTAAATG 3540 

ATTCTCAGGT GGCTGTTATT GCCTCTATTT CAAAGCAGAT GCCTGGCATT AGTATTTCTA 3600 

CTTCTTGGGA TAGAAAGCTT TTGGAAACTT CCC T T T CTTC TATAGTTGGG ACTGTATCCA 3660 

GTGAAAAAGC TGGTCTCCCA GCGGAACAAG CAGAAGCCTA TCTTAAAAAA GGCTATTCTC 3720 

TAAATGACCG TGTAGGAACC TCCTATTTGG AAAAGCAATA TCAAGAGACC TTACAAGGAA 3780 

AACGCTCGGT AAAAGAAATC CATCTGGATA AATATGGCAA TATGGAAAGC GTGGATACAA 3840 

TTGAGCAAGG TAGTAAGGGA AACAATATCA AACTGACCAT TCATTTGCCT TTCC AAGAT A 3 900 

GCGTGGATGC TTTACTGAAA AGTTATTTCA ATTCTGAGCT AGAAAATGGT GGAGCCAAGT 3960 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCGCTT TTGTCTATGT 4020 

CAGGGATTAA ACATGACTTG AAAACGGGAC AGTTGACGCC TGATTCCTTG GGAACGGTAA 4080 

CCAATGTCTT TGTTCCAGGT TCGGTTGTCA AGGCGGCGAC CATCAGCTCA GGTTGGGAAA 4140 

ATGGAGTCTT GTCAGGAAAC CAGACCTTCA CAGACCAGTC CATTGTCTTC CAAGGTTCAG 4200 

CTCCCATCAA TTCTTGGTAT ACTCAGGCTT ACGGTTCATT CCCTATCACA CCGGTCCAAG 4260 

CTCTGGAGTA TTCATCAAAT ACCTATATGG TCCAAACAGC CTTAGGTCTT ATGGCGCAAA 4320 

CCTATCAACC CAATATGTTT GTCGGCACCA GCAATCTAGA GTCTGCTATG GAGAAACTGC 4380 

GTTCAACCTT TGGCGAATAT GGCTTGGGTA CTCCGACAGG AATTGACCTA CCAGATGAAT 4440 

CTACTGGATT TGTTCCCAAA GAGTATAGCr TTGCTAATTA CATTACTAAT G CCTTTGGGC 4500 

AGTTTGATAA CTATACGCCG ATGCAGTTGG CTCAGTATGT AGCAACTATT CCAAATAATG 4560 

CTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT AAGGGAGGAC 4 620 

TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA TAAGGTCAAT ATATCCGACT 4680 

CCGATATGAG CATCTTGCAC CAAGGTTTTT ATCAGGTTGC CCATGGTACT AGTGGATTGA 4740 
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A A CTGG ACG 


TGCCTTTTCA 


AATGGTGCCT 


TGGTATCCAT 


TAGCGGAAAA 


ACAGGTACAG 


4800 


W W\lAAn\j\# l ft 


TGTCGCAGAT 


GGTCAGCAAG 


CAACCAATAC 


CAATGCGGTG 


GCCTATGCCC 


4860 


PATCTCATAA 

wft A w X Uft A ftft 


TCCCCAAATC 


CCTCTCGCAC 


TGGTCTTTCC 


TCATAATACC 


AATCTAACAA 


4920 


ft A V* V-f lu I Aww 


ftWW A * vWl A A 


GCGCGTGACA 


TTATCAATCT 


GTATCAAAAA 


TACCATCCAA 


4980 


Tunni iAu/VA 


AuOn/w\ A i ft A 


AAA ft a 


ACACCTATTG 


CCAAGTTGAT 


TGACAGTTAT 


5040 


TCXAAGTTAC 


Lnuu A ft l 


GATTAAGACG 

V*ft A A ftft\*nw V 


GCTACGCGTC 


TGGCCTTTTA 


TACGATTGGG 


5100 


ATGTwTowTa* 


A 1 OA i w> 


TftAATTTCCA 

1 VJftft All wW>ft 


AAAAATCTCC 

ArtlWl AW* WW 


TTTCTGCTAA 


GAGAGAATTG 


5160 


ACATATTGTT 


CTATTTGTGG 


Awu i 1 lunwA 


KArflACGATC 


fPTftTTPTAT 

W A A \£ A> A W A A 


CTGTACTdAl* 

W A \* A #^W A A 


5220 


CCGACTCGTG 


ACwAuAwAAw 


AATTXTAGTT 


fTTCAGGATA 

V* A A WftWXft A ft 


CTACAGATGT 


GGCAGCCATG 


5280 


GAAAATATCC 


X A X ink X 

AAGAATawCA 


v\jI»Aw. IV. inl 


wA i u 1 ww> i 1V> 


ft X VJUl. w i wft A 


*T**I 1 ^* I'l "T A*Pfi 

A A W A WW A ft A V# 


5340 


AATGGTATCA 


GTCCGGA^AjA 




AAUAwLU t a ft 


1 WftW 1 WW AWl 


TATGCATACT 

A ft AS \#V*ft * ftw * 


5400 


GAGGTTTCAG 


AAGTGATTGT 


IAjwwAw 1 AA 1 


V>w a Aw nvV>uu 


ATRCTCAACP 

ft A VJ\J A vAAUL 


U>ftW A A WWft A V£ 


5460 


TATCTTTCAC 


GTTTGCTCAA 


GCkbGL 71 wr 


A i LAAuu 1 1 A 


wVaw w J w a ftV«»w 




5520 


GCTGTGGGAG 


CGGACATTGA 


vtA iwwVjUAt 


uAnu a wftLftw 


A W A A ftWUftxJW 


CATTGAAAAT 


S580 


CGGACAGAGT 


i \* 1 AAU IwlA 


uuLA/vi l 1 In 


£■■CAA{'•TCCAT 

LVj/WV. J V W ft A 


TCATTTATAA 


AAAATCAAAG 


5640 


X It X X 

AGGCTGAAAA 


TCGTTCCTAT 


CGGCCTCTTT 


TTCTATAflTG 

I A W A ft A ftU> 1 \9 


TCATGACTAC 


GCTCAGGTTC 


5700 


AAGTTTTAAA 


AAALUv\ul.A 


AA 1 A lUA A /\ A 


A rTAAAHACC 

ft W A ftftft VftwW 


G ACT ATTCT A 

\J^^W 4 4 4 W 4 • » 


GTAGAATTAG 


5760 


GALAAATAAT 


A 1 unnnLnnA 


WWft 1 iftl * Sp t 


TTTATATCGT 

AAA ft A ft A WW A 


CGACGGAGTG 


CGGAACGCGA 


5820 


X rTPTTTTTr 

nu i V_ 1 A_ 1 v* 1 w 


LI i i LnuL lu 


ftwftV* 1 w i wft 1 


GCGTGCGGTC 

WWW) £ %*W^*%J A W 


GATTACGACC 


GTTTCACAGT 


5880 


w AAwAw 1 1 iw 


TTTATCA(3TC 
1 1 iAl ^ft W 4 w 


ACTCAGGTCA 


CTTTATCAAA 


ACACAGGAAT 


TTACTCATGC 


5940 


TCCCCGGCAA 

A WWwVJWWWftft 


GAAGACCGTC 


TCATGACCAA 


TGAAACCATT 


GATTCGGATA 


AGAAAGTTGC 


6000 


ACCAAGTGCT 


ATCTACGAAG 


AAGGTGCAGT 


GGTCTTTCCA 


GTCCTTCACG 


GGCCAATGGG 


6060 


AGAAGATCGC 


TCTGTTCAAG 


GATTCTTGGA 


AGTTTTGAAA 


ATGCCTTACG 


TTCGTTGCAA 


6120 


CATTTTGTCA 


TCAAGTCTTG 


CCATGGATAA 


AATCACGACT 


AAGCGTCTTC 


TGGAATCTGC 


6180 


TGGTATTGCC 


CAAGTTCCTT 


ATGTGGCTAT 


CCTTGAAGGC 


GATGATCTGA 


CTCCTAAAAT 


6240 


CCCTGAAGTG 


GAAGAAAAAT 


TGGCTTATCC 


AGTCTTCACT 


AAGCCGTCAA 


ACATGGGGTC 


6300 


TAGTGTCGGT 


ATTTCTAAGT 


CTGAAAACCA 


ACAAGAACTC 


CGTCAAGCCT 


TAAAACTTGC 


6360 


CTTCCGATAT 


GACAGCCGTG 


TCTTGGTTGA 


GCAAGGAGTG 


AATGC 




6405 


(2) INFORMATION FOR SEQ ID NO: 108: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CGAGCTCGGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 
ATTAGTATTA TCTGTTGTGA TTGTGATTCC AATTTTCATG CAACCAACCA AAAACCAATC 
CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTGTTTGAA CGCAGTAAAG CTCGCGGTTT 
TGAAGCTCTA ATGCAGCCTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 
AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 
TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 
AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 
AATGATTTGG CTCAGGCTTT GGGAAAAGAC ACTTCCAAGG ATTTTCGTGA GTTGATTAAA 
ACCTTCTCCT TAATCGAAAC AAAGCACCAA ATTCCTTTTG AAGAAGATCG TACTCTCACA 
TTAGAAATTA AGAAAAAACA TGACATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 
CGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GACGACGACC TTTTTGTAGG GAAAAATGAT 
GTCAACTATG CTATTGATGG TGATACCCTC GAGGTAGTGA TTAAGAAAGT CCCTCACCCC 
AATAAGCGAA CAGCACCACA AGCCAAAATT ATTGATATCC TAGAACACAC TTTGACAACA 
GTTCTCCCGC AAATCGTTCT GGATCAGGAA AAACCTAACT ATGCTCCCTA TATTCGTTCA 
AAAAATCAGA AAATCACTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 
ACAGAACTTC TCAAACTCTT TATCCATAAA TACCCAACCA AGAAACATGA TTTCTTTCTC 
GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 
TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTC TTGTTAAGGA AGCAGAAJtGT 
CTGCCTCATG CTCCGTCTCA AAAGGATATG GAAGCTCGTC TCGATCTAAG AGATGAAATT 
ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATCCAGTGCA TATCAAGGCT 
CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAC ATGTTTCTTA TTATGTGACC 
GAGCGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTCTTTA CCTGACAGAC 
CGAGTGGTGC CAATGCTTCC ACAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 
CTTGACCGCC TGACCCACTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 
TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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GATATCCTAG 


CTGCCCATGA 


AGAAAAGAGA 


AAAGAATATC 


ATAAAATTGT 


ATCAAGTATC 


1560 


GAACTCATGG 


CCAAGCTTCA 


TGAAACTTTA 


GAAAACATGC 


GTGTGAAACG 


TGGAGCTCTC 


1620 


AATTTTGATA 


CCAATGAAGC 


GAACATTTTA 


GTGGATAAAC 


AAGGTAAGCC 


TGTTGATATC 


1680 


GTTC TTCGGC 


AGCGTGGTAT 


TGCCGAGCGG 


ATGATTGAGT 


CTTTTATGTT 


GATGCCTAAT 


1740 


GAAACAGTTG 


CCGAACATTT 


CAGCAAGTTG 


GATTTGCCTT 


TTATCTATCG 


AATTCACGAG 


1800 


CAGCCTAACG 


CTGAAAAGGT 


TCAGAAGTTT 


ATTGATTATG 


CTTCGAGTTT 


TGGCTTGCGC 


1860 


ATTTATGGAA 


CTGCCAGTGA 


GATTAGTCAG 


GAGGCACTTC 


AAGACATCAT 


GCGTGCTGTT 


1920 


GAGGGAGAAC 


CTTATGCAGA 


TGTATTGTCC 


ATGATGCTTC 


TTCGCTCTAT 


GCAGCAGGCT 


1980 


CGTTATTCGG 


AGCACAATCA 


CGCCCACTAT 


GGACTAGCTG 


CTGACTATTA 


TACTCACTTT 


2040 


ACCAGTCCAA 


TTCGTCGTTA 


TCCAGACCTT 


CTTGTTCACC 


GTATGATTCG 


GGATTACGGC 


2100 


CGTTCTAAGG 


AAATAGCAGA 


GCATTTTGAA 


CAAGTGATTC 


CAGAGATTGC 


GACCCAGTCT 


2160 


TCCAACCGTG 


AACGTCGTGC 


CATAGAAGCT 


GAGCGTGAAG 


TCCAAGCCAT 


GAAAAAGGCT 


2220 


CAGTATATGG 


AACAATACGT 


CGGTGAAGAG 


TATGATGCAG 


TTGTATCAAG 


TATTGTCAAA 


2280 


TTCGGTCTCT 


TTGTCGAATT 


GCCAAACACA 


GTTGAAGGCT 


TGATTCACAT 


CACTAATCTG 


2340 


CCTGAATTTT 


ATCATTTCAA 


TGAGCGTGAT 


TTGACTCTTC 


GTGCAGAAAA 


ATCAGGTATC 


2400 


ACTTTCCGAG 


TGGGTCAGCA 


GATCCCTATC 


CGTGTTGAAA 


GAGCGGATAA 


AATGACTGGA 


2460 


GAG ATT GATT 


TTTCATTCGT 


ACCTAGTGAG 


TTTGATGTGA 


TTGAAAAAGG 


CTTCAAACAG 


2520 


TCTAGTCGTA 


CTGGCAGAGC 


CCGTGATTCA 


AATCGTCCTT 


CGGATAACAA 


GGAAGACAAG 


2580 


AGAAAATCAG 


GACGCTCAAA 


TGATAACCGT 


AAGCATTCAC 


AAAAAGACAA 


CAAGAAAAAA 


2640 


CGAAACAAAC 


CTTTTTACAA 


GGAAGTAGCT 


AAGAAAGGAG 


CCAAGCATGG 


CAAAGGGCGA 


2700 


GGGAAAGGTC 


GTCGCACAAA 


ATAAAAAGGC 


ACGCCACGAC 


TATACAATCG 


TAGATACGCT 


2760 


AGAGGCAGGG 


ATGGTCCTGA 


CTGGAACTGA 


AATCAAGAGT 


GTACGAGCTG 


CTCCAATTAA 


2820 


TCTCAAGGAT 


GGCTTTGCTC 


AAGTGAAAAA 


TGGAGAAGTT 


TGGCTGAGCA 


ATGTTCATAT 


2880 


CGCGCCTTAC 


GAAGAGGGCA 


ATATCTGGAA 


CCAGGAACCA 


GAACGTCCTC 


GTAAACTCCT 


2940 


.GCTCCATAAA 


AAGCAAATTC 


AAAAATTGGA 


ACAAGAGATC 


AAAGGGACAG 


GAATGACCTT 


3000 


AGTTCCCCTT 


AAGGTCTATA 


TAAAAGATGG 


CTACGCTAAG 


CTTCTTTTAG 


GACTTGCCAA 


3060 


AGGGAAGCAT 


GACTATGACA 


AACGGGAGTC 


TATCAAACGT 


CGTGAGCAAA 


ATCGAGATAT 


3120 


CGCGCGTGTG 


ATGAAAGCTG 


TTAATCAGCG 


ATAAAAAGAG 


GAATTCAAAA 


TGCAAAAATT 


3180 


AGTTGCCTAT 


AAACGCATGC 


CTTTGTGGAA 


TAAACAAACA 


ATGCCTGAAG 


CTGTTCAGCA 


3240 
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AAAGCACAAT ACAAAAGTTG GGACTTGGGG GAAAATTACT GTCTTGAAGG GAGCTCTCAA 3300 

GTTTATTGAA TTGACAGAAC AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3360 

AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATGATGT 3420 

GGAATGGTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT TTTGCTAAAA AATACAATAC 3480 

CAATCCTGTT CATTCACAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3540 

GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3600 

CACGGCTGTA GATCAAAATG CACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3660 

AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 3720 

TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 3780 

TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 3840 

GGACACGGAC GATTATCCTT GCTCGGTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3900 

G GC AG ACT AT TACAAGGATT GGGAATTGGT TAAGTACAAT CAAAATCCAG GCCATTTGCA 3960 

CCGTCCCGAT GAGAATGGCA ATCGTATTCA ACTACGCTTT GCGACCTTAC TAGCTAAGAA 4020 

AATCAACTAA ACACACATGA AGATTAGGAA TTTTCCTCAT CTTTTTTCTT TTTTACGAAT 4080 

GATATAGAAA AGGAGGGAAT TCATGTTTGT TCCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140 

TG TGTT AG AG GATAAACTTG AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 4200 

CCATTTCCGT CAAGGACCAA GTGTACGGAC CCATTTTCCC CATAAATCCT TAAAAGACTG 4260 

TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC AATAAGGAAT CCCTCTATCA 4320 

CTGCTTGAAA AAAGAGACAA ACGTTCAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 4380 

TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAACTT CAGTGTAGTC CCTTGCCTCA 4440 

GAAAGTCCTT AAAGAGCCAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTGTCGTT 4500 

GCTGGGTCAA AAACTCTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG GTTTTCTTTA 4560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG CGAATTAGAC AAGGAAAAAC AAGTTTTAAG 4 620 

ACTCAAATAC CTGATTTACC ACGATCTCCG CGGTAAACTC CATTATCAAA TCAAGGAATT 4 680 

TTCCTATGGT CAAGCTAGTT TATTCGAAAT ATTGCGTCTT CCCTATAAGA GACAAAAAAT 4740 

ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC TATATCCCGC AACAACTTTA 4800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 4860 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGC GCAAATTTTT 4920 

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4980 

TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
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AAATATGGTA 


GAATAGAAAG 


GATGGAGGAA 


TCTAATGGTA 


TTACAAAGAA 


ATGAAATAAA 


S100 


TGAAAAAGAT 


ACATGGGATC 


TATCAACGAT 


CTACXCAACT 


CACCAGGCTT 


GGGAAGAAGC 


5160 


CTTAAAAGAT 


TTAACAGAAC 


AATTGGAGAC 


AGTAGCCCAG 


TATGAAGGCC 


ATCTCTTGGA 


5220 


TAGTGCGGAT 


AACCTACTAG 


AAATCACTGA 


ATTTTCTCTT 


GAAATGCAAC 


GCCAGATAGA 


5280 


GAAGCTTTAC 


GCTTATGCTC 


ATATGAAGAA 


TGACCAGGAT 


ACACGTGAAG 


CTAAGTATCA 


5340 


AGAGTACTAT 


GCCAAGGCCA 


TGACACTCTA 


CAGCCAGTTA 


GACCAAGCCT 


TTTCATTCTA 


5400 


TGAGCCTGAA 


TTTATGGAGA 


TTAGCGAAAA 


GCAGTATGCT 


GACTTTTTAG 


AAGCTCAACC 


5460 


AAACCTGCAG 


GTTTATCAAC 


ACTATTTTGA 


CAAGCTTTTG 


CAAGGCAAGG 


ATCACGTTCT 


5520 


TTCACAACGT 


GAAGAAGAAT 


TATTGGCTGG 


AGCTGGAGAA 


ATCTTTGGTT 


CAGCAAGTGA 


5580 


AACCTTCGCT 


ATCTTGGACA 


ATGCGGATAT 


TGTGTTCCCT 


TATGTCCTAC 


ACGATGATGG 


5640 


TAAAGAAGTT 


CAGCTATCTC 


ATGCGACTTA 


CACACGTTTG 


ATGGAGTCTA 


AAAAACGTGA 


5700 


GGTTCGCCGT 


GCTGCCTATC 


AAGCTCTTTA 


TGCGACTTAC 


GAACAATTCC 


AACACACCTA 


5760 


TGCCAAAACC 


TTGCAAACCA 


ATGTTAAGGT 


GCAAAATTAC 


CGTGCTAAAG 


TTCGTAACTA 


5820 


CAAGAGTGCT 


CGTCATGCAG 


CCCTCGCAGC 


GAATTTTGTT 


CCACAAAGTG 


TTTATGACAA 


58B0 


TTTGGTAGCA 


GCAGTTCGCA 


AGCATTTGCC 


ACTCTTACAT 


CGCTATCTTG 


AGCTTCGTTC 


5940 


AAAAATCTTG 


GGGATTTCAG 


ATCTCAAGAT 


GTACGATGTC 


TACACACCGC 


TTTCATCTGT 


6000 


TGAATACAGT 


TTTACCTACC 


AAGAAGCCTT 


GAAAAAAGCA 


GAAGATGCTT 


TGGCACTCTT 


6060 


GGGTGAGGAT 


TACTTGAGCC 


CTGTTAAACG 


TCCCTTCAGC 


GAGCGTTGGA 


TTGATGTTTA 


6120 


CCAAAATCAA 


GGCAAGCGTT 


CAGGTGCCTA 


CTCTGGTCGT 


TCTTATGATA 


CCAATGCCTT 


6180 


TATGCTTCTC 


AACTGGCAAG 


ACAATCTGGA 


CAATCTCTTT 


ACTCTTCTTC 


ATGAAACAGG 


6240 


TCACAGTATG 


CATTCAAGCT 


ATACTCGTGA 


AACTCAGCCT 


TATGTTTACG 


GGGATTACTC 


6300 


TATCTTTTTG 


GCTGAGATTG 


CCTCAACTAC 


CAATGAAAAT 


ATCTTGACGG 


AGAAATTATT 


6360 


GGAAGAAGTG 


CAACACGACC 


CAACACGCTT TGCTATTCTC AATAACTTCC 


TAGATGGTTT 


6420 


CCGTGCAACA GTTTTCCGCC 


AAACTCAATT 


TGCTGAGTTT 


GAACACGCCA 


TTCACCAAGC 


6480 


AGATCAAAAT 


GGGGAGGTCT 


TGACAAGCGA 


TTTCCTAAAT 


AAACTCTACG 


CAGACTTGAA 


6540 


CCAAGAGTAT 


TATGGTTTGA 


GTAAGGAAGA 


CAATCCTGAA 


ATCCAATACG 


AGTGGGCTCG 


6600 


CATTCCACAC 


TTCTACTATA 


ACTACTATGT 


ATATCAATAT 


TCAACTCGCT 


TTGCGGCCGC 


6660 


CTCAGCCTTG 


GCTGAAAAAA 


TTGTCCATGG 


TAGTCAAGAA 


GACCGTGACC 


GCTATATCGA 


6720 


CTACCTCAAG 


GCAGGTAAGT 


CGGACTATCC 


ACTTAATCTC 


ATGACAAAAC 


CTGGTGTTGA 


6780 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC CTTTAAATGA 6840 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6900 

ATGCTAACCA TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC CCCAAGGAAA 7020 

ATATTCCTAT TATTCCCCAT GAAACGGTTC CTTATTTC CG ' I T ITC T T ATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7140 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC CCAAGCAAAT CACTCTCCTA GAGCGAGATG 7260 

CGGTGGATGT CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG GATTCTGCCA 7320 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT GGTGGTGTGG 7380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GCATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7500 

CAGAACTCAC CGCAACATTA CTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAC AAAAAATAGT 7620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7680 

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA GTTTTTGAAA 7860 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC GAAGAAAAAA 7920 

AACAATATGG CGAAAACTAC CAACGTGTCT TCTCACAAGC AGCTATGACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCCTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CACAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAC CAAGTCAAAA 8280 

AAGCCGCTIT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 8340 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA CAAAATGATT 8460 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGCTTAAGG 8520 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA 8580 
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GTAGTACATC 


AAACGAATAG 


TCCAAATCAA 


TGAGTCAGGG 


k k ft • ft ft 

AAAAAACTCG 


ACTTCAWjAA 


8640 


AAAATGAAGC 


AAACATTCCC 


ACAATAAAAC 


GCATAGTACA 


AGGTTTGTAC 


TGCCCCCCAA 


o »vw 


AAAGTTAGAC 


AATTAATTTA 


TCCGAAGGAT 


TTAGITCTGT 


A7TGCACAGA 


GCTAAGTCCT 


am ft 

O I OKI 


TTTAGTTTTA 


TCTTAATTCT 


CTTATTGTTG 


TAATAATCAA 


TATACTCTAT 


AATGGCTCGT 


Ob«V 


TCCAATTGAT 


TAAGTGATTT 


AAATGTTTTC 


TCATAGCCAT 


AAAACATTTC 


m IMMMI ft ft ft 

GGATTTTAAA 


O BOA 


ATGCCAAAGA 


AAGATTCCAT 


CCTACCGTTG 


TCTTGGCTGT 


TGCCCTTACG 


him « *m m. ft mm 

TGACATGGAT 


8940 


GCTTGAATTC 


CCTTACTCTC 


TAGGAAGCGA 


TGATAAGAAT 


CGTGTTGATA 


TTGCCAGCCT 


9000 


TCGTCACTAT 


GGACAATCGT 


ATTCTCGTAG 


TGCTTCTCTT 


TGAATGCCTG 


TTCCAACATT 


9060 


AACGATCAAT 


CAATTTAATC 


ATGTACCTAA 


GATTAGAATT 


GTTTATCCCA 


AATTTATTTG 


9120 


AAAGCTTCTC 


TAAGCTATAT 


CCTTGTTTTC 


TAAGTTCATA 


GATCTGAACT 


TTATCATCAT 


9180 


AAGTTAATTT 


CATAATAAAA 


ACACCCCAAA 


ACTTAGATTT 


TTTCTCTCTA 


ACTTTTGGGG 


9240 


TGTACTTCAT 


GTACACCTGA 


TATGATGCCT 


TTTATAATTT 


TAAACACTTT 


TTGACCAGCC 


9300 


TCATTTTTTT 


AACTTCATAC 


TCAGTGAAAA 


GCAAAGATTA 


AACTAGGAAG 


CTAGCTGTAG 


9360 


GCTGCTCAAA 


GAACAGCTTT 


GAGGTTGTAG 


ATAAAACTTG 


TCAGGTCACC 


AACATATATA 


9420 


ATGTGAACCT 


GACGTGGTTT 


GAATAGATTT 


TAGAAGAGTA 


TGAGTCTGGA 


ACTTTTAATG 


9480 


GATAATGCAA 


GATTCCATAG 


AATGGGTAAG 


CTAGAGTTCT 


TATCTGAAGA 


GTTTGGGCAT 


9540 


AAACTTTTAC 


CTTTTCCTCC 


CTACTCATCT 


TAGTATAGAA 


AAGTGAATCT 


GAAATAGTAC 


9600 


ATAACTGCTT 


CTAAAACATT 


CTTATAAATT 


GATTTAAATT 


CTCAAATCAT 


ATTATTCAGT 


9660 


TCTTATTTCA 


TTTTGTTCTA 


CAATCCTGTT 


GAGAAGACAC 


GTGTTCATAT 


CAAAAAGGTA 


9720 


TTCGCAAGTT 


CCAATACCTT 


TTTACGAGGC 


TCTGTTCTCT 


TAVrrWO'lT 


TCAACTCACT 


9780 


ATATCTCCTA 


TCGTTCTAGT 


TCAGAAGGCT 


AGGCTATAAT 


TATGATTGAT 


m ft ^ m ft ft ft fVt^H 

AAGAAGTATC 


9840 


ATTCCAAGTA 


TTGGGAGTGA 


ATGTTTCAAA 


ATCATGGGTT 


TCTATAATGC 


TCAGGCTGGC 


9900 


ATTTGCTAGA 


CCGCCATCTT 


TACGAACAAG 


TGGTTCTTTA 


TACCCTAGGA 


GAGTACGAAG 


9960 


ACTGGCAGTA 


AGATTGGCGC 


CGTGTCCGAC 


AATTAGAATA 


CGTTCAGCTG 


GACTATCTTT 


10020 


TAATGATTTG 


ATAAATTGGA 


TGGTCCGTTG 


AGTTGTACTA 


TAGAGGGATT 


CGGCTCCGAA 


10080 


CATTCGAGTG 


TCAAATTGAG 


CAAGATTTGA 


ACGAAAAGCC 


TGGATTTGTT 


GCGGGTAAAT 


10140 


AGCTTCCAAC 


GTTGCAATTT 


TCAAACCTTC 


TAACTTCCCA 


AGTTGCCATT 


CACGGAGATT 


10200 


AGGAACGATT 


TCTAAAGAAC 


AGGGGGTATA 


GACTTGACTT 


TGGATAATCT 


CAGCAGATTT 


10260 


GACCGCTCGA 


GGTAAATCAC 


TTGAATAAAT 


CTGATCAAAA 


GCAATTTCCT 


TGAGATACTG 


10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380 

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 10500 

GGCCCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGCATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10620 

TTGGCTCCAT GAATGTAACC A G TT G TTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA ACTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10860 

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AACATCAAAA GGCTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGCAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 11220 

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 11260 

GGATATGGGC ACCGTCGGTA TCCGCATCG 11309 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTCTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGCC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTCGT TACCAACAAA AATCCTGAAG CAACTGCC7C TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 360 

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420 
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GGCATTGGAT 

4 4 wV»rt 4> 


AAAAGATTGA 


CTAAGATTTG 


GCAAACTAAT 


TGACTATCTG 


ATACGAGGGT 


480 


CACATCATCT 

WAwA A ^» A IVl 


GGCACCTCCA 

\J%VW^#*W^W> A \#W-ft 


CCTTTAGCTG 


TAAATCCTTC 


TTCTTGACCT 


GAGGTTGCAA 


540 


UW 1 1 IWw I w 


AAA TCOTGT A 


CAAATTCTCC 


CAAAGAAAGG 


GTCGTCCATT 


GTATAGCCAT 


600 


1 1 v* 1 lunwkw 


TTAGATAAGG 


TAACAAGATG 


CTCAACAATA 


TGCTCAAGAC 


GCAAACTTTC 


660 




A*rn*rf*T*AGAA 


n\3 4 V»«^ 4 W V» 4 4 


GAGCGCTTCT 


TCTTCAGCTG 


ACATCCCCTT 


720 


nnlliul I AwA 


UwAAAdA-WW A 


TAATCGAAGT 


AACTGGTGTC 


CTCAATTCAT 


GGGAGGCATT 


780 




Ow 1 AAA 1 i In 




AGTTCTAATC 

rWJ 4 4 W 4 rU% 4 Vi 


GTTGTTAAAT 

W 4 4 VP 4 4 4 *■ 4 


CATATAGCAA 


840 






#\ A 4VWU1\AA* 


GCTAAAAACC 


GGAACTGCTG 


TCACTTCTAA 


900 


» it m^* it ft ^*f*r* ft 

AATwAAGTCA 


CCCTCATuAA 


AwwwAw> A 1 Aw 


1 1L1 IU A 1 A A 


VICPTTi • J I" J I U P 

nAW\» A A A A A 


TTTCATCAAA 


960 


i^liluA 


At. 1 AAA 11 ww 


f* AAT ATCC AT 
uAA 1 A A w wA A 


www i i a unuw 


TCATCAACTG 

A W T\ A W rtrt » V» 


XACTTATCTC 


1020 


GCCL»l^»wAwA 


T. LwwAAAn 1 


An iVlAUOLAw 


A G AGOG ACTC 


CATAATAACA 
wn A a nr* w^v 


TCTGACCTTG 

A W A \##%>>W A A W 


1080 


AGCGCAAACT 


It A It It It ^f+^fr^^ 

AAAAACViTLC 


ft. f*T*P & 




AVl/^UV. 1 wwr\ 


A A V» AAA V#*JV*V» 


1140 


TACATCCTTG 


TATT GCTGAT 


f*f"**TV' & 

(.LlVl A 1 UUUn 


VinL 1 1 luv« 1 1 


TTTACGCCAC 

AAA /\WwWnw 


ACACATACTG 


1200 


AGCCAAAGAC 


TTTAAGTCTT 


CTTGCCCTTT 


TTCTAAAAAG 

A 1 W 1 fWV>^\9 


TATTCACTAC 


TGGTCAAGAG 


1260 


AGCTTGCTGC 


AAGGTCTCAA 


AAuwAAw a 1 v_ 


wwAA 1 AWW/\#\ 


AGGCAAAAGA 


GCCAGTAGCC 


1320 


ACCTACTCCC 


AAAGAAAGGG 


w i AUnAuAAn 


unuAL^UA l w 


w w A A A AW X W* 


TCCAAGTTAA 


1380 


TGCCATCCCT 


GCAATCAGAA 


TGAGf5C?TAAC 


& ACATTG 


ACTAGCCAAA 


ATTGAAGGTA 


1440 


GCGTTTCATC 


TATAACTCCT 


TG A. ft.f*T*P A*T A 

1 WAAw X 1 A 1 A 


APCATAACCC 

Awwr% 1 r%nww W 


CGAATCGTTC 


GAATaAATTG 


1S00 


AGGGGCTTTA 


GGATTGTCTT 


wAAl AS A A A 1 w. 


rCTCAACTTA 

ww A WnoW A A r% 


CCAATATGAA 


CCTCCACCAA 


1560 


ACGTGTTTCC 


TGCCCAAAGT 


CATACCCCCA 


GATACGTTCC 


AAAAGACGCT 


CTCTAGTCAG 


1620 


TGTCATGTTG 


GGATGTTTCA 


TAACATACAG 


CAAGAGTTCA 


AATTCTTTTG 


GGGTCAAACT 


1680 


CACTAACTTA TTCGCCTTGT 


AGACTTCATG 


ACGCTCAGGG 


TATACTTTCA 


AGGTCCCAAA 


1740 


TAGCCAAGAA 


TCCTCAGCGA 


TATTATCTGA 


ATCATCTCCT 


TCTTCTTCTC 


CTTTAGTTCG 


1800 


CCTGAGCACA 


CCCTTCACAC 


GCGCCAGCAA 


TTCTCTAGGG 


CTAAAAGGCT 


TGGTCAGGTA 


1860 


GTCATCAGCC 


CCTAATTCCA 


AGGCCAAAAC 


CTTATCAAAT 


TCATCACTTT 


TCGCAGAAAC 


1920 


CATCATAATT 


GGAGTTTTGA 


CGCCTTTGGC 


TCTCAGCCGC 


TTACAAACTT 


CCATGCCATC 


1980 


TAATTGTGGT 


AACATGATAT 


CAAGCAAGAT 


AAAATCAAAG 


GCTTCTGTTT 


CTCCCAAAGC 


2040 


TAAGGCCTTC 


CGTCCATTTG 


TCACCAATTG 


AGTAGAAAAG 


CCTTCCTTAC 


TTAAATGGTA 


2100 


GTCAAGCAAT 


TTCAGAATGT 


GTTCTTCATC 


ATCCACTAAT 


AAGACTTGTT 


TTGTCATCTA 


2160 
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TTATCTCCTA 


TTGGTAACAT 


TATAACACAA 


. TTATCAGAAA 


TCCTAACATT 


GCTAAATCAG 


2220 


ATTAAATTTG 


CCTATCAAGA 


. CTAGTATCTG 


GTCAAACCCT 


CAATCATCTC 


CTTGTGCTCT 


2280 


GGATAGGTCG 


CCAGTAGATC 


TACCCTTTCA 


AATAATTCAA 


AATCCTCAAA 


TTCAAAACCA 


2340 


GGAGCAACAA 


GACAAGAAAC 


CAGAGCATCA 


TCCTTATCAA 


CTGTTGATCC 


CCAAATAGTG 


2400 


CCCTTAGGAA 


CACAGTAGTG 


AAGTTGTTGC 


CCTTTGGATA 


TCTCCAGGCC 


TAAAGTGACT 


2460 


GCTTCGTAGT 


GACCATCTGC 


TGTAATCATG 


TGAACAGTAA 


GTGGGGATCC 


TGCATGAAAA 


2S20 


TACCAGATTT 


CATCTGCTGT 


CAATCGGTGA 


AAATGTGAAG 


GATTCGTTTC 


TTCTAATAAG 


2SB0 


AAATAAATAC 


TGCTATAAAG 


CGCCCTTCCC 


TTACCAGCAA 


GGTTTATAGT 


GTCTGAAGCT 


2640 


TTTTTTGTTT 


GTCTAAAATA 


GCCACCTTCA 


ATATGGGCAG 


CTAACTCTAG 


AG TTCTT ATC 


2700 


AAGTCTTCTT 


TATCCGTCGG 


AGCCAATCGG 


TTGAAGTAAC 


TCTTGTTCAA 


AGTGGTTTTA 


2760 


CGATTTCAAG 


AACTCCTCTC 


AGTTCTGAGG 


ACACGCTAAT 


GATTGATGCG 


ACGGAAGTAC 


2820 


AAATCAATCC 


CCCTAAAAAA 


AGAATTAGCG 


AATGATTCTG 


GTAAAAAAAA 


TCCCACCCTA 


2880 


TGAAGGCTCA 


AGCGATTGTC 


ACAAGTCAAG 


GGAGAATTCT 


TTCTTTGGAT 


ATCGCTGTCA 


2940 


ACTATTGTCA 


TGATATCAAG 


TTGTTCAAAA 


TGAGTCGCAG 


AAATATCGGA 


CAAGCTGGTA 


3000 


AAATCTTGGC 


TGACACTGGT 


TATCAAGCGC 


TCATGAACAT 


ATATCCTCAA 


GCACAAACTC 


3060 


CACGTAAATC 


CAGCAAACTC 


AAGCCACTAA 


CAGTTCAAGA 


TAAAGCCTAT 


AACCATCCGC 


3120 


TATCCAAGGA 


GAGAAGCAAG 


GTTGAGAACA 


TCTTTGCCAA 


AGTAAAAACC 


TTTAAAATGA 


3180 


TTTCAACAAC 


CTATCGAAAT 


CATCGTAAAC 


ACTTCGGATT 


ACGAATGAAT 


TTGATTGCTG 


3240 


GCATTATCAA 


TCATGAACTA 


GGATTCTAGT 


TTTGCAGGAA 


GTCTATTATT 


TGGTTAGGTG 


3300 


AATTAGTGAA 


GCCTTTAGGC 


AAGTGTCTCT 


GCTTACGACG 


TCATCGACTC 


TAAATCGATT 


3360 


ATATTTAGGG 


GTCATGACTA 


GTGAAGCAGT 


TAGCTAGTTC 


GCATATAAGC 


GGCTAGCGTC 


3420 


TAACAATTAG 


GAACTTTAGT 


TCCAATAACT 


TTAAGATTAC 


CACCTTTTAG 


GACATAAATC 


3480 


GATCATATTT 


ATGTCCTAAA 


ACTAGTGAAG 


CGCCTAGCCA 


AAGTCCGAAT 


AGGATTTiJGC 


3540 


GTTAGTTACT 


TAGATTGCTT 


TGCAATCAAG 


TAACTTTGGC 


GATTTACATC 


TTCTCTGGCG 


3600 


CTTCTACTCC 


AAGCAAGCGA 


AGGGCTTCTT 


TGAGAACGAC 


TGCGCTTGCG 


TAGCTGAGGG 


3660 


CTAGACGGCT 


GTCGCGTTCT 


GGGCTTTCAT 


CCAAOATACC 


TGTATGTGCA 


TAGTATTTGT 


3720 


TAAAGGATTG 


AGCCAGGCTA 


ATTGCAAATT 


TAGCAATGAT 


AGAAGGTTCA 


AAGTTATCTG 


3780 


CCGCACGCTT 


GATAATACGT 


GGGAAGTCTT 


GAATGAGTTT 


AATGATTTCC 


CAGCTTTCAG 


3840 


TATCATTCAA 


GCTATAGTTG 


CCAGCTGTTT 


CTGGTTTGAA 


ATCGGCTTTG 


CCTAAGATAG 


3900 


ATTGGATACG 


AGCGTAGGCA 


TATTGAACGT 


AAGGTCCAGT 


TTCACCCTCG 


AAGGATACCA 


3960 
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TAGCCTCTAG 


GTCGAAGTCG 


TATCCATTTG 


Wl ft f+^t V**T* 

TACCOTCGCT 


TTTGAGGTCA 


TAGAATTTAA 


4020 


TGGCTCCAAT 


CCCAACAGCA 


TGTGCTACTT 


{XiTCTTT GIT 


TTCTAGTTCA 


GGATTTTTAG 


4080 


CCTCGATTTG 


GACCTTGGCA 


CGGCTAACAG 


CCTCTGWAAC 


AGTAGGCTCT 


AGCAAGATGA 


J 1 Aft 


CATTCCCTTT 


ACGAGTACAG 


Av*"*~A n.1~i~w,l. 


CTTCTTTTGT 


AAOCAAACCA 


AAAGGAACCT 


4200 


GAGTAATGTC 


GTCACTCCAG 


TCGTAGCCCA 


TCTCTTGCAA 


GACAGCTTTG 


AGCTGTTTAA 




AGTGGGCAGA 


TTGTTCTTGA 


CCAACGACAT 


ft ft m% ft ffufWfi 

AGATAGATTT 


AGCAAATTCC TATTCGTTTT 




TACGGTAGAG 


GGCTGCAGCC 


AAGTCACGTG 


TGATATAGAG 


ACTTGCACCA 


^CACACTTCP 

ft ^mr%KJr\^m ft ft ^ ft 


4 JO U 


TGATGAGGGC 


TGGATGTTCA 


ATTCCATATT 


TCTCAAGATT 


**^ft^* W *W * A 


CCACCTTCTG 


4440 


AT TCAAGAAG 


TAGTCCTTTT 


TCAGAAAGAA 


TGTCTACAAC 


ft wwrt ft W W/* A V. 


ft ft n a ft ftw ft 


4500 


AGAAGGCTTC 


TCCGTTATAG 


CTGTCAAATT 


CAACCTTCAA 


WP & 
X 1 -i ft xj ft 


AGGgCCTTAA 


4560 


ATTCCACTAA 


ACTTTCATCG 


CGGAACCATT 


GCCAAAGAGC 




ft ft V* ft VWl ft 


4620 


TTTCAAGTTT 


ACGGAACCAT 


TCGCGCGCTT 


CTTCATCCAA 




ft ft ft ft ft ft 


4680 


CAGCGTTGAT 


GCGGACATAG 


AGTTTAAGGA 


GTTCATCCAT 


TGCATCAGCT 


TTTACACCTT 

ft ft ft T%\mr%\*^m ft ft 


4740 


CTTCGTCGCC 


CCATTTTTTG 


TAGGCAACAA 


TCAACATCCC 


AAATT GTTT A 


CCCCAGTCTC 


4 800 


CCAAATGGTT 


GACCTTGACC 


GTTTGATAAC 


CGATTTTTTG 


CAAAATATGT 


GACAAGCTAT 


4860 


CTCCGATAAC 


AGTTGAACGC 


AGGTGGCCAA 


TAGAAAATGG 


TTTAGCGATA 


T TCGG ACT AG 


4920 


ACATGTCGAT 


AACAACATTT 


ivttvittac 


CAATATTTTG 


GTCAGCATAC 


TGTTCTTTTT 


4980 


CAGTGGTAAC 


AGCTTGCAAT 


ACTTGAGCAG 


AAATGGCAGA 


TTTATCAAGG 


AAAAAGTTAA 


5040 


CGTAAGGTCC 


TGTTGCGACA 


ACTTTTTCAA 


AGGCTTGGCT 


GTTCATTTTT 


TCAGCCACTT 


3 LOU 


CAGCCGCAAT 


CATTTGTGGT 


GCTTTACGTT 


CGACTTTTGC 


AAGAGAAAAA 


GCAGGGAAAG 




CAATGTCTCC 


CATTTCTGAG 


TTTTTAGGGG 


TTTCCAGTAA 


CTTTAAAATA 


GCCTCTTGGT 


5220 


CCAGGCTATC 


AATGATGCTA 


GATAATTCGC 


TAGCAATCAA 


TTCTTTTGTA 


TTCATTAAGA 


5280 


GCTCCTTTTT 


GGACTTTTCT 


ACTATTTTAT 


CACAATTTTA 


AAGAAAGAAG 


AAAAAATTTT 


5340 


TGAAATCTCC 


TGTTTTTTTG 


GTATAATATG 


GTTATAAATA 


TAGTTATAAA 


TATGCACGCA 


5400 


AGAGGATTTT 


ATGAGAAAAA 


GACATCGTCA 


TCAGTTAATA 


AAAAAAATGA 


TTACTGAGGA 


5460 


CAAATTAACT 


ACACAAAAAG 


AAATTCAAGA 


TCGGTTGGAG 


GCGCACAATG 


TTTGTGTGAC 


5520 


GCAGACAACC 


TTGTCTCGTG 


ATTTGCGG 








5548 


(2) INFORMATION FOR SEQ ID NO: 110: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEO ID NO: 


110: 






TACCCCGTAG 


TCTTAGCAGA 


CACATCTAGC 


TCTGAAGATG 


CTTTAAACAT 


CTCTGATAAA 


60 


GAAAAAGTAG 


CAGAAAATAA 


AGAGAAACAT 


GAAAATATCC 


ATAGTGCTAT 


GGAAACTTCA 


120 


CAGGATTTTA 


AAGAGAAGAA 


AACAGCAGTC 


ATTAAGGAAA 


AAGAAGTTGT 


TAGTAAAAAT 


180 


CCTGTGATAG 


ACAATAACAC 


TAGCAATGAA 


GAAGCAAAAA 


TCAAAGAAGA 


AAATTCCAAT 


240 


AAATCCCAAG 


GAGATTATAC 


GGACTCATTT 


GTGAATAAAA 


ACACAGAAAA TCCCAAAAAA 


300 


GAAGATAAAG 


TTGTCTATAT 


TGCTGAATTT 


AAAGATAAAG 


AATCTGGAGA 


AAAACCAATC 


360 


AAGGAACTAT 


CCAGTCTTAA 


GAATACAAAA 


GTTTTATATA 


CTTATGATAG 


AATTTTTAAC 


420 


GGTAGTGCCA 


TAGAAACAAC 


TCCAGATAAC 


TTGGACAAAA 


TTAAACAAAT 


AGAAGGTATT 


480 


TCATCCGTTG 


AAAGGGCACA 


AAAAGTCCAA 


CCCATGATGA 


ATCATGCCAG 


AAAGGAAATT 


540 


GGACTTGAGG 


AAGCTATTGA 


TTACCTAAAC 


TCTATCAATG 


CTCCGTTTGG 


GAAAAATTTT 


600 


GATGGTAGAG 


GTATGGTCAT 


TTCAAATATC 


GATACTGGAA 


CAGATTATAG 


ACATAAGGCT 


660 


ATCAGAATCC 


ATGATGATGC 


CAAAGCCTCA 


ATGAGATTTA 


AAAAAGAAGA 


CTTAAAAGGC 


720 


ACTGATAAAA 


ATTATTGGTT 


GAGTGATAAA 


ATCCCTCATG 


CGTTCAATTA 


TTATAATGGT 


780 


GGCAAAATCA 


CTGTAGAAAA 


ATATGATGAT 


GGAAGGGATT 


ATTTTGACCC 


ACATGGGATG 


840 


CATATTGCAG 


GGATTCTTGC 


TGGAAATGAT 


ACTGAACAAG 


ACATCAAAAA 


CTTTAACGGC 


900 


ATAGATGGAA 


TTGCACCTAA 


TGCACAAATT 


TTCTCTTACA 


AAATGTATTC 


TGACGCAGGA 


960 


TCTGGCTTTG 


CGGGTGATGA 


AACAATGTTT 


CATGCTATTG 


AAGATTCTAT 


CAAACACAAC 


1020 


GTTGATGTTG 


TTTCGGTATC 


ATCTGGTTTT 


ACAGGAACAG 


GTCTTGTAGG 


TGAGAAATAT 


1080 


TGGCAAGCTA 


TTCGGGCATT 


AAGAAAAGCA 


CGCATTCCAA 


TGGTTGTCCC 


TACGGGTAAC 


1140 


TATGCGACTT 


CTGCTTCAAG 


TTCTTCATGG 


GATTTACTAG 


CAAATAATCA 


TCTGAAAATG 


1200 


ACCCACACTG 


GAAATGTAAC 


ACGAACTGCA 


GCACATGAAG 


ATGCGATAGC 


GGTCGCTTCT 


1260 


GCTAAAAATC 


AAACAGTTGA 


GTTTGATAAA 


GTTAACATAG 


CTGGAGAAAG 


TTTTAAATAC 


1320 


AGAAATATAG 


GGGCCTTTTT 


CGATAAGAGT 


AAAATCACAA 


CAAATGAAGA 


TGGAACAAAA 


1380 


GCTCCTAGTA 


AATTAAAATT 


TGTATATATA 


GGCAAGGGGC 


AAGACCAAGA 


TTTGATAGGT 


1440 


TTGGATCTTA 


GGGGCAAAAT 


TGCAGTAATG 


GATAGAATTT 


ATACAAAGGA 


TTTAAAAAAT 


1500 


GCTTTTAAAA 


AAGCTATGGA 


TAAGGGTGCA 


CGCGCCATTA 


TGGTTGTAAA 


TACTGTAAAT 


1560 



V 
I 
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1 Aw 1 AwAA X A 


uAuninnl l\j 




WwAVvw 4 A IlMI 


\xf\ & A IwWn, 




1 OA 


AW 1 AnAAu IV- 


nnb Iwi l l 1 V. 


nAi 1 IVnuun 


v» A 1 V* Al \iO l\J 


1 nnnu^. 1 Al\* 


uAALATwATT 


1 £QA 


& A fy*f**TV* ftfft 


aaaAAACTGA 


fcrTOA & & ft./ - ** 
AuTCAAAAbA 


ft ft*T»ft ftfft H JV 

AATAATAAAu 


AAGATTTTAA 


ft ft Ml ft VI ft HVA^ 

AGATAAATTG 


t T Jl ft 


GAGCAATACT 


ATCCAATTGA 


TATGGAAAGT 


TTTAATTCCA 


ft rf 1 * ft ft ft/**^»ft ft 

ACAAACCGAA 


TGTAGCTGAC 


1800 


CAAAAAGAGA 


TTGACTTTAA 


GTTTGCACCT 


GACACAGACA 


ft ft^ K ft j^gru'*#i» ft 

AAGAACTCTA 


W ft ft ft m m m fVI 

TAAAGAAGAT 


1860 


ATCATCCTTC 


CAGCAGGATC 


TACATCTTGG 


^ ******** m, Ik ft % 

GGGCCAAGAA 


TAGATTTACT 


JWtMV* ft ft\ ft ft 

TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAATATTAAA 


TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGC 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


TTGATTAGAC 


CGAAATTAAA 


GGAAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


TCTTAAGGGA 


2100 


gatgacaaaa 


TAGATCTTAC 


AAGTCTTACA 


AAAATTGCCC 


TACAAAATAC 


TGCGCGACCT 


2160 


ATGATGGATG 


CAAC'ITLT'IG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGACCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG 


AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT 


CTAAAGGTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG 


CATCAGCGAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT 


TCAATGCGCT 


TATAAATGTT 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 


CAGCGGGAAG 


2700 


ft ft ft ft m n. ft ft 

AAAATAAACT 


TCCAACCTTC 


TTTGTCGATG 


CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACCAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


CAAGATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


CAACATGGTA 


TAGATAAATT 


TAATCCACCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTGCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATCCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO: 111: 









<i> SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTC GATATTCTCT 60 

CCTATCTATA AGTAAACTTT TAGCAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTGGGAGA CTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TTAGATCCTC CAAAAGGCAG 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGTTAGAT AGCTTTATGT CTCCTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACCAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TTACTATATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 540 

GTATTTATTA AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 660 

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 120 

TT TTATACCA GCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGC TTTTATTTAG 780 

CTTTTCTAAG TACATGCTAA TCTTTTTAAT ATCACTAACT CTAGCTCTAT CTATTGTAAT 840 

GTATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAC AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATACT GATATTAAGC TAACTGGAGA 960 

AGAGGAATTT TGGATTAACA AATGGGATAA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020 

AAAACTTGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGCTCTAAA TATAAAAACA TTCGAACAAT TGACGTTAvlG 1140 

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 1260 

TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320 

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT CATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TCTTGCTATT GTTGGAGAAT CAGGTTCAGG 1440 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTC TATAGAAAGA TTTTTGGAGC 1560 
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ACTGTTACAA 


AATTLAAwUv. 


I AAVj I Ini w« 


TACCTTAACX 


CtnAATTTGA 


CATTTGGACA 


1620 


CTTTGTTTCA 


GATGAAGAAT 


l i\f\ I UAWnnn 


tptajla-ttoa. 

1 Wl /W\ 1 1 w\ 


ni 1 VAJ 4 V. 4 * 


GCAATGTACT 


1680 


TAAATCTTTA 


CCTCTTGGAT 


TACAGAlAAT 




OA/\V»0 1 AA 1 A 


ACXTTTCTGG 


1740 


AGGGCAGCAG 


CAAATGATAC 


TTTTAGCTCG 




1 \_A>*AAAl.L i I 


CGGTAGTTGT 


1800 


TTTGGACGAA 


GCAACAAGTA 


Gi A lAGATAA 


TTTATOTCAA 


Uv\Al lAUtA 


CTTCTTPACTT 


I860 


AAGTGAAATC 


GGTACCACTA 


m a m wmm « ft m 

AGATTTTAAT 


TGCCCATCGA 


CTAGATACTA 


TCAACTCTGC 




AGATAAGATC 


TTAGTAATGC 


ATAATGGTGA 


AAi iVlAGAG 


ATTwCuACWW 


ATAuAV*AALT 




TCTTGAACTA 


GGAGGCATTT 


ATAAGCAATT 


ft * ft ft ffl 

GTAl ICAAAT 


AATTAGTTTT 


TGAl 1 AAAAu 


">/\A ft 


GCTAAATTTA 


TGAAGATTAT 


GAAAAAAAAA 


TATTGGACTT 


TAGCGATATT 


ATTCTTTTGT 


2100 


TTGTTCAATA 


ATTCTGTTAC 


TGCTCAAGAA 


ATACCTAAAA 


ATCTTGAPGG 


ft ftftftftMtft ft ^^0Vft 

CAATATAACT 


2160 


CACACTCAGA 


CTAGCGAAAG 


TTTTTCTGAA 


TCTGATGAAA 


AACAGGTTGA 


CTATTCTAAT 


2220 


AAAAATCAAG 


AAGAAGTAGA 


CCAAAATAAA 


TTTCGTATTC 


AAATCGATAA 


GACAGAATTA 


2280 


TTTGTAACAA 


CAGATAAACA 


TTTAGAAAAA 


AACTGTTGTA 


AATTGGAACT 


TGAACCACAA 


2340 


ATAAATAACG 


ATATTGTTAA 


CTCTGAAAGT 


AATAATTTAC 


TAGGCGAAGA 


TAATTTAGAT 


2400 


AATAAAATTA 


ACGAAAATGT 


TTCTCATCTA 


CATAATAGAG 


GAGGAAATAT 


AGACCATGAC 


2460 


AAACATAACT 


TAGAATCGTC 


GATTGTAAGA 


AAATATGAAT 


GGGATATAGA 


#v* ft ft ft ^mwm ■ mmh 

TAAAGTTACT 




GGTGGAGGCG 


AAAGTTATAA 


ATTATATTCT 


AAAAGTAATT 


CTAAAGTTTC 


K * MltMMB «L MUM 

AATTGCTATT 


2S80 


TTAGATTCAG 


GAGTCGATTT 


ACAAAATACT 


GGATTACTGA 


ft ft ft ft Mk/^PWMM 

AAAATCTTTC 


AAATCACTCA 




AAAAACTATG 


TCCCCAATAA 


ft m Mk mh ft 

AGGATATTTA 


ft ft ft ft ^* ft 

GGAAAAGAGG 


fk Ik J* Ik fk 

AGGGAGAGGA 


fc^*/* K X«T»ft k*f*ft 

ACuAATAATA 


A /UU 


TCAGATATTC 


AAGATAGATT 


AGGTCATGGT 


ACGGCTGTTG 


TAGCTCAAAT 


IVlAuWluAl 




GACAATATTA 


ft JJL-^" J~~ ft. ft ft ft 

ATGGAGTAAA 


TCCTCACGTT 


AAT ATTAACU 


1 LTA1 AO AA 4 


Alii 1 AAVj 




TCGTCAGCTA 


GTCCAGATTG 


GATTGTAAAA 


GCAATTTTTG 


ATGCTGTACA 


TGATGGCAAT 


2880 


GATATTATCA 


ATCTTAGTAC 


TGGACAATAT 


TTAATGATTG 


ATGGAGAATA 


TGAGGACGGA 


2940 


ACAAATGATT 


TTGAAACATT 


TTTGAAGTAT 


AAAAAGGCTA 


TTGATTACGC 


GAATCAAAAA 


3000 


GGACTAATTA 


TAGTAGCTGC 


ATTAGGGAAT 


GACTCCCTAA 


ATGTATCAAA 


TCAGTCAGAT 


3060 


TTATTGAAAC 


TTATTAGTTC 


ACGCAAAAAA 


GTAAGAAAAC 


CAGGATTAGT 


AGTTGATGTT 


3120 


CCAAGTTATT 


TCTCATCTAC 


AATTTCGGTC 


GGAGGCATAG 


ATCGCTTACC 


TAATTTATCA 


3180 


GATTTTAGCA 


ATAAAGGGGA 


TTCTGATGCA 


ATATATGCCC 


CTGCAGGCTC 


AACATTATCT 


3240 


CTTTCAGAAT 


TAGGACTTAA 


TAACTTTATT 


AATGCAGAAA 


AATATAAAGA 


AGATTGGATT 


3300 



RNSDOOID: <WO 9818931A2 I > 



WO 98/18931 




PCT/US97/19588 



B08 



TTTTCGGCAA 


CACTAGGAGG 


ATATACGTAT 


CTTTATGGAA 


ACTCATTTGC 


TGCTCCTAAA 


3360 


GTTTCTGGTG 


CGATTGCAAT 


GATTATTGAT 


AAATACAAAT 


TAAAAGATCA 


GCCCTATAAT 


3420 




TAAAAAAATT 


CTGGAAGAAA 


CATTACCAGT 


AAAAAATGGT 


ATAAAAGTGT 


3480 


TAAATATACC 


AAACGTATTG 


AGATATGATT 


TGAATATGTT 


ACAATTAGAA 


TATAAAAATG 


3540 


AACAAAGTTG 


GGATAGTTTC 


ATAGATAATG 


TTAATTTAAT 


TGAGTTGGAA 


GAGAGAATTC 


3600 


AAACTACTAT 


TGGAATTAAA 


CAAATAAACA 


CACACAATAT 


TATTACTATT 


GCCCGAGAAG 


3660 


uw * #%W 1 W 1 W/\ 


#%sv\ * a n a a a n 


CCTAACACTT 


CAGAAAATAC 


ATATAATTCA 


TTACAAGTCA 


3720 


Ul li AU 4 1 VjO 


AU A AA 4 flV» A A 


W>A A A 1 


CTATGGTAAA 
w a#» a w a 


A nl AAA n A WVI 


GCTAAAAAAA 


n no 


f?TAAATfIAAA 

U A AAA 1 WWV\ 


ATAAAATTTG 

nlAAAAi 1 A w 


wnww w w a v» a \* 


AAAAACTAAG 


A w V. iAW^U A X 


CAAfTTAAAAT 
wrw%w a nnnn a 


Iflxn 
j aw 


uAU 1 WAnAAw 




1 A A w A AOU 


Ait w i w 


A A #\ X A, WW 1 WV»- 


f*TCAA£?Af?f?T 

W 1 U/V\LAWW A 




t^ul l\nunu 


vnAuAALn 1 X 


A A AAA X w, lul 


X X X X uAAUAt. 


UAU X X AALUA 


U 1 twUA X AX w 


J7OU 


I AL» I ^.Anun 1 


l» AA w> uAv. Aw*. 


AAA IviAb iw. 


X aUju X AViU X 


IA1 1 lAUAAi 


LAUU i UVMiA 


Aft)/! 


TCGTTTTGTG 


JAi An X ALuA 


CCCCTATTTC 


X iAL(.nuUUi 


TTTTTGAAAG 


A X WWAA 1 \.A X 


4UOU 


LAI lu i 1A1A 


ALAULULAA I 


V.AAL 1a#w ILL 


ALAVjILLAI 1 


TTGTTTTGGA 


1 AviAUuLAu X 


a 1 Art 


At»A<jAAt_TAC 


bl ILlUi X 1A 


ATtAA 1 1 Vj X w. 


TGATGCCtTAG 


OAGCTTATCW 


AuAbALAAMi 


A *i Art 


V.A1 1 UAAAAT 


TWiuTCTCALi 


AAA X bLAAAv. 


AUOl TALLAt 


AACTACATLa 


V. A XTATiwA 


A O £rt 


TAATATV.LAG 


AWAiAAWuTT 


wu X AAluv. r 


AuLAuuAuL X 


GTGCTTGGGA 


A IviUAAU X X W 


A rt 
4 JaU 


AATCTTGTTG 


TTTAACACTA 


TGAATAGGLi 


CTACTTTGAA 


OAATTTAuAW 


GTGCCATTTT 


4JDU 


1 n i LAnncuL 


AX X\»wAlK» a w, 


XCAGGTTCTT 


AuAAA I I. wA X 


CGCAGTTATC 


TCTTTGCTGA 


A A A A 


AV. X k*VA# Aw 4 A* 


TTTTTACTGG 


UA 4 A X AJ X A Wla 


VjAVj lulAl X X 


w i I vAwu X Au 


ACATAfifiACT 


4500 


^^^^ & 
1 >JV» A X i V_ 1 iA 


V» AW 1 iVj 1 1 Atv 


A W A A A /% W> X \A* 


XWaAX^XWX a 


TTACACTTAC 


ATGTCCAAAT 


4560 


GCAGAAACAA 


AACAAGATGT 


& TRPTTRT 

W. a u W. A A W A 


TTTGAAGGGA 
aaa wnnuuun 


GCTTAATATG 


ATTGAACTTA 


4620 


AACAGGTGAG 




GGAGAACGAG 


AGTTATTTTC 


GAATCTTTCA 


ATGACATTPTG 


4680 


AGGCTGGAAA 


AGTCTATGCC 


TTAATTGGTT 


CAAGTGGTAG 


CGGAAAAACA 


ACCTTGATGA 


4740 


ACATGATTGG 


GAAATTAGAA 


CCTTATGATG 


GGACGATTTT 


TTACCCACCT 


AAAGACTTGG 


4800 


(TV TTAT AA 


ATrAAGTGAT 


TTTTCCGTC 


ACGAATTGGG 


CTACCTCTTC 


CAGAACT^TG 


4860 


GCTTAATTGA 


AAACCAAAGT 


ATTGAAGAAA 


ACCTTAAGCT 


AGGTCTCATT 


GGTCAAAAGT 


4920 


TGAGTCGGTC 


GGAACAGCGG 


TTGAGGCAGA 


AGCAGGCTTT 


AGAACAGGTC 


GGCCTGGTTT 


4980 


ATCTTGACCT 


AGATAAGCGC 


ATCTTTGAGT 


TATCGGGCGG 


AGAATCGCAA 


CGGGTTGCCT 


5040 


TGGCAAAAAT 


TATCTTAAAG 


AATCCACCCT 


TTATTCTGGC 


AGATGAGCCA 


ACAGCTTCAA 


5100 



